An Infrared Small Target Detection Method for Unmanned Aerial Vehicles Integrating Adaptive Feature Focusing Diffusion and Edge Enhancement

  • Jiale Wang School of Computer Science and Technology, Taiyuan Normal University, Jinzhong 030619, Shanxi, China
Keywords: Infrared detection of unmanned aerial vehicles, YOLOv11, Adaptive feature fusion, Edge enhancement, Small target detection

Abstract

In the context of target detection under infrared conditions for drones, the common issues of high missed detection rates, low signal-to-noise ratio, and blurred edge features for small targets are prevalent. To address these challenges, this paper proposes an improved detection algorithm based on YOLOv11n. First, a Dynamic Multi-Scale Feature Fusion and Adaptive Weighting approach is employed to design an Adaptive Focused Diffusion Pyramid Network (AFDPN), which enhances the feature expression and transmission capability of shallow small targets, thereby reducing the loss of detailed information. Then, combined with an Edge Enhancement (EE) module, the model improves the extraction of infrared small target edge features through low-frequency suppression and high-frequency enhancement strategies. Experimental results on the publicly available HIT-UAV dataset show that the improved model achieves a 3.8% increase in average detection accuracy and a 3.0% improvement in recall rate compared to YOLOv11n, with a computational cost of only 9.1 GFLOPS. In comparison experiments, the detection accuracy and model size balance achieved the optimal solution, meeting the lightweight deployment requirements for drone-based systems. This method provides a high-precision, lightweight solution for small target detection in drone-based infrared imagery.

References

Wu X, Li W, Hong D, et al., 2021, Deep Learning for Unmanned Aerial Vehicle-Based Object Detection and Tracking: A Survey. IEEE Geoscience and Remote Sensing Magazine, 10(1): 91–124.

Qiu Z, Bai H, Chen T, 2023, Special Vehicle Detection from UAV Perspective via YOLO-GNS Based Deep Learning Network. Drones, 7(2) 117.

Terven J, Córdova-Esparza DM, Romero-González JA, 2023, A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS. Machine Learning and Knowledge Extraction, 5(4): 1680–1716.

Khanam R, Hussain M, 2024, YOLOv11: An Overview of the Key Architectural Enhancements. arXiv preprint arXiv:2410.17725.

Xiangming Q, Yiran Z, 2024, Fire Detection Algorithm with Multi-Scale Feature Focus and Diffusion. 2024 International Conference on Electronics and Devices, Computational Science (ICEDCS), IEEE: 71–76.

Xu S, Zheng S, Xu W, et al., 2024, HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection. 2024 IEEE International Conference on Multimedia and Expo (ICME), IEEE: 1–6.

Liu W, Lu H, Fu H, et al., 2023, Learning to Upsample by Learning to Sample. Proceedings of the IEEE/CVF International Conference on Computer Vision, 6027–6037.

Li S, 2023, Salient Object Detection via High-Frequency Edge Detail Enhancement. International Conference on Electronic Information Engineering and Computer Science (EIECS 2022), SPIE, 12602: 293–300.

Suo J, Wang T, Zhang X, et al., 2023, HIT-UAV: A High-Altitude Infrared Thermal Dataset for Unmanned Aerial Vehicle-Based Object Detection. Scientific Data, 10(1): 227.

Published
2025-10-31