

浏览全部资源
扫码关注微信
1.南京邮电大学自动化学院、人工智能学院,江苏 南京 210023
2.南京航空航天大学航空航天结构力学及控制全国重点实验室,江苏 南京 210016
Received:15 December 2023,
Revised:2024-05-31,
Published:10 September 2024
移动端阅览
刘园,赵静,蒋国平等.融合多尺度和上下文的无人机救援小目标检测算法[J].物联网学报,2024,08(03):146-156.
LIU Yuan,ZHAO Jing,JIANG Guoping,et al.Fusion of multi-scale and context for small target detection algorithm of unmanned aerial vehicle rescue[J].Chinese Journal on Internet of Things,2024,08(03):146-156.
刘园,赵静,蒋国平等.融合多尺度和上下文的无人机救援小目标检测算法[J].物联网学报,2024,08(03):146-156. DOI: 10.11959/j.issn.2096-3750.2024.00390.
LIU Yuan,ZHAO Jing,JIANG Guoping,et al.Fusion of multi-scale and context for small target detection algorithm of unmanned aerial vehicle rescue[J].Chinese Journal on Internet of Things,2024,08(03):146-156. DOI: 10.11959/j.issn.2096-3750.2024.00390.
针对无人机(UAV
unmanned aerial vehicle)图像中小目标所包含的特征信息少,导致模型检测精度不足的问题,面向无人机海面救援任务提出了一种融合多尺度和上下文信息的图像小目标检测算法。首先,针对小目标特征信息设计上下文增强模块,通过增强特征层的上下文信息,有效地增加了模型对小目标的处理能力。其次,为提高模型的鲁棒性,设计了空间注意力模块加强对重要特征的学习。最后,使用平衡L1损失函数优化基线算法的损失函数,加强了模型检测时的稳定性。基于Tiny-Person数据集,与基准算法进行大量实验对比,所提算法在AP50_tiny上提高了2.06%,一定程度上提高了对海面小目标的检测性能,对救援行动具有积极影响。
Aiming at the problem of insufficient feature information contained in small targets under unmanned aerial vehicle (UAV) images that led to insufficient detection accuracy of the model
a small target detection algorithm for UAV sea rescue images that integrated multi-scale and contextual information was proposed. Firstly
context enhancement module was designed for small target feature information
which effectively enhanced the ability of the model to process small targets by enhancing the contextual information of the feature layer. Secondly
to improve the robustness of the model
spatial attention module was designed to enhance the learning of important features. Finally
balance L1 loss was used to optimize the loss function of the baseline algorithm and enhance the stability of the model during the process of detection. Based on the Tiny-Person dataset
through extensive experimental comparison with the benchmark algorithm
the proposed algorithm improves the detection performance of small targets on the sea surface by 2.06% on AP50_tiny
which has a positive impact on rescue operations.
CHEN C Y , LIU M Y , TUZEL O , et al . R-CNN for small object detection [C ] // Proceedings of the Asian Conference on Computer Vision . Cham : Springer , 2017 : 214 - 230 .
LIN T Y , MAIRE M , BELONGIE S , et al . Microsoft COCO: common objects in context [C ] // Proceedings of the European Conference on Computer Vision . Cham : Springer , 2014 : 740 - 755 .
YU X H , GONG Y Q , JIANG N , et al . Scale match for tiny person detection [C ] // Proceedings of the 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) . Piscataway : IEEE Press , 2020 : 1246 - 1254 .
魏泽发 , 崔华 . 基于SqueezeNet卷积神经网络的车辆检测 [J ] . 物联网学报 , 2020 , 4 ( 3 ): 120 - 125 .
WEI Z F , CUI H . Vehicle detection based on SqueezeNet convolutional neural network [J ] . Chinese Journal on Internet of Things , 2020 , 4 ( 3 ): 120 - 125 .
LIU W , ANGUELOV D , ERHAN D , et al . SSD: single shot MultiBox detector [C ] // Proceedings of the European Conference on Computer Vision . Cham : Springer , 2016 : 21 - 37 .
REDMON J , DIVVALA S , GIRSHICK R , et al . You only look once: unified, real-time object detection [C ] // Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE Press , 2016 : 779 - 788 .
刘洋 , 战荫伟 . 基于深度学习的小目标检测算法综述 [J ] . 计算机工程与应用 , 2021 , 57 ( 2 ): 37 - 48 .
LIU Y , ZHAN Y W . Survey of small object detection algorithms based on deep learning [J ] . Computer Engineering and Applications , 2021 , 57 ( 2 ): 37 - 48 .
GIRSHICK R . Fast R-CNN [C ] // Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV) . Piscataway : IEEE Press , 2015 : 1440 - 1448 .
REN S Q , HE K M , GIRSHICK R , et al . Faster R-CNN: towards real-time object detection with region proposal networks [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017 , 39 ( 6 ): 1137 - 1149 .
GIRSHICK R , DONAHUE J , DARRELL T , et al . Rich feature hierarchies for accurate object detection and semantic segmentation [C ] // Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE Press , 2014 : 580 - 587 .
HE K M , GKIOXARI G , DOLLÁR P , et al . Mask R-CNN [C ] // Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV) . Piscataway : IEEE Press , 2017 : 2980 - 2988 .
闾海庆 , 雷远华 , 王静 , 等 . 基于改进Libra-RCNN的输电线路绝缘子识别 [J ] . 湖南电力 , 2022 , 42 ( 2 ): 44 - 49 .
LYU H Q , LEI Y H , WANG J , et al . Transmission line insulator identification based on improved libra-RCNN [J ] . Hunan Electric Power , 2022 , 42 ( 2 ): 44 - 49 .
LIN T Y , GOYAL P , GIRSHICK R , et al . Focal loss for dense object detection [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2020 , 42 ( 2 ): 318 - 327 .
HE K M , ZHANG X Y , REN S Q , et al . Spatial pyramid pooling in deep convolutional networks for visual recognition [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2015 , 37 ( 9 ): 1904 - 1916 .
LIU S , QI L , QIN H F , et al . Path aggregation network for instance segmentation [C ] // Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE Press , 2018 : 8759 - 8768 .
CAI Z W , VASCONCELOS N . Cascade R-CNN: delving into high quality object detection [C ] // Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE Press , 2018 : 6154 - 6162 .
SANG H W , WANG Q H , ZHAO Y . Multi-scale context attention network for stereo matching [J ] . IEEE Access , 2019 , 7 : 15152 - 15161 .
WANG X , LV R R , ZHAO Y , et al . Multi-scale context aggregation network with attention-guided for crowd counting [C ] // Proceedings of the 2020 15th IEEE International Conference on Signal Processing (ICSP) . Piscataway : IEEE Press , 2020 : 240 - 245 .
吕晓华 , 魏铭辰 , 刘立波 . 基于位置可学习视觉中心机制的零售商品检测方法 [J ] . 物联网学报 , 2023 , 7 ( 4 ): 142 - 152 .
LYU X H , WEI M C , LIU L B . Retail commodity detection method based on location learnable visual center mechanism [J ] . Chinese Journal on Internet of Things , 2023 , 7 ( 4 ): 142 - 152 .
HONG M B , LI S W , YANG Y C , et al . SSPNet: scale selection pyramid network for tiny person detection from UAV images [J ] . IEEE Geoscience and Remote Sensing Letters , 2021 , 19 : 8018505 .
HE K M , ZHANG X Y , REN S Q , et al . Deep residual learning for image recognition [C ] // Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE Press , 2016 : 770 - 778 .
林椿珉 , 曾烈康 , 陈旭 . 边缘智能驱动的高能效无人机自主导航算法研究 [J ] . 物联网学报 , 2021 , 5 ( 2 ): 87 - 96 .
LIN C M , ZENG L K , CHEN X . Research on power efficient autonomous UAV navigation algorithm: an edge intelligence driven approach [J ] . Chinese Journal on Internet of Things , 2021 , 5 ( 2 ): 87 - 96 .
王正文 , 宋慧慧 , 樊佳庆 , 等 . 基于语义引导特征聚合的显著性目标检测网络 [J ] . 自动化学报 , 2023 , 49 ( 11 ): 2386 - 2395 .
WANG Z W , SONG H H , FAN J Q , et al . Semantic guided feature aggregation network for salient object detection [J ] . Acta Automatica Sinica , 2023 , 49 ( 11 ): 2386 - 2395 .
姚红革 , 张玮 , 杨浩琪 , 等 . 深度强化学习联合回归目标定位 [J ] . 自动化学报 , 2023 , 49 ( 5 ): 1089 - 1098 .
YAO H G , ZHANG W , YANG H Q , et al . Union regression object localization based on deep reinforcement learning [J ] . Acta Automatica Sinica , 2023 , 49 ( 5 ): 1089 - 1098 .
杜鹏 , 宋永红 , 张鑫瑶 . 基于自注意力模态融合网络的跨模态行人再识别方法研究 [J ] . 自动化学报 , 2022 , 48 ( 6 ): 1457 - 1468 .
DU P , SONG Y H , ZHANG X Y . Self-attention cross-modality fusion network for cross-modality person re-identification [J ] . Acta Automatica Sinica , 2022 , 48 ( 6 ): 1457 - 1468 .
PANG J M , CHEN K , SHI J P , et al . Libra R-CNN: towards balanced learning for object detection [C ] // Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE Press , 2019 : 821 - 830 .
潘翔 , 陈前斌 , 黄昂 , 等 . 基于改进YOLOX的无人机航拍图像小目标检测算法 [J ] . 南京邮电大学学报(自然科学版) , 2024 , 44 ( 1 ): 90 - 100
PAN X , CHEN Q B , HUANG A , et al . A small target detection algorithm of UAV aerial photography images based on improved YOLOX [J ] . Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition) , 2024 , 44 ( 1 ): 90 - 100 .
陈旭 , 彭冬亮 , 谷雨 . 基于改进YOLOv5s的无人机图像实时目标检测 [J ] . 光电工程 , 2022 , 49 ( 3 ): 69 - 81 .
CHEN X , PENG D L , GU Y . Real-time object detection for UAV images based on improved YOLOv5s [J ] . Opto-Electronic Engineering , 2022 , 49 ( 3 ): 69 - 81 .
宁欣 , 田伟娟 , 于丽娜 , 等 . 面向小目标和遮挡目标检测的脑启发CIRA-DETR全推理方法 [J ] . 计算机学报 , 2022 , 45 ( 10 ): 2080 - 2092 .
NING X , TIAN W J , YU L N , et al . Brain-inspired CIRA-DETR full inference model for small and occluded object detection [J ] . Chinese Journal of Computers , 2022 , 45 ( 10 ): 2080 - 2092 .
廖龙杰 , 吕文涛 , 叶冬 , 等 . 基于深度学习的小目标检测算法研究进展 [J ] . 浙江理工大学学报(自然科学) , 2023 , 49 ( 3 ): 331 - 343 .
LIAO L J , LYU W T , YE D , et al . Research progress of small target detection based on deep learning [J ] . Journal of Zhejiang Sci-Tech University (Natural Sciences) , 2023 , 49 ( 3 ): 331 - 343 .
CAO J , SU Z , YU L Y , et al . Softmax cross entropy loss with unbiased decision boundary for image classification [C ] // Proceedings of the 2018 Chinese Automation Congress (CAC) . Xi’an, China , 2018 : 2028 - 2032 .
邵香迎 , 郭颖 , 王友伟 . AF-RetinaNet: 一种基于自适应融合与特征细化的微小行人检测算法 [J ] . 控制与决策 , 2024 , 39 ( 3 ): 939 - 946 .
SHAO X Y , GUO Y , WANG Y W . AF-RetinaNet: a tiny person detection algorithm based on adaptive fusion and feature refinement [J ] . Control and Decision , 2024 , 39 ( 3 ): 939 - 946 .
GONG Y Q , YU X H , DING Y , et al . Effective fusion factor in FPN for tiny object detection [C ] // Proceedings of the 2021 IEEE Winter Conference on Applications of Computer Vision (WACV) . Piscataway : IEEE Press , 2021 : 1159 - 1167 .
张源川 . 深度卷积神经网络下的小目标检测方法 [D ] . 重庆 : 重庆邮电大学 , 2022 .
ZHANG Y C . Small object detection method based on deep convolutional neural network [D ] . Chongqing : Chongqing University of Posts and Telecommunications , 2022 .
GUO G Q , CHEN P F , YU X H , et al . Save the tiny, save the all: hierarchical activation network for tiny object detection [J ] . IEEE Transactions on Circuits and Systems for Video Technology , 2024 , 34 ( 1 ): 221 - 234 .
0
Views
211
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621