Gaofeng Zhu;Zhixue Wang;Fenghua Zhu;Gang Xiong;Zheng Li
{"title":"基于混合控制和特征融合的小物体识别算法","authors":"Gaofeng Zhu;Zhixue Wang;Fenghua Zhu;Gang Xiong;Zheng Li","doi":"10.1109/JRFID.2024.3384483","DOIUrl":null,"url":null,"abstract":"Drone detection plays a key role in various fields, but from the perspective of drones, factors such as the size of the target, interference from different backgrounds, and lighting affect the detection effect, which can easily lead to missed detections and false detections. To address this problem, this paper proposes a small target detection algorithm. First, the hybrid control of attention mechanism and a convolutional module (HCAC) are used to effectively extract contextual details of targets of different scales, directions, and shapes, while relative position encoding is used to associate targets with position information. Secondly, in view of the small size characteristics of small targets, a high-resolution detection branch is introduced, the large target detection head and its redundant network layers are pruned, and a multi-level weighted feature fusion network (MWFN) is used for multi-dimensional fusion. Finally, the WIoU loss is used as a bounding box regression loss, combined with a dynamic non-monotonic focusing mechanism, to evaluate the quality of anchor boxes so that the detector can handle anchor boxes of different qualities, thus improving the overall performance. Experiments were conducted on the UAV aerial photography data set VisDrone2019. The results showed that the accuracy of P increased by 9.0% and MAP by 9.8%, with higher detection results.","PeriodicalId":73291,"journal":{"name":"IEEE journal of radio frequency identification","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Small Object Recognition Algorithm Based on Hybrid Control and Feature Fusion\",\"authors\":\"Gaofeng Zhu;Zhixue Wang;Fenghua Zhu;Gang Xiong;Zheng Li\",\"doi\":\"10.1109/JRFID.2024.3384483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Drone detection plays a key role in various fields, but from the perspective of drones, factors such as the size of the target, interference from different backgrounds, and lighting affect the detection effect, which can easily lead to missed detections and false detections. To address this problem, this paper proposes a small target detection algorithm. First, the hybrid control of attention mechanism and a convolutional module (HCAC) are used to effectively extract contextual details of targets of different scales, directions, and shapes, while relative position encoding is used to associate targets with position information. Secondly, in view of the small size characteristics of small targets, a high-resolution detection branch is introduced, the large target detection head and its redundant network layers are pruned, and a multi-level weighted feature fusion network (MWFN) is used for multi-dimensional fusion. Finally, the WIoU loss is used as a bounding box regression loss, combined with a dynamic non-monotonic focusing mechanism, to evaluate the quality of anchor boxes so that the detector can handle anchor boxes of different qualities, thus improving the overall performance. Experiments were conducted on the UAV aerial photography data set VisDrone2019. The results showed that the accuracy of P increased by 9.0% and MAP by 9.8%, with higher detection results.\",\"PeriodicalId\":73291,\"journal\":{\"name\":\"IEEE journal of radio frequency identification\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE journal of radio frequency identification\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10490089/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE journal of radio frequency identification","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10490089/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Small Object Recognition Algorithm Based on Hybrid Control and Feature Fusion
Drone detection plays a key role in various fields, but from the perspective of drones, factors such as the size of the target, interference from different backgrounds, and lighting affect the detection effect, which can easily lead to missed detections and false detections. To address this problem, this paper proposes a small target detection algorithm. First, the hybrid control of attention mechanism and a convolutional module (HCAC) are used to effectively extract contextual details of targets of different scales, directions, and shapes, while relative position encoding is used to associate targets with position information. Secondly, in view of the small size characteristics of small targets, a high-resolution detection branch is introduced, the large target detection head and its redundant network layers are pruned, and a multi-level weighted feature fusion network (MWFN) is used for multi-dimensional fusion. Finally, the WIoU loss is used as a bounding box regression loss, combined with a dynamic non-monotonic focusing mechanism, to evaluate the quality of anchor boxes so that the detector can handle anchor boxes of different qualities, thus improving the overall performance. Experiments were conducted on the UAV aerial photography data set VisDrone2019. The results showed that the accuracy of P increased by 9.0% and MAP by 9.8%, with higher detection results.