{"title":"DI-YOLOv5: An Improved Dual-Wavelet-Based YOLOv5 for Dense Small Object Detection","authors":"Zi-Xin Li;Yu-Long Wang;Fei Wang","doi":"10.1109/JAS.2024.124368","DOIUrl":null,"url":null,"abstract":"This letter focuses on the fact that small objects with few pixels disappear in feature maps with large receptive fields, as the network deepens, in object detection tasks. Therefore, the detection of dense small objects is challenging. A DI-YOLOv5 object detection algorithm is proposed. Specifically, a dual-wavelet convolution module (DWCM), which contains DWT_Conv and IWT_Conv, is proposed to reduce the loss of feature map information while obtaining feature maps with a large receptive field. The DWT _ Conv and IWT _ Conv can be used as replacements for downsampling and upsampling operations. Moreover, in the process of information transmission to the deep layer, a CSPCoA module is proposed to further capture the location information and information dependencies in different spatial directions. DWCM and CSPCoA are single, generic, plug-and-play units. We propose DI-YOLOv5 with YOLOv5 [1] as the baseline, and extensively evaluate the performance of these two modules on small object detection. Experiments demonstrate that DI-YOLOv5 can effectively improve the accuracy of object detection.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 2","pages":"457-459"},"PeriodicalIF":15.3000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10846924","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ieee-Caa Journal of Automatica Sinica","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10846924/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This letter focuses on the fact that small objects with few pixels disappear in feature maps with large receptive fields, as the network deepens, in object detection tasks. Therefore, the detection of dense small objects is challenging. A DI-YOLOv5 object detection algorithm is proposed. Specifically, a dual-wavelet convolution module (DWCM), which contains DWT_Conv and IWT_Conv, is proposed to reduce the loss of feature map information while obtaining feature maps with a large receptive field. The DWT _ Conv and IWT _ Conv can be used as replacements for downsampling and upsampling operations. Moreover, in the process of information transmission to the deep layer, a CSPCoA module is proposed to further capture the location information and information dependencies in different spatial directions. DWCM and CSPCoA are single, generic, plug-and-play units. We propose DI-YOLOv5 with YOLOv5 [1] as the baseline, and extensively evaluate the performance of these two modules on small object detection. Experiments demonstrate that DI-YOLOv5 can effectively improve the accuracy of object detection.
期刊介绍:
The IEEE/CAA Journal of Automatica Sinica is a reputable journal that publishes high-quality papers in English on original theoretical/experimental research and development in the field of automation. The journal covers a wide range of topics including automatic control, artificial intelligence and intelligent control, systems theory and engineering, pattern recognition and intelligent systems, automation engineering and applications, information processing and information systems, network-based automation, robotics, sensing and measurement, and navigation, guidance, and control.
Additionally, the journal is abstracted/indexed in several prominent databases including SCIE (Science Citation Index Expanded), EI (Engineering Index), Inspec, Scopus, SCImago, DBLP, CNKI (China National Knowledge Infrastructure), CSCD (Chinese Science Citation Database), and IEEE Xplore.