{"title":"DSOD-YOLO:用于小目标检测的轻量级双特征提取方法","authors":"Yuan Nie , Huicheng Lai , Guxue Gao","doi":"10.1016/j.dsp.2025.105268","DOIUrl":null,"url":null,"abstract":"<div><div>As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSOD-YOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"164 ","pages":"Article 105268"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DSOD-YOLO: A lightweight dual feature extraction method for small target detection\",\"authors\":\"Yuan Nie , Huicheng Lai , Guxue Gao\",\"doi\":\"10.1016/j.dsp.2025.105268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSOD-YOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"164 \",\"pages\":\"Article 105268\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425002908\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425002908","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
DSOD-YOLO: A lightweight dual feature extraction method for small target detection
As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSOD-YOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,