DSOD-YOLO: A lightweight dual feature extraction method for small target detection

IF 2.9 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing Pub Date : 2025-04-28 DOI:10.1016/j.dsp.2025.105268

Yuan Nie , Huicheng Lai , Guxue Gao

{"title":"DSOD-YOLO: A lightweight dual feature extraction method for small target detection","authors":"Yuan Nie , Huicheng Lai , Guxue Gao","doi":"10.1016/j.dsp.2025.105268","DOIUrl":null,"url":null,"abstract":"<div><div>As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSOD-YOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"164 ","pages":"Article 105268"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425002908","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

As object detection techniques advance, large-object detection has become less challenging. However, small-object detection remains a significant hurdle. DSOD-YOLO is a lightweight small-object detection network based on YOLOv8, designed to balance detection accuracy with model efficiency. To accurately detect small objects, the network employs a dual-backbone feature extraction architecture, which enhances the extraction of small-object details. This addresses the issue of detail loss in deep models. Additionally, a Channel-Scale Adaptive Module (FASD) is introduced to adaptively select feature channels and image sizes based on the required feature information. This helps mitigate the problem of sparse feature information and information loss during feature propagation for small objects. To strengthen contextual information and further improve small-object detection, a lightweight Context and Spatial Feature Calibration Network (CSFCN) is integrated. CSFCN performs context correction and spatial feature calibration through its two core modules, Context Feature Calibration (CFC) and Spatial Feature Calibration (SFC), based on pixel context similarity and channel dimensions, respectively. To reduce model complexity, the network undergoes a pruning process, achieving lightweight small-object detection. Furthermore, knowledge distillation is employed, with a large model acting as a teacher network to guide DSOD-YOLO, leading to further accuracy improvements. Experimental results demonstrate that DSOD-YOLO outperforms state-of-the-art algorithms like YOLOv9 and YOLOv10 on multiple small-object datasets. Additionally, a new small-object dataset (SmallDark) is created for low-light conditions, and the proposed method surpasses existing algorithms on this custom dataset.

查看原文本刊更多论文

DSOD-YOLO：用于小目标检测的轻量级双特征提取方法

随着目标检测技术的进步，大目标检测变得不那么具有挑战性。然而，小目标检测仍然是一个重大障碍。DSOD-YOLO是基于YOLOv8的轻量级小目标检测网络，旨在平衡检测精度和模型效率。为了准确检测小目标，该网络采用双主干特征提取架构，增强了对小目标细节的提取。这解决了深度模型中细节丢失的问题。此外，引入通道尺度自适应模块（FASD），根据需要的特征信息自适应选择特征通道和图像尺寸。这有助于缓解特征信息稀疏和小对象特征传播过程中的信息丢失问题。为了增强上下文信息，进一步提高小目标检测能力，本文集成了一个轻量级的上下文和空间特征校准网络（CSFCN）。CSFCN通过上下文特征校准（context feature calibration， CFC）和空间特征校准（spatial feature calibration， SFC）两个核心模块，分别基于像素上下文相似度和通道维度进行上下文校正和空间特征校准。为了降低模型复杂度，网络进行了修剪过程，实现了轻量级的小目标检测。在此基础上，采用知识蒸馏的方法，利用一个大型模型作为教师网络来指导DSOD-YOLO，进一步提高准确率。实验结果表明，在多个小目标数据集上，DSOD-YOLO优于YOLOv9和YOLOv10等最先进的算法。此外，在低光照条件下创建了一个新的小目标数据集（SmallDark），该方法超越了该自定义数据集上的现有算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Digital Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

17.20%

发文量

435

审稿时长

66 days

期刊介绍： Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,