DD-YOLO：一种用于模糊车辆目标检测的双通道双路径YOLO网络

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing Pub Date : 2025-08-22 DOI:10.1016/j.dsp.2025.105565

Chaoqun Duan , Yuhan Guo , Xuelian Duan , Guoqiang Li , Bo Sheng

{"title":"DD-YOLO：一种用于模糊车辆目标检测的双通道双路径YOLO网络","authors":"Chaoqun Duan , Yuhan Guo , Xuelian Duan , Guoqiang Li , Bo Sheng","doi":"10.1016/j.dsp.2025.105565","DOIUrl":null,"url":null,"abstract":"<div><div>The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"168 ","pages":"Article 105565"},"PeriodicalIF":3.0000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DD-YOLO: A dual-channel dual-path YOLO network for target detection of blurred vehicles\",\"authors\":\"Chaoqun Duan , Yuhan Guo , Xuelian Duan , Guoqiang Li , Bo Sheng\",\"doi\":\"10.1016/j.dsp.2025.105565\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"168 \",\"pages\":\"Article 105565\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425005871\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425005871","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

模糊车辆目标的检测对于保持交通效率和确保道路安全至关重要。尽管存在各种基于you-only-look-once （YOLO）的模型，但很少有研究关注真实交通条件下的模糊车辆检测。为了填补这一空白，我们提出了一种新的双通道双路径YOLO （DD-YOLO）网络，该网络具有双通道特征提取和双路径特征融合。该网络由混合池金字塔（HPP）模块、双通道特征提取骨干和双路径融合池颈（dpfp -颈）组成。在DD-YOLO网络中，我们首先引入HPP模块，通过结合最大池和平均池来减少对关键特征的依赖，并结合背景信息来减少误报。随后，设计了双通道主干网，通过集成卷积块注意模块（CBAM）、简单无参数注意模块（SIMAM）、标准卷积和鬼卷积等多种卷积和注意机制，提高DD-YOLO对模糊车辆目标的灵敏度，以捕获更丰富的特征，提高召回率。最后，开发了dpfp颈来融合不同的信息，并在网络深度上扩展接受域，在准确率和召回率之间提供了令人满意的平衡。在BDD100K和KITTI数据集上的实验表明，DD-YOLO的检测精度分别提高了4.9%和4.0%，[email protected]比基线分别提高了2.4%和2.7%，证明了其在检测模糊车辆目标方面的有效性和实时性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

DD-YOLO: A dual-channel dual-path YOLO network for target detection of blurred vehicles

The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Digital Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

17.20%

发文量

435

审稿时长

66 days

期刊介绍： Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,