DD-YOLO:一种用于模糊车辆目标检测的双通道双路径YOLO网络

IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Chaoqun Duan , Yuhan Guo , Xuelian Duan , Guoqiang Li , Bo Sheng
{"title":"DD-YOLO:一种用于模糊车辆目标检测的双通道双路径YOLO网络","authors":"Chaoqun Duan ,&nbsp;Yuhan Guo ,&nbsp;Xuelian Duan ,&nbsp;Guoqiang Li ,&nbsp;Bo Sheng","doi":"10.1016/j.dsp.2025.105565","DOIUrl":null,"url":null,"abstract":"<div><div>The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"168 ","pages":"Article 105565"},"PeriodicalIF":3.0000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DD-YOLO: A dual-channel dual-path YOLO network for target detection of blurred vehicles\",\"authors\":\"Chaoqun Duan ,&nbsp;Yuhan Guo ,&nbsp;Xuelian Duan ,&nbsp;Guoqiang Li ,&nbsp;Bo Sheng\",\"doi\":\"10.1016/j.dsp.2025.105565\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"168 \",\"pages\":\"Article 105565\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425005871\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425005871","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

模糊车辆目标的检测对于保持交通效率和确保道路安全至关重要。尽管存在各种基于you-only-look-once (YOLO)的模型,但很少有研究关注真实交通条件下的模糊车辆检测。为了填补这一空白,我们提出了一种新的双通道双路径YOLO (DD-YOLO)网络,该网络具有双通道特征提取和双路径特征融合。该网络由混合池金字塔(HPP)模块、双通道特征提取骨干和双路径融合池颈(dpfp -颈)组成。在DD-YOLO网络中,我们首先引入HPP模块,通过结合最大池和平均池来减少对关键特征的依赖,并结合背景信息来减少误报。随后,设计了双通道主干网,通过集成卷积块注意模块(CBAM)、简单无参数注意模块(SIMAM)、标准卷积和鬼卷积等多种卷积和注意机制,提高DD-YOLO对模糊车辆目标的灵敏度,以捕获更丰富的特征,提高召回率。最后,开发了dpfp颈来融合不同的信息,并在网络深度上扩展接受域,在准确率和召回率之间提供了令人满意的平衡。在BDD100K和KITTI数据集上的实验表明,DD-YOLO的检测精度分别提高了4.9%和4.0%,[email protected]比基线分别提高了2.4%和2.7%,证明了其在检测模糊车辆目标方面的有效性和实时性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
DD-YOLO: A dual-channel dual-path YOLO network for target detection of blurred vehicles
The detection of blurred vehicle targets is essential for maintaining traffic efficiency and ensuring road safety. Although various you-only-look-once (YOLO)-based models exist, few studies have focused on blurred vehicle detection under real-world traffic conditions. To fill this gap, we propose a novel dual-channel dual-path YOLO (DD-YOLO) network, featuring dual-channel feature extraction and dual-path feature fusion. The network comprises a hybrid pooling pyramid (HPP) module, a dual-channel feature extraction backbone, and a dual-path fusion pooling neck (DPFP-neck). Within the DD-YOLO network, we first introduce the HPP module to reduce dependence on key features by combining max and average pooling, incorporating background information to mitigate false positives. Subsequently, the dual-channel backbone is designed to enhance DD-YOLO’s sensitivity for blurred vehicle targets by integrating multiple convolution and attention mechanisms, including the convolutional block attention module (CBAM), simple and parameter-free attention module (SIMAM), standard convolution, and ghost convolution, to capture richer features and improve recall. Finally, the DPFP-neck is developed to fuse diverse information and expand the receptive field across network depths, providing a satisfactory balance between precision and recall. Experiments on the BDD100K and KITTI datasets show that DD-YOLO improves detection accuracy by 4.9% and 4.0%, respectively, with [email protected] gains of 2.4% and 2.7% over the baseline, demonstrating its effectiveness and real-time capability in detecting blurred vehicle targets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Digital Signal Processing
Digital Signal Processing 工程技术-工程:电子与电气
CiteScore
5.30
自引率
17.20%
发文量
435
审稿时长
66 days
期刊介绍: Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信