智能交通多尺度小目标鲁棒检测

IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin
{"title":"智能交通多尺度小目标鲁棒检测","authors":"Keyou Guo,&nbsp;Jiangnan Wang,&nbsp;Haibing Jiang,&nbsp;Pei Zhang,&nbsp;Huangcheng Qin","doi":"10.1002/cpe.70268","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 23-24","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation\",\"authors\":\"Keyou Guo,&nbsp;Jiangnan Wang,&nbsp;Haibing Jiang,&nbsp;Pei Zhang,&nbsp;Huangcheng Qin\",\"doi\":\"10.1002/cpe.70268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 23-24\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

现有的多尺度车辆检测方法在拥挤的交通场景中往往会出现问题,特别是在定位小型车辆、解决遮挡和适应尺度变化时,这导致整体精度显著下降。为了克服这些挑战,我们推出了SC-YOLO,这是一种基于YOLOv10n的轻量级检测框架,并针对更高的效率进行了优化。首先,我们用空间到深度卷积(SPDConv)模块取代了脊柱和颈部之间的标准下采样,在特征金字塔的较低层次保留了细粒度的细节,以便小目标的线索保持完整。接下来,我们提出了一种具有自校准机制的上下文引导矩形特征金字塔网络(CGRFPN);通过实现跨尺度交互和自适应特征图校准,显著增强了多尺度融合。最后,在广泛的经验评估的指导下,我们采用了Wise-IoUv3动态损失函数,其自适应梯度分配改进了边界盒回归。在Pascal VOC、KITTI和Cars数据集上,SC-YOLO的mAP@50得分分别为79.0%、87.3%和74.6%,比YOLOv10n基线分别提高了2.5%、2.1%和2.3%。至关重要的是,它在具有挑战性的交通条件下保持高精度,特别是对于小型车辆检测和遮挡分辨率,同时更有效地缩放,比具有可比参数数的其他模型需要更少的计算。这些综合优势强调了SC-YOLO的资源节约型设计及其在智能交通和自动驾驶应用中的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation

Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce SC-YOLO, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Concurrency and Computation-Practice & Experience
Concurrency and Computation-Practice & Experience 工程技术-计算机:理论方法
CiteScore
5.00
自引率
10.00%
发文量
664
审稿时长
9.6 months
期刊介绍: Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信