智能交通多尺度小目标鲁棒检测

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Concurrency and Computation-Practice & Experience Pub Date : 2025-09-03 DOI:10.1002/cpe.70268

Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin

{"title":"智能交通多尺度小目标鲁棒检测","authors":"Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin","doi":"10.1002/cpe.70268","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 23-24","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation\",\"authors\":\"Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin\",\"doi\":\"10.1002/cpe.70268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 23-24\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

现有的多尺度车辆检测方法在拥挤的交通场景中往往会出现问题，特别是在定位小型车辆、解决遮挡和适应尺度变化时，这导致整体精度显著下降。为了克服这些挑战，我们推出了SC-YOLO，这是一种基于YOLOv10n的轻量级检测框架，并针对更高的效率进行了优化。首先，我们用空间到深度卷积（SPDConv）模块取代了脊柱和颈部之间的标准下采样，在特征金字塔的较低层次保留了细粒度的细节，以便小目标的线索保持完整。接下来，我们提出了一种具有自校准机制的上下文引导矩形特征金字塔网络（CGRFPN）；通过实现跨尺度交互和自适应特征图校准，显著增强了多尺度融合。最后，在广泛的经验评估的指导下，我们采用了Wise-IoUv3动态损失函数，其自适应梯度分配改进了边界盒回归。在Pascal VOC、KITTI和Cars数据集上，SC-YOLO的mAP@50得分分别为79.0%、87.3%和74.6%，比YOLOv10n基线分别提高了2.5%、2.1%和2.3%。至关重要的是，它在具有挑战性的交通条件下保持高精度，特别是对于小型车辆检测和遮挡分辨率，同时更有效地缩放，比具有可比参数数的其他模型需要更少的计算。这些综合优势强调了SC-YOLO的资源节约型设计及其在智能交通和自动驾驶应用中的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation

Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce SC-YOLO, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Concurrency and Computation-Practice & Experience 工程技术-计算机：理论方法

CiteScore

5.00

自引率

10.00%

发文量

664

审稿时长

9.6 months

期刊介绍： Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.