{"title":"智能交通多尺度小目标鲁棒检测","authors":"Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin","doi":"10.1002/cpe.70268","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 23-24","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation\",\"authors\":\"Keyou Guo, Jiangnan Wang, Haibing Jiang, Pei Zhang, Huangcheng Qin\",\"doi\":\"10.1002/cpe.70268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce <i>SC-YOLO</i>, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 23-24\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70268","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
SC-YOLO: Robust Multi-Scale Small Object Detection for Intelligent Transportation
Existing multi-scale vehicle detection methods often falter in crowded traffic scenarios—particularly when it comes to locating small vehicles, resolving occlusions, and adapting to scale variations—which leads to a marked drop in overall accuracy. To overcome these challenges, we introduce SC-YOLO, a lightweight detection framework built upon YOLOv10n and optimized for greater efficiency. First, we replace the standard downsampling between the backbone and neck with a Space-to-Depth Convolution (SPDConv) module, preserving fine-grained details in the lower levels of the feature pyramid so that cues for small targets remain intact. Next, we propose a Context-Guided Rectangular Feature Pyramid Network (CGRFPN) equipped with a self-calibration mechanism; by enabling cross-scale interactions and adaptive feature-map calibration, it significantly enhances multi-scale fusion. Finally, guided by extensive empirical evaluation, we adopt the Wise-IoUv3 dynamic loss function, whose adaptive gradient allocation refines bounding-box regression. On the Pascal VOC, KITTI, and Cars datasets, SC-YOLO attains mAP@50 scores of 79.0%, 87.3%, and 74.6%, respectively—improving upon the YOLOv10n baseline by 2.5%, 2.1%, and 2.3%. Crucially, it maintains high accuracy under challenging traffic conditions, especially for small-vehicle detection and occlusion resolution, while scaling more efficiently, requiring fewer computations than other models with comparable parameter counts. These combined advantages underscore SC-YOLO's resource-efficient design and its practicality for intelligent transportation and autonomous-driving applications.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.