Small object detection algorithm based on improved YOLOv10 for traffic sign

IF 3.8 Q2 TRANSPORTATION

Transportation Research Interdisciplinary Perspectives Pub Date : 2025-07-01 DOI:10.1016/j.trip.2025.101501

Yukang Zou, Scarlett Liu

{"title":"Small object detection algorithm based on improved YOLOv10 for traffic sign","authors":"Yukang Zou, Scarlett Liu","doi":"10.1016/j.trip.2025.101501","DOIUrl":null,"url":null,"abstract":"<div><div>Traffic sign detection (TSD) is a critical task in intelligent transportation systems (ITS) and autonomous driving, facing challenges such as complex backgrounds and small-scale objects. Existing methods often suffer from high miss and false alarm rates, particularly in dynamic or cluttered environments, limiting their practical applicability. To address these issues, we propose LTS-YOLOv10, an improved version of YOLOv10, designed to enhance small object detection accuracy and overall performance in complex real-world conditions. Our approach introduces Omni-Dimensional Dynamic Convolution (ODConv), which utilizes a four-dimensional dynamic convolution mechanism to improve the capture of multi-scale and complex background features. Additionally, we integrate an attention-guided bidirectional feature pyramid network (EMA-BiFPN) to enhance feature fusion, further improving the detection accuracy for small objects. The MPDIoU loss function is employed during bounding box regression to optimize precision and recall for irregularly shaped targets. Experimental results on three public datasets demonstrate that LTS-YOLOv10 achieves a 3.8% improvement in mAP on the CCTSDB dataset compared to the original YOLOv10, with notable gains on the TT100K and DFG datasets as well. These improvements are achieved with only a slight increase in parameters, demonstrating the model’s superiority in terms of accuracy, robustness, and real-time performance. LTS-YOLOv10 provides a promising solution for practical traffic sign detection, with future work focusing on further enhancing the model’s real-time capabilities and optimizing its application in edge computing environments.</div></div>","PeriodicalId":36621,"journal":{"name":"Transportation Research Interdisciplinary Perspectives","volume":"32 ","pages":"Article 101501"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Interdisciplinary Perspectives","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590198225001800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION","Score":null,"Total":0}

引用次数: 0

Abstract

Traffic sign detection (TSD) is a critical task in intelligent transportation systems (ITS) and autonomous driving, facing challenges such as complex backgrounds and small-scale objects. Existing methods often suffer from high miss and false alarm rates, particularly in dynamic or cluttered environments, limiting their practical applicability. To address these issues, we propose LTS-YOLOv10, an improved version of YOLOv10, designed to enhance small object detection accuracy and overall performance in complex real-world conditions. Our approach introduces Omni-Dimensional Dynamic Convolution (ODConv), which utilizes a four-dimensional dynamic convolution mechanism to improve the capture of multi-scale and complex background features. Additionally, we integrate an attention-guided bidirectional feature pyramid network (EMA-BiFPN) to enhance feature fusion, further improving the detection accuracy for small objects. The MPDIoU loss function is employed during bounding box regression to optimize precision and recall for irregularly shaped targets. Experimental results on three public datasets demonstrate that LTS-YOLOv10 achieves a 3.8% improvement in mAP on the CCTSDB dataset compared to the original YOLOv10, with notable gains on the TT100K and DFG datasets as well. These improvements are achieved with only a slight increase in parameters, demonstrating the model’s superiority in terms of accuracy, robustness, and real-time performance. LTS-YOLOv10 provides a promising solution for practical traffic sign detection, with future work focusing on further enhancing the model’s real-time capabilities and optimizing its application in edge computing environments.

查看原文本刊更多论文

基于改进YOLOv10的交通标志小目标检测算法

交通标志检测（TSD）是智能交通系统（ITS）和自动驾驶中的一项关键任务，面临着复杂背景和小尺度物体等挑战。现有的方法往往存在高漏报率和虚警率，特别是在动态或混乱的环境中，限制了它们的实际适用性。为了解决这些问题，我们提出了LTS-YOLOv10，这是YOLOv10的改进版本，旨在提高复杂现实条件下的小目标检测精度和整体性能。我们的方法引入了全维动态卷积（ODConv），它利用四维动态卷积机制来提高对多尺度和复杂背景特征的捕获。此外，我们还集成了一个注意引导的双向特征金字塔网络（EMA-BiFPN）来增强特征融合，进一步提高了对小目标的检测精度。在边界盒回归中采用MPDIoU损失函数优化不规则形状目标的精度和召回率。在三个公共数据集上的实验结果表明，LTS-YOLOv10在CCTSDB数据集上的mAP比原来的YOLOv10提高了3.8%，在TT100K和DFG数据集上也有显著的提高。这些改进仅在参数略有增加的情况下实现，表明该模型在准确性、鲁棒性和实时性方面具有优势。LTS-YOLOv10为实际交通标志检测提供了一个很有前途的解决方案，未来的工作重点是进一步增强模型的实时能力，并优化其在边缘计算环境中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊