{"title":"基于yolov10改进的轻型多尺度特征融合道路缺陷检测模型及其应用","authors":"Jianxi Ou , Jianqin Zhang , Haoyu Li , Bin Duan","doi":"10.1016/j.advengsoft.2025.103976","DOIUrl":null,"url":null,"abstract":"<div><div>Intelligent road damage detection is critical for ensuring traffic safety and extending the lifespan of roads. However, existing methods struggle to balance high accuracy and real-time performance in complex detection scenarios and resource-constrained environments. To address this issue, this study proposes a lightweight multi-scale feature fusion model based on an improved YOLOv10—GAS-YOLO. The model utilizes a novel lightweight architecture (GSF-ST) designed through a combination of feature generation, asymmetric convolution, and grouped channel shuffling optimization strategies, significantly reducing computational complexity and parameter count while enhancing both global and local feature representation. To improve multi-scale damage detection performance, GAS-YOLO incorporates an improved bidirectional feature pyramid network (BiFPN) and Swin Transformer module. A resolution halving and channel doubling strategy enhances the detection ability of small targets. Moreover, the WiOU loss function further optimizes bounding box regression accuracy, mitigating errors caused by sample imbalance. Channel pruning techniques are applied to achieve secondary lightweight compression of the model, resulting in significant resource savings. Through comparative experiments and ablation analysis with several advanced damage detection models, this study demonstrates a significant performance improvement of GAS-YOLO. Experimental results show that GAS-YOLO exhibits outstanding performance in multi-scale damage detection tasks, with 5.6 M parameters, 8.4GFLOPs of computational complexity, and a model size of only 5.8 MB. Compared to baseline models, detection accuracy improves by 10.8 %, computational complexity is reduced by 2.57 times, and parameter count is reduced by 1.29 times, with an average detection accuracy of 86.5 % and a single image processing time of 6.1 ms. Validation on both public datasets and self-constructed datasets further proves its real-time processing capability while maintaining high accuracy. The GAS-YOLO model proposed in this study not only provides a practical solution for road damage detection in resource-constrained environments but also offers new insights for intelligent management of intelligent transportation and urban infrastructure, with broad application prospects.</div></div>","PeriodicalId":50866,"journal":{"name":"Advances in Engineering Software","volume":"208 ","pages":"Article 103976"},"PeriodicalIF":5.7000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved YOLOv10-based lightweight multi-scale feature fusion model for road defect detection and its applications\",\"authors\":\"Jianxi Ou , Jianqin Zhang , Haoyu Li , Bin Duan\",\"doi\":\"10.1016/j.advengsoft.2025.103976\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Intelligent road damage detection is critical for ensuring traffic safety and extending the lifespan of roads. However, existing methods struggle to balance high accuracy and real-time performance in complex detection scenarios and resource-constrained environments. To address this issue, this study proposes a lightweight multi-scale feature fusion model based on an improved YOLOv10—GAS-YOLO. The model utilizes a novel lightweight architecture (GSF-ST) designed through a combination of feature generation, asymmetric convolution, and grouped channel shuffling optimization strategies, significantly reducing computational complexity and parameter count while enhancing both global and local feature representation. To improve multi-scale damage detection performance, GAS-YOLO incorporates an improved bidirectional feature pyramid network (BiFPN) and Swin Transformer module. A resolution halving and channel doubling strategy enhances the detection ability of small targets. Moreover, the WiOU loss function further optimizes bounding box regression accuracy, mitigating errors caused by sample imbalance. Channel pruning techniques are applied to achieve secondary lightweight compression of the model, resulting in significant resource savings. Through comparative experiments and ablation analysis with several advanced damage detection models, this study demonstrates a significant performance improvement of GAS-YOLO. Experimental results show that GAS-YOLO exhibits outstanding performance in multi-scale damage detection tasks, with 5.6 M parameters, 8.4GFLOPs of computational complexity, and a model size of only 5.8 MB. Compared to baseline models, detection accuracy improves by 10.8 %, computational complexity is reduced by 2.57 times, and parameter count is reduced by 1.29 times, with an average detection accuracy of 86.5 % and a single image processing time of 6.1 ms. Validation on both public datasets and self-constructed datasets further proves its real-time processing capability while maintaining high accuracy. The GAS-YOLO model proposed in this study not only provides a practical solution for road damage detection in resource-constrained environments but also offers new insights for intelligent management of intelligent transportation and urban infrastructure, with broad application prospects.</div></div>\",\"PeriodicalId\":50866,\"journal\":{\"name\":\"Advances in Engineering Software\",\"volume\":\"208 \",\"pages\":\"Article 103976\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Engineering Software\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0965997825001140\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Engineering Software","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965997825001140","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
An improved YOLOv10-based lightweight multi-scale feature fusion model for road defect detection and its applications
Intelligent road damage detection is critical for ensuring traffic safety and extending the lifespan of roads. However, existing methods struggle to balance high accuracy and real-time performance in complex detection scenarios and resource-constrained environments. To address this issue, this study proposes a lightweight multi-scale feature fusion model based on an improved YOLOv10—GAS-YOLO. The model utilizes a novel lightweight architecture (GSF-ST) designed through a combination of feature generation, asymmetric convolution, and grouped channel shuffling optimization strategies, significantly reducing computational complexity and parameter count while enhancing both global and local feature representation. To improve multi-scale damage detection performance, GAS-YOLO incorporates an improved bidirectional feature pyramid network (BiFPN) and Swin Transformer module. A resolution halving and channel doubling strategy enhances the detection ability of small targets. Moreover, the WiOU loss function further optimizes bounding box regression accuracy, mitigating errors caused by sample imbalance. Channel pruning techniques are applied to achieve secondary lightweight compression of the model, resulting in significant resource savings. Through comparative experiments and ablation analysis with several advanced damage detection models, this study demonstrates a significant performance improvement of GAS-YOLO. Experimental results show that GAS-YOLO exhibits outstanding performance in multi-scale damage detection tasks, with 5.6 M parameters, 8.4GFLOPs of computational complexity, and a model size of only 5.8 MB. Compared to baseline models, detection accuracy improves by 10.8 %, computational complexity is reduced by 2.57 times, and parameter count is reduced by 1.29 times, with an average detection accuracy of 86.5 % and a single image processing time of 6.1 ms. Validation on both public datasets and self-constructed datasets further proves its real-time processing capability while maintaining high accuracy. The GAS-YOLO model proposed in this study not only provides a practical solution for road damage detection in resource-constrained environments but also offers new insights for intelligent management of intelligent transportation and urban infrastructure, with broad application prospects.
期刊介绍:
The objective of this journal is to communicate recent and projected advances in computer-based engineering techniques. The fields covered include mechanical, aerospace, civil and environmental engineering, with an emphasis on research and development leading to practical problem-solving.
The scope of the journal includes:
• Innovative computational strategies and numerical algorithms for large-scale engineering problems
• Analysis and simulation techniques and systems
• Model and mesh generation
• Control of the accuracy, stability and efficiency of computational process
• Exploitation of new computing environments (eg distributed hetergeneous and collaborative computing)
• Advanced visualization techniques, virtual environments and prototyping
• Applications of AI, knowledge-based systems, computational intelligence, including fuzzy logic, neural networks and evolutionary computations
• Application of object-oriented technology to engineering problems
• Intelligent human computer interfaces
• Design automation, multidisciplinary design and optimization
• CAD, CAE and integrated process and product development systems
• Quality and reliability.