Huixia Li , Ruohan Chen , Nyirandayisabye Ritha , Jian Wang , Zi’ang Chen
{"title":"基于多层次感知和特征聚合的无人机图像道路损伤自动检测方法","authors":"Huixia Li , Ruohan Chen , Nyirandayisabye Ritha , Jian Wang , Zi’ang Chen","doi":"10.1016/j.aei.2025.103814","DOIUrl":null,"url":null,"abstract":"<div><div>Road infrastructure monitoring is critical for transportation safety and maintenance efficiency. However, existing road damage detection methods face challenges in multi-scale target recognition within complex environments, primarily attributed to their inadequate detection accuracy when processing drone-captured images. To overcome these limitations, this paper introduces the MFD-YOLO model. To tackle the insufficient detection accuracy of existing methods under complex backgrounds, this study designs two key structures: (1) A DS-MDPB backbone network that integrates three feature focusing mechanisms—MANet, SimAM, and BRA—to enhance feature extraction capabilities for pavement cracks and irregular distresses. (2) An MFDPN architecture that employs bidirectional feature propagation and dynamic weight allocation strategies, optimizing feature fusion through DG-C and RE-C modules. Experimental results demonstrate that on the RDD2022_China_Drone dataset, the model achieves an mAP50 of 73.1 %, reflecting a 5.5 % relative improvement over the baseline, and a 30.03 % relative enhancement in AP<sub>s</sub>. The model also exhibits consistent performance on UAV-PDD2023 and self-constructed datasets, demonstrating cross-scenario generalization capability. This method maintains high efficiency (FPS > 300) while improving detection accuracy for road damage in complex environments, providing reliable technical support for intelligent road maintenance.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103814"},"PeriodicalIF":9.9000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An automatic detection method for road damage in UAV images based on multi-level perception and feature aggregation\",\"authors\":\"Huixia Li , Ruohan Chen , Nyirandayisabye Ritha , Jian Wang , Zi’ang Chen\",\"doi\":\"10.1016/j.aei.2025.103814\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Road infrastructure monitoring is critical for transportation safety and maintenance efficiency. However, existing road damage detection methods face challenges in multi-scale target recognition within complex environments, primarily attributed to their inadequate detection accuracy when processing drone-captured images. To overcome these limitations, this paper introduces the MFD-YOLO model. To tackle the insufficient detection accuracy of existing methods under complex backgrounds, this study designs two key structures: (1) A DS-MDPB backbone network that integrates three feature focusing mechanisms—MANet, SimAM, and BRA—to enhance feature extraction capabilities for pavement cracks and irregular distresses. (2) An MFDPN architecture that employs bidirectional feature propagation and dynamic weight allocation strategies, optimizing feature fusion through DG-C and RE-C modules. Experimental results demonstrate that on the RDD2022_China_Drone dataset, the model achieves an mAP50 of 73.1 %, reflecting a 5.5 % relative improvement over the baseline, and a 30.03 % relative enhancement in AP<sub>s</sub>. The model also exhibits consistent performance on UAV-PDD2023 and self-constructed datasets, demonstrating cross-scenario generalization capability. This method maintains high efficiency (FPS > 300) while improving detection accuracy for road damage in complex environments, providing reliable technical support for intelligent road maintenance.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"69 \",\"pages\":\"Article 103814\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625007074\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625007074","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
An automatic detection method for road damage in UAV images based on multi-level perception and feature aggregation
Road infrastructure monitoring is critical for transportation safety and maintenance efficiency. However, existing road damage detection methods face challenges in multi-scale target recognition within complex environments, primarily attributed to their inadequate detection accuracy when processing drone-captured images. To overcome these limitations, this paper introduces the MFD-YOLO model. To tackle the insufficient detection accuracy of existing methods under complex backgrounds, this study designs two key structures: (1) A DS-MDPB backbone network that integrates three feature focusing mechanisms—MANet, SimAM, and BRA—to enhance feature extraction capabilities for pavement cracks and irregular distresses. (2) An MFDPN architecture that employs bidirectional feature propagation and dynamic weight allocation strategies, optimizing feature fusion through DG-C and RE-C modules. Experimental results demonstrate that on the RDD2022_China_Drone dataset, the model achieves an mAP50 of 73.1 %, reflecting a 5.5 % relative improvement over the baseline, and a 30.03 % relative enhancement in APs. The model also exhibits consistent performance on UAV-PDD2023 and self-constructed datasets, demonstrating cross-scenario generalization capability. This method maintains high efficiency (FPS > 300) while improving detection accuracy for road damage in complex environments, providing reliable technical support for intelligent road maintenance.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.