{"title":"基于边缘信息和高效多尺度卷积的道路缺陷扩散检测模型","authors":"Xueqiu Wang , Huanbing Gao , Zemeng Jia","doi":"10.1016/j.asoc.2025.113332","DOIUrl":null,"url":null,"abstract":"<div><div>Roads are vital infrastructure components, and the prompt detection and repair of defects are critical for their longevity and safety. This paper introduces the Edge Efficient Multi-Scale Focusing Diffusion Network (EEFNet), a precise method for road defect detection. The Edge Information Enhancement Module (EIEM) accentuates crack contours while minimizing background noise. An Efficient Multi-Scale Convolution (EMSConv) is proposed. The EMSConv captures features across multiple scales, thereby enhancing model efficiency through reduced computational demands and parameter count. The Focusing Diffusion Pyramid Network (FDPN) collects and distributes context-rich features across various scales using a diffusion mechanism, thereby improving detection capabilities. Additionally, the Task Dynamic Align Detection Head (TADDH) facilitates parameter sharing among detection heads, which enhances classification and localization accuracy. EEFNet has demonstrated a 92.7 % accuracy rate at 126 FPS (Frames Per Second) on a road defect dataset and has proven robustness on several diverse datasets including Unmanned Aerial Vehicle Asphalt Pavement Distress Dataset (UAPD), Visual Object Classes 2007 (VOC2007), Global Road Damage Detection2022 (GRDD2022), and Vision Meets Drone 2019 (Visdrone2019). In addition, by pruning the model and deploying it onto edge computing devices, practical experiments have demonstrated that the EEFNet model has substantial practical application value.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113332"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Diffusion-based road defect detection model integrating edge information and efficient multi-scale convolution\",\"authors\":\"Xueqiu Wang , Huanbing Gao , Zemeng Jia\",\"doi\":\"10.1016/j.asoc.2025.113332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Roads are vital infrastructure components, and the prompt detection and repair of defects are critical for their longevity and safety. This paper introduces the Edge Efficient Multi-Scale Focusing Diffusion Network (EEFNet), a precise method for road defect detection. The Edge Information Enhancement Module (EIEM) accentuates crack contours while minimizing background noise. An Efficient Multi-Scale Convolution (EMSConv) is proposed. The EMSConv captures features across multiple scales, thereby enhancing model efficiency through reduced computational demands and parameter count. The Focusing Diffusion Pyramid Network (FDPN) collects and distributes context-rich features across various scales using a diffusion mechanism, thereby improving detection capabilities. Additionally, the Task Dynamic Align Detection Head (TADDH) facilitates parameter sharing among detection heads, which enhances classification and localization accuracy. EEFNet has demonstrated a 92.7 % accuracy rate at 126 FPS (Frames Per Second) on a road defect dataset and has proven robustness on several diverse datasets including Unmanned Aerial Vehicle Asphalt Pavement Distress Dataset (UAPD), Visual Object Classes 2007 (VOC2007), Global Road Damage Detection2022 (GRDD2022), and Vision Meets Drone 2019 (Visdrone2019). In addition, by pruning the model and deploying it onto edge computing devices, practical experiments have demonstrated that the EEFNet model has substantial practical application value.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113332\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S156849462500643X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156849462500643X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Diffusion-based road defect detection model integrating edge information and efficient multi-scale convolution
Roads are vital infrastructure components, and the prompt detection and repair of defects are critical for their longevity and safety. This paper introduces the Edge Efficient Multi-Scale Focusing Diffusion Network (EEFNet), a precise method for road defect detection. The Edge Information Enhancement Module (EIEM) accentuates crack contours while minimizing background noise. An Efficient Multi-Scale Convolution (EMSConv) is proposed. The EMSConv captures features across multiple scales, thereby enhancing model efficiency through reduced computational demands and parameter count. The Focusing Diffusion Pyramid Network (FDPN) collects and distributes context-rich features across various scales using a diffusion mechanism, thereby improving detection capabilities. Additionally, the Task Dynamic Align Detection Head (TADDH) facilitates parameter sharing among detection heads, which enhances classification and localization accuracy. EEFNet has demonstrated a 92.7 % accuracy rate at 126 FPS (Frames Per Second) on a road defect dataset and has proven robustness on several diverse datasets including Unmanned Aerial Vehicle Asphalt Pavement Distress Dataset (UAPD), Visual Object Classes 2007 (VOC2007), Global Road Damage Detection2022 (GRDD2022), and Vision Meets Drone 2019 (Visdrone2019). In addition, by pruning the model and deploying it onto edge computing devices, practical experiments have demonstrated that the EEFNet model has substantial practical application value.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.