{"title":"Fourier-Mixture of Experts YOLO for concrete crack segmentation with visual interpretability","authors":"Haochen Chang , David Bassir , Anicet Barrios , Gongfa Chen","doi":"10.1016/j.autcon.2025.106452","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate crack segmentation is essential for structural health monitoring, yet most deep-learning studies treat the task as binary and struggle with varied morphologies. This paper introduces FMOE-YOLO, a Fourier-enhanced Mixture-of-Experts extension of YOLO that integrates a Multi-branched Auxiliary Feature Pyramid Network (MAFPN) and an SPPF_LSKA large-kernel attention head. The Fourier expert captures high-frequency crack cues, while MAFPN with LSKA supplies rich multiscale context. Experiments on three datasets of rising difficulty — Individual, Single-Crack (four classes), and Complex (six classes) — show consistent gains over standard YOLOv8. On the Single-Crack set the model attains 86.2% [email protected], improving performance by 4.7 percentage points. t-SNE and UMAP embeddings reveal tighter, better separated clusters, and Grad-CAM maps confirm sharper crack localization, demonstrating enhanced interpretability. The proposed approach offers strong potential for real-world monitoring, effectively handling diverse crack morphologies and challenging geometric conditions.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"179 ","pages":"Article 106452"},"PeriodicalIF":11.5000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525004923","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate crack segmentation is essential for structural health monitoring, yet most deep-learning studies treat the task as binary and struggle with varied morphologies. This paper introduces FMOE-YOLO, a Fourier-enhanced Mixture-of-Experts extension of YOLO that integrates a Multi-branched Auxiliary Feature Pyramid Network (MAFPN) and an SPPF_LSKA large-kernel attention head. The Fourier expert captures high-frequency crack cues, while MAFPN with LSKA supplies rich multiscale context. Experiments on three datasets of rising difficulty — Individual, Single-Crack (four classes), and Complex (six classes) — show consistent gains over standard YOLOv8. On the Single-Crack set the model attains 86.2% [email protected], improving performance by 4.7 percentage points. t-SNE and UMAP embeddings reveal tighter, better separated clusters, and Grad-CAM maps confirm sharper crack localization, demonstrating enhanced interpretability. The proposed approach offers strong potential for real-world monitoring, effectively handling diverse crack morphologies and challenging geometric conditions.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.