{"title":"GLoU-MiT: Lightweight Global-Local Mamba-Guided U-mix transformer for UAV-based pavement crack segmentation","authors":"Jinhuan Shan , Yue Huang , Wei Jiang , Dongdong Yuan , Feiyang Guo","doi":"10.1016/j.aei.2025.103384","DOIUrl":null,"url":null,"abstract":"<div><div>The utility of Unmanned Aerial Vehicles (UAVs) for routine pavement distresses inspection has been increasingly recognized due to their efficiency, flexibility, safety, and low-cost automation. However, UAV-acquired high-altitude images present unique challenges for deep learning-based semantic segmentation models, such as minute crack details, blurred boundaries, and high levels of environmental noise. We propose GLoU-MiT, a lightweight segmentation model designed to address the difficulties of UAV-based pavement crack segmentation. Our model integrates a U-shaped Mix Transformer architecture for efficient hierarchical feature extraction, a Global-Local Mamba-Guided Skip Connection for improved feature alignment and computational efficiency, and a Boundary / Semantic Deep Supervision Refinement Module to enhance segmentation precision in complex scenarios. Extensive experiments on UAV-Crack500, CrackSC and Crack500 datasets demonstrate that GLoU-MiT effectively improves segmentation accuracy, particularly in low-contrast and complex background environments, making it a robust solution for UAV-based pavement crack inspection tasks. Furthermore, inference speed and energy consumption evaluations conducted on the Jetson Orin Nano (8 GB) show that our model achieves an excellent balance between accuracy, energy efficiency, and speed. The code will be released at: <span><span>https://github.com/SHAN-JH/GLoU-MiT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103384"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002770","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The utility of Unmanned Aerial Vehicles (UAVs) for routine pavement distresses inspection has been increasingly recognized due to their efficiency, flexibility, safety, and low-cost automation. However, UAV-acquired high-altitude images present unique challenges for deep learning-based semantic segmentation models, such as minute crack details, blurred boundaries, and high levels of environmental noise. We propose GLoU-MiT, a lightweight segmentation model designed to address the difficulties of UAV-based pavement crack segmentation. Our model integrates a U-shaped Mix Transformer architecture for efficient hierarchical feature extraction, a Global-Local Mamba-Guided Skip Connection for improved feature alignment and computational efficiency, and a Boundary / Semantic Deep Supervision Refinement Module to enhance segmentation precision in complex scenarios. Extensive experiments on UAV-Crack500, CrackSC and Crack500 datasets demonstrate that GLoU-MiT effectively improves segmentation accuracy, particularly in low-contrast and complex background environments, making it a robust solution for UAV-based pavement crack inspection tasks. Furthermore, inference speed and energy consumption evaluations conducted on the Jetson Orin Nano (8 GB) show that our model achieves an excellent balance between accuracy, energy efficiency, and speed. The code will be released at: https://github.com/SHAN-JH/GLoU-MiT.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.