Yanyan Wang , Kechen Song , Yuyuan Liu , Tianze Li , Yunhui Yan , Gustavo Carneiro
{"title":"Bimodal defect segmentation with Geometric Prior-supported Anti-imbalance Learning for pavement defect evaluation and repair","authors":"Yanyan Wang , Kechen Song , Yuyuan Liu , Tianze Li , Yunhui Yan , Gustavo Carneiro","doi":"10.1016/j.autcon.2025.106497","DOIUrl":null,"url":null,"abstract":"<div><div>Pavement defect segmentation benefits from integrating 2D images with 3D depth data, enabling more accurate evaluation and repair decisions. However, current methods suffer from indiscriminate feature extraction that overlooks severe class imbalance, limited adaptability to defects with diverse shapes and appearances, and a lack of publicly available bimodal datasets with multi-class annotations. To address these challenges, this paper proposes Geometric Prior-supported Anti-imbalance Learning (GPAL), a bimodal segmentation framework based on the Segment Anything Model (SAM). GPAL introduces: (1) Defect-centric Reinforcing (DR) prompting, which leverages geometric priors derived from depth flow to enhance the image encoder’s focus on defect details in a defect-oriented manner; and (2) Depth Prototype-support Uniform Prompting (DPUP), which extends SAM for bimodal, multi-class defect segmentation by learning class-balanced depth prototypes. Additionally, the Bimodal Multi-category Pavement Defect (BMPD) dataset is constructed, containing 5,059 samples across four defect categories. Experimental results demonstrate that the proposed method effectively highlights defects under imbalanced conditions and adapts to diverse defects, achieving an mF1 score of 86.01% on the BMPD dataset. This advancement supports reliable pavement assessment and repair decisions, while laying a foundation for addressing class imbalance in diverse tasks through geometric priors and prompting strategies.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"180 ","pages":"Article 106497"},"PeriodicalIF":11.5000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525005370","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Pavement defect segmentation benefits from integrating 2D images with 3D depth data, enabling more accurate evaluation and repair decisions. However, current methods suffer from indiscriminate feature extraction that overlooks severe class imbalance, limited adaptability to defects with diverse shapes and appearances, and a lack of publicly available bimodal datasets with multi-class annotations. To address these challenges, this paper proposes Geometric Prior-supported Anti-imbalance Learning (GPAL), a bimodal segmentation framework based on the Segment Anything Model (SAM). GPAL introduces: (1) Defect-centric Reinforcing (DR) prompting, which leverages geometric priors derived from depth flow to enhance the image encoder’s focus on defect details in a defect-oriented manner; and (2) Depth Prototype-support Uniform Prompting (DPUP), which extends SAM for bimodal, multi-class defect segmentation by learning class-balanced depth prototypes. Additionally, the Bimodal Multi-category Pavement Defect (BMPD) dataset is constructed, containing 5,059 samples across four defect categories. Experimental results demonstrate that the proposed method effectively highlights defects under imbalanced conditions and adapts to diverse defects, achieving an mF1 score of 86.01% on the BMPD dataset. This advancement supports reliable pavement assessment and repair decisions, while laying a foundation for addressing class imbalance in diverse tasks through geometric priors and prompting strategies.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.