{"title":"CBRFormer:基于渲染技术的桥梁裂缝图像精细分割转换器","authors":"Honghu Chu , Jiahao Gai , Weiwei Chen , Jun Ma","doi":"10.1016/j.aei.2025.103868","DOIUrl":null,"url":null,"abstract":"<div><div>High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103868"},"PeriodicalIF":9.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CBRFormer: rendering technology-based transformer for refinement segmentation of bridge crack images\",\"authors\":\"Honghu Chu , Jiahao Gai , Weiwei Chen , Jun Ma\",\"doi\":\"10.1016/j.aei.2025.103868\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"69 \",\"pages\":\"Article 103868\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S147403462500761X\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S147403462500761X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
CBRFormer: rendering technology-based transformer for refinement segmentation of bridge crack images
High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.