CBRFormer：基于渲染技术的桥梁裂缝图像精细分割转换器

IF 9.9 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics Pub Date : 2025-09-15 DOI:10.1016/j.aei.2025.103868

Honghu Chu , Jiahao Gai , Weiwei Chen , Jun Ma

{"title":"CBRFormer：基于渲染技术的桥梁裂缝图像精细分割转换器","authors":"Honghu Chu , Jiahao Gai , Weiwei Chen , Jun Ma","doi":"10.1016/j.aei.2025.103868","DOIUrl":null,"url":null,"abstract":"<div><div>High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"69 ","pages":"Article 103868"},"PeriodicalIF":9.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CBRFormer: rendering technology-based transformer for refinement segmentation of bridge crack images\",\"authors\":\"Honghu Chu , Jiahao Gai , Weiwei Chen , Jun Ma\",\"doi\":\"10.1016/j.aei.2025.103868\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"69 \",\"pages\":\"Article 103868\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S147403462500761X\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S147403462500761X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

高分辨率（HR）成像设备对于确保无人机在桥梁裂缝检测任务中的安全性和效率至关重要。然而，由于传统深度学习（DL）架构中离散采样的局限性和GPU计算资源的限制，对HR裂缝图像进行细粒度分割是一个挑战。为了有效应对这一挑战，作者从计算机图形学（CG）领域的细粒度渲染技术中汲取灵感，提出了裂纹边界细化变压器（CBRFormer）。通过三个定制的改进，该架构充分利用了渲染头在HR裂缝图像精细表示方面的优势。首先，设计了一种基于transformer的轻量级编码架构，使网络能够在复杂背景下准确捕获裂缝骨干特征；随后，引入了一种基于超分辨率重构技术的边界引导分支，帮助网络获取裂缝边界细节的深层语义信息。此外，针对训练和推理阶段的硬样例区域，定制了两种精细渲染点采样方法，确保用于精细渲染的预测头有效地关注模糊裂纹边界和微小裂纹区域。最后，通过烧蚀和现场实验验证了CBRFormer中各组成部分的有效性和网络的实用性。与当前先进的HR分割架构（如CascadePSP和Segfix）相比，CBRFormer在平均交叉优于联合（IoU）、平均边缘精度（mEA）和骰子系数（Dice coefficient）方面的平均性能分别提高了2.16%、7.80%和2.46%。利用CBRFormer可以对HR裂缝图像进行精确分割，为检查员提供更全面、准确的结构裂缝信息，为结构安全评估和维修决策提供技术支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CBRFormer: rendering technology-based transformer for refinement segmentation of bridge crack images

High-resolution (HR) imaging devices are crucial for ensuring the safety and efficiency of unmanned aerial vehicles (UAVs) during bridge crack detection tasks. However, due to the limitations of executing sampling discretely in traditional deep learning (DL) architectures and the constraints of GPU computing resources, it is challenging to perform fine-grained segmentation for HR crack images. To effectively address the challenge, the authors drew inspiration from the fine-grained rendering technology in the field of computer graphics (CG) and proposed the Crack Boundary Refinement Transformer (CBRFormer). Through three customized improvements, this architecture fully leverages the advantages of the rendering head in the refined representation of HR crack images. Firstly, a lightweight Transformer-based encoding architecture is designed, enabling the network to accurately capture crack backbone features from complex backgrounds. Subsequently, a boundary-guided branch based on super-resolution reconstruction technology is introduced to assist the network in capturing deep semantic information about crack boundary details. Additionally, two types of refined rendering point sampling methods are tailored for hard example areas during training and inference stages, ensuring that the prediction head used for refined rendering effectively focuses on ambiguous crack boundaries and tiny crack regions. Finally, the effectiveness of each component in the CBRFormer and the network’s practicality are demonstrated through ablation and the field experiment. Compared to the current advanced HR segmentation architectures like CascadePSP and Segfix, the CBRFormer achieved average performance improvements of 2.16% in mean Intersection over Union (IoU), 7.80% in mean Edge Accuracy (mEA), and 2.46% in Dice coefficient, respectively. The utilization of the CBRFormer enables precise segmentation of HR crack images, providing inspectors with more comprehensive and accurate structural crack information, thereby offering technical support for structural safety assessment and maintenance decision-making.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.