{"title":"用于水下桥墩裂缝机器人检测的增强型实时检测变压器 (RT-DETR)","authors":"Zhenming Lv, Shaojiang Dong, Zongyou Xia, Jingyao He, Jiawei Zhang","doi":"10.1016/j.autcon.2024.105921","DOIUrl":null,"url":null,"abstract":"The inadequate visual environment reduces the accuracy of underwater bridge pier fracture detection. Consequently, this paper suggests enhancing the backbone of the Real-Time Detection Transformer(RT-DETR) model to serve as the backbone of the YOLOv8 model. This will be achieved by substituting the Faster Implementation of CSP Bottleneck with 2 convolutions(C2f) module with the Poly Kernel Inception(PKI) Block, which is composed of the PKI Module and Context Anchor Attention(CAA) Block. Its strong capability to distinguish cracks and background features enables accurate recognition of underwater bridge pier cracks. To provide data for detecting these cracks, the enhanced Unpaired Image to Image Translation(CycleGAN) network converts land-style bridge crack images to underwater-style fracture images. The proposed model achieved an F1 score of 0.85 and a mAP50 of 0.84. The real-time detection of underwater bridge fractures by the underwater robot was facilitated by the FPS index of 87.47, which optimizes the detection efficiency.","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"12 1","pages":""},"PeriodicalIF":9.6000,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced real-time detection transformer (RT-DETR) for robotic inspection of underwater bridge pier cracks\",\"authors\":\"Zhenming Lv, Shaojiang Dong, Zongyou Xia, Jingyao He, Jiawei Zhang\",\"doi\":\"10.1016/j.autcon.2024.105921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The inadequate visual environment reduces the accuracy of underwater bridge pier fracture detection. Consequently, this paper suggests enhancing the backbone of the Real-Time Detection Transformer(RT-DETR) model to serve as the backbone of the YOLOv8 model. This will be achieved by substituting the Faster Implementation of CSP Bottleneck with 2 convolutions(C2f) module with the Poly Kernel Inception(PKI) Block, which is composed of the PKI Module and Context Anchor Attention(CAA) Block. Its strong capability to distinguish cracks and background features enables accurate recognition of underwater bridge pier cracks. To provide data for detecting these cracks, the enhanced Unpaired Image to Image Translation(CycleGAN) network converts land-style bridge crack images to underwater-style fracture images. The proposed model achieved an F1 score of 0.85 and a mAP50 of 0.84. The real-time detection of underwater bridge fractures by the underwater robot was facilitated by the FPS index of 87.47, which optimizes the detection efficiency.\",\"PeriodicalId\":8660,\"journal\":{\"name\":\"Automation in Construction\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2024-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automation in Construction\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.autcon.2024.105921\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.autcon.2024.105921","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Enhanced real-time detection transformer (RT-DETR) for robotic inspection of underwater bridge pier cracks
The inadequate visual environment reduces the accuracy of underwater bridge pier fracture detection. Consequently, this paper suggests enhancing the backbone of the Real-Time Detection Transformer(RT-DETR) model to serve as the backbone of the YOLOv8 model. This will be achieved by substituting the Faster Implementation of CSP Bottleneck with 2 convolutions(C2f) module with the Poly Kernel Inception(PKI) Block, which is composed of the PKI Module and Context Anchor Attention(CAA) Block. Its strong capability to distinguish cracks and background features enables accurate recognition of underwater bridge pier cracks. To provide data for detecting these cracks, the enhanced Unpaired Image to Image Translation(CycleGAN) network converts land-style bridge crack images to underwater-style fracture images. The proposed model achieved an F1 score of 0.85 and a mAP50 of 0.84. The real-time detection of underwater bridge fractures by the underwater robot was facilitated by the FPS index of 87.47, which optimizes the detection efficiency.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.