{"title":"变形管CCTV视频的时间缺陷点定位","authors":"Zhu Huang , Gang Pan , Chao Kang , YaoZhi Lv","doi":"10.1016/j.autcon.2025.106160","DOIUrl":null,"url":null,"abstract":"<div><div>During the inspection and maintenance of underground pipe systems, technicians often spend considerable time searching for subtle defects in inspection videos captured under varying pipe conditions using Closed-Circuit Television (CCTV). The lack of feature extractors tailored for pipe images, combined with the complexity of pipe CCTV videos, poses substantial challenges to the performance and applicability of conventional frame-by-frame, image-based localization algorithms. To address these challenges, this paper introduces PipeTR, a transformer-driven, end-to-end network, offering enhanced insights into pipe CCTV video analysis by shifting from a frame-based to a video-based approach. The development of PipeTR aims to assist technicians by automating the most time-consuming step of the assessment, thereby improving both efficiency and accuracy. Experiments demonstrate that PipeTR outperforms other image-based, frame-by-frame analysis methods on real-world CCTV pipe inspection video datasets, achieving an average F1 score of 43.04%, which is a 5.35% improvement over the current state-of-the-art methods.</div></div>","PeriodicalId":8660,"journal":{"name":"Automation in Construction","volume":"175 ","pages":"Article 106160"},"PeriodicalIF":9.6000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Temporal defect point localization in pipe CCTV Videos with Transformers\",\"authors\":\"Zhu Huang , Gang Pan , Chao Kang , YaoZhi Lv\",\"doi\":\"10.1016/j.autcon.2025.106160\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>During the inspection and maintenance of underground pipe systems, technicians often spend considerable time searching for subtle defects in inspection videos captured under varying pipe conditions using Closed-Circuit Television (CCTV). The lack of feature extractors tailored for pipe images, combined with the complexity of pipe CCTV videos, poses substantial challenges to the performance and applicability of conventional frame-by-frame, image-based localization algorithms. To address these challenges, this paper introduces PipeTR, a transformer-driven, end-to-end network, offering enhanced insights into pipe CCTV video analysis by shifting from a frame-based to a video-based approach. The development of PipeTR aims to assist technicians by automating the most time-consuming step of the assessment, thereby improving both efficiency and accuracy. Experiments demonstrate that PipeTR outperforms other image-based, frame-by-frame analysis methods on real-world CCTV pipe inspection video datasets, achieving an average F1 score of 43.04%, which is a 5.35% improvement over the current state-of-the-art methods.</div></div>\",\"PeriodicalId\":8660,\"journal\":{\"name\":\"Automation in Construction\",\"volume\":\"175 \",\"pages\":\"Article 106160\"},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2025-04-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automation in Construction\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926580525002006\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automation in Construction","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926580525002006","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Temporal defect point localization in pipe CCTV Videos with Transformers
During the inspection and maintenance of underground pipe systems, technicians often spend considerable time searching for subtle defects in inspection videos captured under varying pipe conditions using Closed-Circuit Television (CCTV). The lack of feature extractors tailored for pipe images, combined with the complexity of pipe CCTV videos, poses substantial challenges to the performance and applicability of conventional frame-by-frame, image-based localization algorithms. To address these challenges, this paper introduces PipeTR, a transformer-driven, end-to-end network, offering enhanced insights into pipe CCTV video analysis by shifting from a frame-based to a video-based approach. The development of PipeTR aims to assist technicians by automating the most time-consuming step of the assessment, thereby improving both efficiency and accuracy. Experiments demonstrate that PipeTR outperforms other image-based, frame-by-frame analysis methods on real-world CCTV pipe inspection video datasets, achieving an average F1 score of 43.04%, which is a 5.35% improvement over the current state-of-the-art methods.
期刊介绍:
Automation in Construction is an international journal that focuses on publishing original research papers related to the use of Information Technologies in various aspects of the construction industry. The journal covers topics such as design, engineering, construction technologies, and the maintenance and management of constructed facilities.
The scope of Automation in Construction is extensive and covers all stages of the construction life cycle. This includes initial planning and design, construction of the facility, operation and maintenance, as well as the eventual dismantling and recycling of buildings and engineering structures.