{"title":"基于 Sam 的实例分割模型,用于结构损伤的自动化检测","authors":"","doi":"10.1016/j.aei.2024.102826","DOIUrl":null,"url":null,"abstract":"<div><p>In infrastructure asset management, monitoring structural condition is vital for safety and cost-efficiency. Traditional visual inspections are subjective, inconsistent, and time-consuming. Advanced automating visual inspections using digital technologies and artificial intelligence can effectively address these issues. Previous studies mainly focused on concrete structures and pavements, neglecting masonry defects and lacking publicly available datasets. In this paper, we address these gaps by introducing the “MCrack1300” dataset, annotated for bricks, broken bricks, and cracks, targeting instance segmentation. We propose two novel, automatically executable methods based on the latest visual large-scale model, the prompt-based Segment Anything Model (SAM). We fine-tune SAM’s encoder using Low-Rank Adaptation (LoRA). The first method connects SAM’s encoder to other decoders directly, while the second uses a learnable self-generating prompter. We modify the feature extractor for seamless integration of these methods with SAM’s encoder. Both methods outperform the state-of-the-art models, improving benchmark results approximately 3 % across all classes and around 6 % specifically for cracks. Building on successful detection, we then propose a monocular-based method to automatically convert images into orthographic projection maps via Hough Line Transform. By incorporating known real sizes of brick units and employing Euclidean Distance Transform, we accurately estimate crack dimensions, with the error less than 10 %. Overall, we offer reliable automated solutions for masonry crack detection and size estimation, which effectively enhances the management and maintenance efficiency of masonry structural asset.</p></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":null,"pages":null},"PeriodicalIF":8.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sam-based instance segmentation models for the automation of structural damage detection\",\"authors\":\"\",\"doi\":\"10.1016/j.aei.2024.102826\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In infrastructure asset management, monitoring structural condition is vital for safety and cost-efficiency. Traditional visual inspections are subjective, inconsistent, and time-consuming. Advanced automating visual inspections using digital technologies and artificial intelligence can effectively address these issues. Previous studies mainly focused on concrete structures and pavements, neglecting masonry defects and lacking publicly available datasets. In this paper, we address these gaps by introducing the “MCrack1300” dataset, annotated for bricks, broken bricks, and cracks, targeting instance segmentation. We propose two novel, automatically executable methods based on the latest visual large-scale model, the prompt-based Segment Anything Model (SAM). We fine-tune SAM’s encoder using Low-Rank Adaptation (LoRA). The first method connects SAM’s encoder to other decoders directly, while the second uses a learnable self-generating prompter. We modify the feature extractor for seamless integration of these methods with SAM’s encoder. Both methods outperform the state-of-the-art models, improving benchmark results approximately 3 % across all classes and around 6 % specifically for cracks. Building on successful detection, we then propose a monocular-based method to automatically convert images into orthographic projection maps via Hough Line Transform. By incorporating known real sizes of brick units and employing Euclidean Distance Transform, we accurately estimate crack dimensions, with the error less than 10 %. Overall, we offer reliable automated solutions for masonry crack detection and size estimation, which effectively enhances the management and maintenance efficiency of masonry structural asset.</p></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624004749\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624004749","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
在基础设施资产管理中,监测结构状况对安全和成本效益至关重要。传统的视觉检测具有主观性、不一致性和耗时性。利用数字技术和人工智能实现先进的自动化视觉检测可以有效解决这些问题。以往的研究主要集中在混凝土结构和路面,忽略了砌体缺陷,也缺乏公开可用的数据集。本文通过引入 "MCrack1300 "数据集来填补这些空白,该数据集对砖块、碎砖和裂缝进行了注释,并以实例分割为目标。我们基于最新的视觉大规模模型,即基于提示的 "任何分割模型"(SAM),提出了两种新颖的、可自动执行的方法。我们使用低库自适应(Low-Rank Adaptation,LoRA)对 SAM 的编码器进行了微调。第一种方法将 SAM 编码器直接连接到其他解码器,第二种方法则使用可学习的自生成提示器。我们修改了特征提取器,以便将这些方法与 SAM 编码器无缝集成。这两种方法都优于最先进的模型,所有类别的基准结果提高了约 3%,裂缝的基准结果提高了约 6%。在成功检测的基础上,我们又提出了一种基于单眼的方法,通过 Hough 线性变换将图像自动转换为正投影图。通过结合已知砖块单元的实际尺寸并采用欧氏距离变换,我们准确地估算出了裂缝尺寸,误差小于 10%。总之,我们为砌体裂缝检测和尺寸估算提供了可靠的自动化解决方案,有效提高了砌体结构资产的管理和维护效率。
Sam-based instance segmentation models for the automation of structural damage detection
In infrastructure asset management, monitoring structural condition is vital for safety and cost-efficiency. Traditional visual inspections are subjective, inconsistent, and time-consuming. Advanced automating visual inspections using digital technologies and artificial intelligence can effectively address these issues. Previous studies mainly focused on concrete structures and pavements, neglecting masonry defects and lacking publicly available datasets. In this paper, we address these gaps by introducing the “MCrack1300” dataset, annotated for bricks, broken bricks, and cracks, targeting instance segmentation. We propose two novel, automatically executable methods based on the latest visual large-scale model, the prompt-based Segment Anything Model (SAM). We fine-tune SAM’s encoder using Low-Rank Adaptation (LoRA). The first method connects SAM’s encoder to other decoders directly, while the second uses a learnable self-generating prompter. We modify the feature extractor for seamless integration of these methods with SAM’s encoder. Both methods outperform the state-of-the-art models, improving benchmark results approximately 3 % across all classes and around 6 % specifically for cracks. Building on successful detection, we then propose a monocular-based method to automatically convert images into orthographic projection maps via Hough Line Transform. By incorporating known real sizes of brick units and employing Euclidean Distance Transform, we accurately estimate crack dimensions, with the error less than 10 %. Overall, we offer reliable automated solutions for masonry crack detection and size estimation, which effectively enhances the management and maintenance efficiency of masonry structural asset.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.