Sam-based instance segmentation models for the automation of structural damage detection

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
{"title":"Sam-based instance segmentation models for the automation of structural damage detection","authors":"","doi":"10.1016/j.aei.2024.102826","DOIUrl":null,"url":null,"abstract":"<div><p>In infrastructure asset management, monitoring structural condition is vital for safety and cost-efficiency. Traditional visual inspections are subjective, inconsistent, and time-consuming. Advanced automating visual inspections using digital technologies and artificial intelligence can effectively address these issues. Previous studies mainly focused on concrete structures and pavements, neglecting masonry defects and lacking publicly available datasets. In this paper, we address these gaps by introducing the “MCrack1300” dataset, annotated for bricks, broken bricks, and cracks, targeting instance segmentation. We propose two novel, automatically executable methods based on the latest visual large-scale model, the prompt-based Segment Anything Model (SAM). We fine-tune SAM’s encoder using Low-Rank Adaptation (LoRA). The first method connects SAM’s encoder to other decoders directly, while the second uses a learnable self-generating prompter. We modify the feature extractor for seamless integration of these methods with SAM’s encoder. Both methods outperform the state-of-the-art models, improving benchmark results approximately 3 % across all classes and around 6 % specifically for cracks. Building on successful detection, we then propose a monocular-based method to automatically convert images into orthographic projection maps via Hough Line Transform. By incorporating known real sizes of brick units and employing Euclidean Distance Transform, we accurately estimate crack dimensions, with the error less than 10 %. Overall, we offer reliable automated solutions for masonry crack detection and size estimation, which effectively enhances the management and maintenance efficiency of masonry structural asset.</p></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":null,"pages":null},"PeriodicalIF":8.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624004749","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

In infrastructure asset management, monitoring structural condition is vital for safety and cost-efficiency. Traditional visual inspections are subjective, inconsistent, and time-consuming. Advanced automating visual inspections using digital technologies and artificial intelligence can effectively address these issues. Previous studies mainly focused on concrete structures and pavements, neglecting masonry defects and lacking publicly available datasets. In this paper, we address these gaps by introducing the “MCrack1300” dataset, annotated for bricks, broken bricks, and cracks, targeting instance segmentation. We propose two novel, automatically executable methods based on the latest visual large-scale model, the prompt-based Segment Anything Model (SAM). We fine-tune SAM’s encoder using Low-Rank Adaptation (LoRA). The first method connects SAM’s encoder to other decoders directly, while the second uses a learnable self-generating prompter. We modify the feature extractor for seamless integration of these methods with SAM’s encoder. Both methods outperform the state-of-the-art models, improving benchmark results approximately 3 % across all classes and around 6 % specifically for cracks. Building on successful detection, we then propose a monocular-based method to automatically convert images into orthographic projection maps via Hough Line Transform. By incorporating known real sizes of brick units and employing Euclidean Distance Transform, we accurately estimate crack dimensions, with the error less than 10 %. Overall, we offer reliable automated solutions for masonry crack detection and size estimation, which effectively enhances the management and maintenance efficiency of masonry structural asset.

基于 Sam 的实例分割模型,用于结构损伤的自动化检测
在基础设施资产管理中,监测结构状况对安全和成本效益至关重要。传统的视觉检测具有主观性、不一致性和耗时性。利用数字技术和人工智能实现先进的自动化视觉检测可以有效解决这些问题。以往的研究主要集中在混凝土结构和路面,忽略了砌体缺陷,也缺乏公开可用的数据集。本文通过引入 "MCrack1300 "数据集来填补这些空白,该数据集对砖块、碎砖和裂缝进行了注释,并以实例分割为目标。我们基于最新的视觉大规模模型,即基于提示的 "任何分割模型"(SAM),提出了两种新颖的、可自动执行的方法。我们使用低库自适应(Low-Rank Adaptation,LoRA)对 SAM 的编码器进行了微调。第一种方法将 SAM 编码器直接连接到其他解码器,第二种方法则使用可学习的自生成提示器。我们修改了特征提取器,以便将这些方法与 SAM 编码器无缝集成。这两种方法都优于最先进的模型,所有类别的基准结果提高了约 3%,裂缝的基准结果提高了约 6%。在成功检测的基础上,我们又提出了一种基于单眼的方法,通过 Hough 线性变换将图像自动转换为正投影图。通过结合已知砖块单元的实际尺寸并采用欧氏距离变换,我们准确地估算出了裂缝尺寸,误差小于 10%。总之,我们为砌体裂缝检测和尺寸估算提供了可靠的自动化解决方案,有效提高了砌体结构资产的管理和维护效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信