用于裂纹检测的通用合成数据生成框架

IF 6.4 1区 工程技术 Q1 ENGINEERING, CIVIL
Jiawei Xie , Baolin Chen , Anna Giacomini , Hongyu Guo , Umair Iqbal , Jinsong Huang
{"title":"用于裂纹检测的通用合成数据生成框架","authors":"Jiawei Xie ,&nbsp;Baolin Chen ,&nbsp;Anna Giacomini ,&nbsp;Hongyu Guo ,&nbsp;Umair Iqbal ,&nbsp;Jinsong Huang","doi":"10.1016/j.engstruct.2025.121428","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has shown remarkable promise in automating crack detection for civil infrastructure inspection, yet collecting and annotating large, high-quality datasets remains a major bottleneck. To address this challenge, we propose CrackGen—a versatile, fully controllable framework for synthetic data generation. By integrating fine-tuned Stable Diffusion models with ControlNet, CrackGen synthesizes unlimited crack images that closely mimic real-world variability in material textures, crack patterns, and morphological features. A pivotal innovation of our approach is that the same control conditions used to steer image creation naturally serve as ground truth labels, eliminating the need for labor-intensive post-generation annotation. This feature is especially critical for large-scale dataset construction. Through fine-tuning on extremely small training sets, CrackGen inherits the learned knowledge of Stable Diffusion, enabling users to adjust crack widths, orientations, and background materials through simple prompt adjustments. Our experiments systematically analyze how different parameter settings affect the fidelity and diversity of generated images. Additionally, we propose novel crack sketching methods, including a Rapidly-exploring Random Trees (RRT) algorithm that emulates real-world crack propagation paths to produce complex, fractal-like crack networks. Extensive experiments demonstrate that models trained purely on CrackGen-generated data achieve consistent, high-quality results on real-world crack detection tasks, validating the synthetic data’s robustness and practicality. This robustness was further confirmed through cross-domain validation on rock surfaces. The source code and datasets are freely available from GitHub repository (<span><span>https://github.com/GEO-ATLAS/CrackGen</span><svg><path></path></svg></span>).</div></div>","PeriodicalId":11763,"journal":{"name":"Engineering Structures","volume":"344 ","pages":"Article 121428"},"PeriodicalIF":6.4000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A versatile synthetic data generation framework for crack detection\",\"authors\":\"Jiawei Xie ,&nbsp;Baolin Chen ,&nbsp;Anna Giacomini ,&nbsp;Hongyu Guo ,&nbsp;Umair Iqbal ,&nbsp;Jinsong Huang\",\"doi\":\"10.1016/j.engstruct.2025.121428\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning has shown remarkable promise in automating crack detection for civil infrastructure inspection, yet collecting and annotating large, high-quality datasets remains a major bottleneck. To address this challenge, we propose CrackGen—a versatile, fully controllable framework for synthetic data generation. By integrating fine-tuned Stable Diffusion models with ControlNet, CrackGen synthesizes unlimited crack images that closely mimic real-world variability in material textures, crack patterns, and morphological features. A pivotal innovation of our approach is that the same control conditions used to steer image creation naturally serve as ground truth labels, eliminating the need for labor-intensive post-generation annotation. This feature is especially critical for large-scale dataset construction. Through fine-tuning on extremely small training sets, CrackGen inherits the learned knowledge of Stable Diffusion, enabling users to adjust crack widths, orientations, and background materials through simple prompt adjustments. Our experiments systematically analyze how different parameter settings affect the fidelity and diversity of generated images. Additionally, we propose novel crack sketching methods, including a Rapidly-exploring Random Trees (RRT) algorithm that emulates real-world crack propagation paths to produce complex, fractal-like crack networks. Extensive experiments demonstrate that models trained purely on CrackGen-generated data achieve consistent, high-quality results on real-world crack detection tasks, validating the synthetic data’s robustness and practicality. This robustness was further confirmed through cross-domain validation on rock surfaces. The source code and datasets are freely available from GitHub repository (<span><span>https://github.com/GEO-ATLAS/CrackGen</span><svg><path></path></svg></span>).</div></div>\",\"PeriodicalId\":11763,\"journal\":{\"name\":\"Engineering Structures\",\"volume\":\"344 \",\"pages\":\"Article 121428\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Structures\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S014102962501819X\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Structures","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S014102962501819X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

深度学习在民用基础设施检测的自动化裂缝检测方面显示出巨大的前景,但收集和注释大型高质量数据集仍然是一个主要瓶颈。为了应对这一挑战,我们提出了crackgen——一个通用的、完全可控的合成数据生成框架。通过将微调的稳定扩散模型与ControlNet相结合,CrackGen合成了无限的裂纹图像,这些图像非常接近地模拟了材料纹理、裂纹模式和形态特征的真实变化。我们的方法的一个关键创新是,用于引导图像创建的相同控制条件自然地用作基础真理标签,消除了对劳动密集型后生成注释的需要。这个特性对于大规模数据集的构建尤其重要。通过对极小的训练集进行微调,CrackGen继承了stability Diffusion的知识,用户可以通过简单的提示调整来调整裂缝宽度、方向和背景材料。我们的实验系统地分析了不同的参数设置如何影响生成图像的保真度和多样性。此外,我们提出了新的裂缝草图绘制方法,包括快速探索随机树(RRT)算法,该算法模拟现实世界的裂缝传播路径,以产生复杂的分形裂缝网络。大量的实验表明,纯粹使用crackgen生成的数据训练的模型在实际的裂纹检测任务中获得了一致的、高质量的结果,验证了合成数据的鲁棒性和实用性。通过岩石表面的跨域验证进一步证实了这种鲁棒性。源代码和数据集可以从GitHub存储库(https://github.com/GEO-ATLAS/CrackGen)免费获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A versatile synthetic data generation framework for crack detection
Deep learning has shown remarkable promise in automating crack detection for civil infrastructure inspection, yet collecting and annotating large, high-quality datasets remains a major bottleneck. To address this challenge, we propose CrackGen—a versatile, fully controllable framework for synthetic data generation. By integrating fine-tuned Stable Diffusion models with ControlNet, CrackGen synthesizes unlimited crack images that closely mimic real-world variability in material textures, crack patterns, and morphological features. A pivotal innovation of our approach is that the same control conditions used to steer image creation naturally serve as ground truth labels, eliminating the need for labor-intensive post-generation annotation. This feature is especially critical for large-scale dataset construction. Through fine-tuning on extremely small training sets, CrackGen inherits the learned knowledge of Stable Diffusion, enabling users to adjust crack widths, orientations, and background materials through simple prompt adjustments. Our experiments systematically analyze how different parameter settings affect the fidelity and diversity of generated images. Additionally, we propose novel crack sketching methods, including a Rapidly-exploring Random Trees (RRT) algorithm that emulates real-world crack propagation paths to produce complex, fractal-like crack networks. Extensive experiments demonstrate that models trained purely on CrackGen-generated data achieve consistent, high-quality results on real-world crack detection tasks, validating the synthetic data’s robustness and practicality. This robustness was further confirmed through cross-domain validation on rock surfaces. The source code and datasets are freely available from GitHub repository (https://github.com/GEO-ATLAS/CrackGen).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Engineering Structures
Engineering Structures 工程技术-工程:土木
CiteScore
10.20
自引率
14.50%
发文量
1385
审稿时长
67 days
期刊介绍: Engineering Structures provides a forum for a broad blend of scientific and technical papers to reflect the evolving needs of the structural engineering and structural mechanics communities. Particularly welcome are contributions dealing with applications of structural engineering and mechanics principles in all areas of technology. The journal aspires to a broad and integrated coverage of the effects of dynamic loadings and of the modelling techniques whereby the structural response to these loadings may be computed. The scope of Engineering Structures encompasses, but is not restricted to, the following areas: infrastructure engineering; earthquake engineering; structure-fluid-soil interaction; wind engineering; fire engineering; blast engineering; structural reliability/stability; life assessment/integrity; structural health monitoring; multi-hazard engineering; structural dynamics; optimization; expert systems; experimental modelling; performance-based design; multiscale analysis; value engineering. Topics of interest include: tall buildings; innovative structures; environmentally responsive structures; bridges; stadiums; commercial and public buildings; transmission towers; television and telecommunication masts; foldable structures; cooling towers; plates and shells; suspension structures; protective structures; smart structures; nuclear reactors; dams; pressure vessels; pipelines; tunnels. Engineering Structures also publishes review articles, short communications and discussions, book reviews, and a diary on international events related to any aspect of structural engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信