Automatic real-time crack detection using lightweight deep learning models

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
{"title":"Automatic real-time crack detection using lightweight deep learning models","authors":"","doi":"10.1016/j.engappai.2024.109340","DOIUrl":null,"url":null,"abstract":"<div><p>Crack detection methods using deep learning models such as convolutional neural network (CNN) and the newly developed vision transformer (ViT) are expanding. However, there is still a lack of comparative evaluation of these models in real-time crack detection. In this paper, a total of 14 lightweight deep learning models, comprising seven CNN models, five ViT models and two hybrid models, are trained to build deep learning-based crack detection methods. Comprehensive experiments are conducted on the publicly available DeepCrack dataset, including accuracy, inference time, robustness and transfer learning experiments to compare the effectiveness and real-time performance of models. In terms of accuracy metrics and robustness performance, the ViT model using SegFormer segmentation method with MiT-B1 as backbone has the best performance, and in terms of the model inference time, the ViT models using TopFormer segmentation method demonstrate the fastest performance. If both the accuracy and inference time are considered, TopFormer with its small version of the backbone network has relatively better real-time performance, while the ViT model using SegFormer segmentation method with MiT-B0 as backbone and the CNN model using the fully convolutional network (FCN) segmentation method with HRNetV2-W18-Small as backbone have higher mean intersection over union (mIoU) values on computers and mobile devices, respectively. We also find that pre-training on a dataset that is more relevant to the target application scenario rather than on the widely used ImageNet gives better results for deep learning models. This study provides a reference for engineers to make choices about lightweight deep learning models.</p></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624014982","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Crack detection methods using deep learning models such as convolutional neural network (CNN) and the newly developed vision transformer (ViT) are expanding. However, there is still a lack of comparative evaluation of these models in real-time crack detection. In this paper, a total of 14 lightweight deep learning models, comprising seven CNN models, five ViT models and two hybrid models, are trained to build deep learning-based crack detection methods. Comprehensive experiments are conducted on the publicly available DeepCrack dataset, including accuracy, inference time, robustness and transfer learning experiments to compare the effectiveness and real-time performance of models. In terms of accuracy metrics and robustness performance, the ViT model using SegFormer segmentation method with MiT-B1 as backbone has the best performance, and in terms of the model inference time, the ViT models using TopFormer segmentation method demonstrate the fastest performance. If both the accuracy and inference time are considered, TopFormer with its small version of the backbone network has relatively better real-time performance, while the ViT model using SegFormer segmentation method with MiT-B0 as backbone and the CNN model using the fully convolutional network (FCN) segmentation method with HRNetV2-W18-Small as backbone have higher mean intersection over union (mIoU) values on computers and mobile devices, respectively. We also find that pre-training on a dataset that is more relevant to the target application scenario rather than on the widely used ImageNet gives better results for deep learning models. This study provides a reference for engineers to make choices about lightweight deep learning models.

使用轻量级深度学习模型自动实时检测裂纹
使用卷积神经网络(CNN)和新开发的视觉转换器(ViT)等深度学习模型的裂缝检测方法正在不断扩展。然而,这些模型在实时裂缝检测方面仍然缺乏比较评估。本文共训练了 14 个轻量级深度学习模型,包括 7 个 CNN 模型、5 个 ViT 模型和 2 个混合模型,以构建基于深度学习的裂纹检测方法。本文在公开的 DeepCrack 数据集上进行了全面的实验,包括准确性、推理时间、鲁棒性和迁移学习实验,以比较模型的有效性和实时性。在准确度指标和鲁棒性表现方面,以 MiT-B1 为骨干、使用 SegFormer 分割方法的 ViT 模型表现最佳;在模型推理时间方面,使用 TopFormer 分割方法的 ViT 模型表现最快。如果同时考虑准确性和推理时间,采用小型骨干网络的 TopFormer 的实时性相对更好,而在计算机和移动设备上,采用以 MiT-B0 为骨干的 SegFormer 分割方法的 ViT 模型和采用以 HRNetV2-W18-Small 为骨干的全卷积网络(FCN)分割方法的 CNN 模型的平均交集大于联合(mIoU)值分别更高。我们还发现,在与目标应用场景更相关的数据集上进行预训练,而不是在广泛使用的 ImageNet 上进行预训练,能为深度学习模型带来更好的结果。这项研究为工程师选择轻量级深度学习模型提供了参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信