A defect detection network for painted wall surfaces based on YOLOv5 enhanced by attention mechanism and bi-directional FPN

IF 3.1 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Hongyang Zhang, Shuai Ji, Yingxin Ye, Hepeng Ni, Xiaoming Gao, Buyao Liu
{"title":"A defect detection network for painted wall surfaces based on YOLOv5 enhanced by attention mechanism and bi-directional FPN","authors":"Hongyang Zhang, Shuai Ji, Yingxin Ye, Hepeng Ni, Xiaoming Gao, Buyao Liu","doi":"10.1007/s00500-024-09799-5","DOIUrl":null,"url":null,"abstract":"<p>Automatic detection of defects on painted wall surfaces (DPWSs) based on machine vision is meaningful for reducing manpower consumption and shorting lead time, which is one of the critical components of intelligent construction. Conventional detection methods suffer from some challenges due to the multi-scale defects and unstructured detection environment. In this study, a detection network for DPWSs is developed based on the enhanced You Only Look Once version 5 (YOLOv5). First, the convolutional block attention module (CBAM) is inserted into the backbone of YOLOv5 to boost the feature extraction and suppress noise, which can sufficiently extract the features of the defects with blurry edges. Then, to improve the adaptability for multi-scale defects and reduce the model size, the Bi-directional Feature Pyramid Network (BiFPN) is employed in the neck of YOLOv5 to enhance the feature fusion, where the multi-scale objects can be fully captured. Finally, the decoupled head is proposed to replace the original convolution layer in the You Only Look Once (YOLO) head, which separates the classification and localization tasks to improve detection speed and robustness. Since there is no publicly available data set, a data set of DPWSs is constructed, and a series of comparative experiments are conducted. The results show that the detection accuracy is improved by 15.6% and the model size is reduced by 30.8% compared with YOLOv5. Meanwhile, the proposed network has better adaptability to DPWSs with higher detection accuracy and smaller model sizes compared with other advanced methods. As to the general applicability aspect of the model, the proposed model holds significant academic and practical implications in the realms of intelligent construction. Besides the model’s primary application domain of construction quality control, it can also be applied to defect detection in other scenarios that have multi-scale defects and unstructured environments. This versatility benefits a wide spectrum of construction projects.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"171 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09799-5","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Automatic detection of defects on painted wall surfaces (DPWSs) based on machine vision is meaningful for reducing manpower consumption and shorting lead time, which is one of the critical components of intelligent construction. Conventional detection methods suffer from some challenges due to the multi-scale defects and unstructured detection environment. In this study, a detection network for DPWSs is developed based on the enhanced You Only Look Once version 5 (YOLOv5). First, the convolutional block attention module (CBAM) is inserted into the backbone of YOLOv5 to boost the feature extraction and suppress noise, which can sufficiently extract the features of the defects with blurry edges. Then, to improve the adaptability for multi-scale defects and reduce the model size, the Bi-directional Feature Pyramid Network (BiFPN) is employed in the neck of YOLOv5 to enhance the feature fusion, where the multi-scale objects can be fully captured. Finally, the decoupled head is proposed to replace the original convolution layer in the You Only Look Once (YOLO) head, which separates the classification and localization tasks to improve detection speed and robustness. Since there is no publicly available data set, a data set of DPWSs is constructed, and a series of comparative experiments are conducted. The results show that the detection accuracy is improved by 15.6% and the model size is reduced by 30.8% compared with YOLOv5. Meanwhile, the proposed network has better adaptability to DPWSs with higher detection accuracy and smaller model sizes compared with other advanced methods. As to the general applicability aspect of the model, the proposed model holds significant academic and practical implications in the realms of intelligent construction. Besides the model’s primary application domain of construction quality control, it can also be applied to defect detection in other scenarios that have multi-scale defects and unstructured environments. This versatility benefits a wide spectrum of construction projects.

Abstract Image

基于 YOLOv5 的油漆墙面缺陷检测网络,通过注意力机制和双向 FPN 得到增强
基于机器视觉的涂漆墙面(DPWS)缺陷自动检测对于减少人力消耗和缩短交付周期意义重大,是智能建筑的关键组成部分之一。由于存在多尺度缺陷和非结构化检测环境,传统检测方法面临一些挑战。本研究基于增强型 You Only Look Once version 5(YOLOv5)开发了一种用于 DPWS 的检测网络。首先,在 YOLOv5 的骨干网中插入卷积块注意模块(CBAM),以增强特征提取和抑制噪声,从而充分提取边缘模糊的缺陷特征。然后,为了提高对多尺度缺陷的适应性并减小模型大小,在 YOLOv5 的颈部采用了双向特征金字塔网络(BiFPN)来增强特征融合,从而可以充分捕捉多尺度对象。最后,我们提出了解耦头,以取代 "只看一次"(YOLO)头中的原始卷积层,从而将分类和定位任务分开,提高检测速度和鲁棒性。由于没有公开的数据集,因此构建了一个 DPWS 数据集,并进行了一系列对比实验。结果表明,与 YOLOv5 相比,检测精度提高了 15.6%,模型大小减少了 30.8%。同时,与其他先进方法相比,所提出的网络对 DPWS 具有更好的适应性,检测精度更高,模型体积更小。在模型的普遍适用性方面,所提出的模型在智能建筑领域具有重要的学术和实践意义。除了该模型的主要应用领域--建筑质量控制之外,它还可以应用于其他具有多尺度缺陷和非结构化环境的场景中的缺陷检测。这种多功能性可惠及广泛的建筑项目。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soft Computing
Soft Computing 工程技术-计算机:跨学科应用
CiteScore
8.10
自引率
9.80%
发文量
927
审稿时长
7.3 months
期刊介绍: Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems. Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信