基于金字塔关注的裂纹检测显著性网络

Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li
{"title":"基于金字塔关注的裂纹检测显著性网络","authors":"Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li","doi":"10.1109/CGIP58526.2023.00017","DOIUrl":null,"url":null,"abstract":"Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.","PeriodicalId":286064,"journal":{"name":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Saliency Network with Pyramidal Attention for Crack Detection\",\"authors\":\"Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li\",\"doi\":\"10.1109/CGIP58526.2023.00017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.\",\"PeriodicalId\":286064,\"journal\":{\"name\":\"2023 International Conference on Computer Graphics and Image Processing (CGIP)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Computer Graphics and Image Processing (CGIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CGIP58526.2023.00017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIP58526.2023.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

道路裂缝是一种常见的道路病害,严重危害行车安全。为了解决传统深度神经网络在检测背景和干扰较复杂的裂纹图像时准确率较低的问题,本文提出了一种基于人类视觉认知机制的裂纹检测网络(GLPANet)。构建了裂纹图像特征提取的三个关键模块,即全局对应模型(GCM)、局部对应模型(LCM)和金字塔关注网络(PANet)。具体来说,GCM通过三维(3D)卷积直接融合外部的所有内部特征,LCM还使用三维卷积将多图像关系解耦为多个图像对应的局部对(LP)。PANet通过金字塔注意力机制网络学习裂缝的空间几何形状,以创建裂缝特征之间的关联,在PASCAL VOC数据集中,PANet的平均IoU达到77.92%,比FCN-R101提高11.5%。在破解数据集中,平均绝对误差(MAE)为0.0172,GLPANet优于最先进的竞争对手。GLPANet网络可以提高复杂和干扰背景下裂缝检测的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Saliency Network with Pyramidal Attention for Crack Detection
Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信