Saliency Network with Pyramidal Attention for Crack Detection

2023 International Conference on Computer Graphics and Image Processing (CGIP) Pub Date : 2023-01-01 DOI:10.1109/CGIP58526.2023.00017

Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li

{"title":"Saliency Network with Pyramidal Attention for Crack Detection","authors":"Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li","doi":"10.1109/CGIP58526.2023.00017","DOIUrl":null,"url":null,"abstract":"Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.","PeriodicalId":286064,"journal":{"name":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIP58526.2023.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.

查看原文本刊更多论文

基于金字塔关注的裂纹检测显著性网络

道路裂缝是一种常见的道路病害，严重危害行车安全。为了解决传统深度神经网络在检测背景和干扰较复杂的裂纹图像时准确率较低的问题，本文提出了一种基于人类视觉认知机制的裂纹检测网络(GLPANet)。构建了裂纹图像特征提取的三个关键模块，即全局对应模型(GCM)、局部对应模型(LCM)和金字塔关注网络(PANet)。具体来说，GCM通过三维(3D)卷积直接融合外部的所有内部特征，LCM还使用三维卷积将多图像关系解耦为多个图像对应的局部对(LP)。PANet通过金字塔注意力机制网络学习裂缝的空间几何形状，以创建裂缝特征之间的关联，在PASCAL VOC数据集中，PANet的平均IoU达到77.92%，比FCN-R101提高11.5%。在破解数据集中，平均绝对误差(MAE)为0.0172,GLPANet优于最先进的竞争对手。GLPANet网络可以提高复杂和干扰背景下裂缝检测的精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Computer Graphics and Image Processing (CGIP)

自引率

0.00%

发文量