Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li
{"title":"Saliency Network with Pyramidal Attention for Crack Detection","authors":"Wenhao Guo, Xing Zhang, Fanyi Meng, Yi Li, Tian Lin, Dejin Zhang, Qingquan Li","doi":"10.1109/CGIP58526.2023.00017","DOIUrl":null,"url":null,"abstract":"Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.","PeriodicalId":286064,"journal":{"name":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Computer Graphics and Image Processing (CGIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CGIP58526.2023.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Road crack is a common road disease and can endanger the safety of vehicular traffic. To solve the problem of the low accuracy of traditional deep neural networks in detecting crack images with more complex background and interference, this paper proposes a crack detection network (GLPANet) based on human visual cognitive mechanism. We construct three key modules for extracting crack image features extraction, namely Global Correspondence Modelling (GCM), Local Correspondence Modelling (LCM), and Pyramidal Attention Network (PANet). Specifically, GCM directly fuses all internal features outside through three-dimensional (3D) convolution, LCM also uses 3D convolution to decouple multi-image relationships into multiple local pairs (LP) of image correspondences. PANet learns the spatial geometry of cracks through a network of pyramidal attention mechanisms to create associations between crack features, in the PASCAL VOC dataset, PANet achieved a mean IoU of 77.92%, an 11.5% improvement over FCN-R101. In the crack dataset, with the mean absolute error (MAE) of 0.0172, GLPANet outperforms the state-of-the-art competitors. GLPANet network can improve the accuracy of fracture detection in complex and disturbed backgrounds.