Crack-SegNet: Surface Crack Detection in Complex Background Using Encoder-Decoder Architecture

Proceedings of the 2021 4th International Conference on Sensors, Signal and Image Processing Pub Date : 2021-10-15 DOI:10.1145/3502814.3502817

Rong Ran, Xin-yu Xu, S. Qiu, Xiaopeng Cui, Fuhui Wu

{"title":"Crack-SegNet: Surface Crack Detection in Complex Background Using Encoder-Decoder Architecture","authors":"Rong Ran, Xin-yu Xu, S. Qiu, Xiaopeng Cui, Fuhui Wu","doi":"10.1145/3502814.3502817","DOIUrl":null,"url":null,"abstract":"Timely and accurate detection of the initiation and expansion of crack is of great significance for improving safe operation of civil infrastructures. Image-based visual surface inspection has been an indispensable way for long-time infrastructure monitoring. However, existing crack detection methods generally suffer from the interference of complex background, leading to obvious performance drops. To tackle this, an improved encoder-decoder architecture based on SegNet is proposed in this paper, namely crack-SegNet. The encoder network hierarchically learns visual features from the original image, and the decoder network gradually up-samples and maps the encoded features to the input size for the pixel-level classification. In order to enhance the feature capacity of cracks in complex background, a channel attention mechanism is integrated into the encoder, as well as a spatial attention module in the decoder to improve the feature representation of cracks. Meanwhile, a spatial pyramid pooling is also attached to the last convolutional layer of the encoder to capture crack with different scales. To better validate the proposed method, a challenging metal surface crack dataset with much more complex background is collected. Experimental results on the datasets show that the proposed crack-SegNet outperforms other state-of-the-art crack detection methods, especially in complex background.","PeriodicalId":115172,"journal":{"name":"Proceedings of the 2021 4th International Conference on Sensors, Signal and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 4th International Conference on Sensors, Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3502814.3502817","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Timely and accurate detection of the initiation and expansion of crack is of great significance for improving safe operation of civil infrastructures. Image-based visual surface inspection has been an indispensable way for long-time infrastructure monitoring. However, existing crack detection methods generally suffer from the interference of complex background, leading to obvious performance drops. To tackle this, an improved encoder-decoder architecture based on SegNet is proposed in this paper, namely crack-SegNet. The encoder network hierarchically learns visual features from the original image, and the decoder network gradually up-samples and maps the encoded features to the input size for the pixel-level classification. In order to enhance the feature capacity of cracks in complex background, a channel attention mechanism is integrated into the encoder, as well as a spatial attention module in the decoder to improve the feature representation of cracks. Meanwhile, a spatial pyramid pooling is also attached to the last convolutional layer of the encoder to capture crack with different scales. To better validate the proposed method, a challenging metal surface crack dataset with much more complex background is collected. Experimental results on the datasets show that the proposed crack-SegNet outperforms other state-of-the-art crack detection methods, especially in complex background.

查看原文本刊更多论文

裂纹-分段网:基于编码器-解码器结构的复杂背景表面裂纹检测

及时准确地检测裂缝的起裂和扩展对提高民用基础设施的安全运行具有重要意义。基于图像的目视表面检测已成为基础设施长期监测不可缺少的手段。然而，现有的裂纹检测方法普遍受到复杂背景的干扰，导致性能下降明显。为了解决这个问题，本文提出了一种改进的基于SegNet的编码器-解码器结构，即crack-SegNet。编码器网络从原始图像中分层学习视觉特征，解码器网络逐渐上采样并将编码特征映射到输入大小以进行像素级分类。为了增强裂缝在复杂背景下的特征容量，在编码器中加入通道注意机制，在解码器中加入空间注意模块，提高裂缝的特征表示能力。同时，在编码器的最后一层卷积层附加空间金字塔池来捕获不同尺度的裂纹。为了更好地验证提出的方法，收集了具有更复杂背景的具有挑战性的金属表面裂纹数据集。在数据集上的实验结果表明，所提出的裂缝隔离网优于其他最先进的裂缝检测方法，特别是在复杂背景下。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2021 4th International Conference on Sensors, Signal and Image Processing

自引率

0.00%

发文量