桥梁实际检测中复杂混凝土裂缝语义分割的轻量级深度学习模型

Proceedings of the 13th International Workshop on Structural Health Monitoring Pub Date : 2022-03-15 DOI:10.12783/shm2021/36273

Yang Xu, Yunlei Fan, Weidong Qiao, Hui Li

{"title":"桥梁实际检测中复杂混凝土裂缝语义分割的轻量级深度学习模型","authors":"Yang Xu, Yunlei Fan, Weidong Qiao, Hui Li","doi":"10.12783/shm2021/36273","DOIUrl":null,"url":null,"abstract":"Recently, extensive studies have been performed for crack detection and segmentation using deep learning and computer vision techniques to accomplish autonomous bridge inspection. These deep network models are frequently trained with a large volume of parameters to ensure good performance. However, the robust applications under real-world situations of actual bridge inspection still face significant challenges. For example, false-positive recognitions of complex background disturbances excluded in the training sets are inevitable to exist. Besides, the real-time requirement for deploying large-volume deep networks in edge computing equipment is still challenging to achieve. This study establishes a lightweight semantic segmentation model for complex concrete crack segmentation in actual bridge inspection. First, the DeepLabv3+ model is adopted as the baseline, and the backbone module is replaced by MobileNetV2 instead of ResNet101. Second, the depthwise separable convolution, atrous convolution pyramid, and inverted residual modules are utilized to reduce convolutional parameters, expand receptive fields, and alleviate gradient vanishing, respectively. Third, the dataset is enhanced with negative disturbance examples, including straight-line-like structural edges and exposed rebars, to improve the model performance against false positives without additional labeling workload. Original images with different resolutions are first collected from actual bridges, and negative samples are further added to the dataset. A total of 4303 patches in 512 × 512 are generated by a sliding window, where 3443, 430, and 430 are randomly selected for training, validation, and test. Ablation experiments demonstrate the necessity and effectiveness of using MobileNetV2 instead of ResNet101 as the backbone and adding negative examples into the dataset. The results show that the mean intersection-over-union (mIoU) for crack segmentation in various real-world scenarios reaches 0.759. The recognition rate of false positives for complex background disturbances is effectually suppressed by introducing straight-line-like structural edges and exposed rebars into the dataset. Furthermore, the average time cost gains a significant reduction of 35.1% using the established lightweight crack segmentation model with only a slight drop on IoU of 0.017.","PeriodicalId":180083,"journal":{"name":"Proceedings of the 13th International Workshop on Structural Health Monitoring","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LIGHTWEIGHT DEEP LEARNING MODEL OF SEMANTIC SEGMENTATION FOR COMPLEX CONCRETE CRACKS IN ACTUAL BRIDGE INSPECTION\",\"authors\":\"Yang Xu, Yunlei Fan, Weidong Qiao, Hui Li\",\"doi\":\"10.12783/shm2021/36273\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, extensive studies have been performed for crack detection and segmentation using deep learning and computer vision techniques to accomplish autonomous bridge inspection. These deep network models are frequently trained with a large volume of parameters to ensure good performance. However, the robust applications under real-world situations of actual bridge inspection still face significant challenges. For example, false-positive recognitions of complex background disturbances excluded in the training sets are inevitable to exist. Besides, the real-time requirement for deploying large-volume deep networks in edge computing equipment is still challenging to achieve. This study establishes a lightweight semantic segmentation model for complex concrete crack segmentation in actual bridge inspection. First, the DeepLabv3+ model is adopted as the baseline, and the backbone module is replaced by MobileNetV2 instead of ResNet101. Second, the depthwise separable convolution, atrous convolution pyramid, and inverted residual modules are utilized to reduce convolutional parameters, expand receptive fields, and alleviate gradient vanishing, respectively. Third, the dataset is enhanced with negative disturbance examples, including straight-line-like structural edges and exposed rebars, to improve the model performance against false positives without additional labeling workload. Original images with different resolutions are first collected from actual bridges, and negative samples are further added to the dataset. A total of 4303 patches in 512 × 512 are generated by a sliding window, where 3443, 430, and 430 are randomly selected for training, validation, and test. Ablation experiments demonstrate the necessity and effectiveness of using MobileNetV2 instead of ResNet101 as the backbone and adding negative examples into the dataset. The results show that the mean intersection-over-union (mIoU) for crack segmentation in various real-world scenarios reaches 0.759. The recognition rate of false positives for complex background disturbances is effectually suppressed by introducing straight-line-like structural edges and exposed rebars into the dataset. Furthermore, the average time cost gains a significant reduction of 35.1% using the established lightweight crack segmentation model with only a slight drop on IoU of 0.017.\",\"PeriodicalId\":180083,\"journal\":{\"name\":\"Proceedings of the 13th International Workshop on Structural Health Monitoring\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 13th International Workshop on Structural Health Monitoring\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12783/shm2021/36273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th International Workshop on Structural Health Monitoring","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12783/shm2021/36273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

最近，使用深度学习和计算机视觉技术进行裂缝检测和分割进行了广泛的研究，以完成自动桥梁检测。这些深度网络模型经常使用大量参数进行训练，以确保良好的性能。然而，实际桥梁检测在实际情况下的鲁棒应用仍然面临着重大挑战。例如，在训练集中排除复杂背景干扰的假阳性识别是不可避免的。此外，在边缘计算设备中部署大容量深度网络的实时性要求仍然难以实现。本研究建立了一种轻量级的语义分割模型，用于实际桥梁检测中复杂混凝土裂缝的分割。首先，采用DeepLabv3+模型作为基线，将骨干模块替换为MobileNetV2而不是ResNet101。其次，利用深度可分卷积、亚光卷积金字塔和倒残差模块分别减小卷积参数、扩大接收场和缓解梯度消失。第三，使用负扰动示例增强数据集，包括直线状结构边缘和暴露的钢筋，以提高模型抗误报的性能，而无需额外的标记工作量。首先从实际桥梁中采集不同分辨率的原始图像，并将负样本进一步添加到数据集中。滑动窗口共生成512 × 512的4303个patch，其中随机抽取3443个、430个、430个进行训练、验证和测试。消融实验证明了使用MobileNetV2代替ResNet101作为主干并在数据集中添加负例的必要性和有效性。结果表明，在各种真实场景下，裂缝分割的平均相交-过并度(mIoU)达到0.759。通过在数据集中引入类似直线的结构边缘和暴露的钢筋，有效地抑制了复杂背景干扰的误报识别率。此外，采用所建立的轻量化裂缝分割模型，平均时间成本显著降低35.1%，IoU仅略有下降，为0.017。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LIGHTWEIGHT DEEP LEARNING MODEL OF SEMANTIC SEGMENTATION FOR COMPLEX CONCRETE CRACKS IN ACTUAL BRIDGE INSPECTION

Recently, extensive studies have been performed for crack detection and segmentation using deep learning and computer vision techniques to accomplish autonomous bridge inspection. These deep network models are frequently trained with a large volume of parameters to ensure good performance. However, the robust applications under real-world situations of actual bridge inspection still face significant challenges. For example, false-positive recognitions of complex background disturbances excluded in the training sets are inevitable to exist. Besides, the real-time requirement for deploying large-volume deep networks in edge computing equipment is still challenging to achieve. This study establishes a lightweight semantic segmentation model for complex concrete crack segmentation in actual bridge inspection. First, the DeepLabv3+ model is adopted as the baseline, and the backbone module is replaced by MobileNetV2 instead of ResNet101. Second, the depthwise separable convolution, atrous convolution pyramid, and inverted residual modules are utilized to reduce convolutional parameters, expand receptive fields, and alleviate gradient vanishing, respectively. Third, the dataset is enhanced with negative disturbance examples, including straight-line-like structural edges and exposed rebars, to improve the model performance against false positives without additional labeling workload. Original images with different resolutions are first collected from actual bridges, and negative samples are further added to the dataset. A total of 4303 patches in 512 × 512 are generated by a sliding window, where 3443, 430, and 430 are randomly selected for training, validation, and test. Ablation experiments demonstrate the necessity and effectiveness of using MobileNetV2 instead of ResNet101 as the backbone and adding negative examples into the dataset. The results show that the mean intersection-over-union (mIoU) for crack segmentation in various real-world scenarios reaches 0.759. The recognition rate of false positives for complex background disturbances is effectually suppressed by introducing straight-line-like structural edges and exposed rebars into the dataset. Furthermore, the average time cost gains a significant reduction of 35.1% using the established lightweight crack segmentation model with only a slight drop on IoU of 0.017.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 13th International Workshop on Structural Health Monitoring

自引率

0.00%

发文量