{"title":"Multiscale Attention Networks for Pavement Defect Detection","authors":"Junde Chen;Yuxin Wen;Yaser Ahangari Nanehkaran;Defu Zhang;Adnan Zeb","doi":"10.1109/TIM.2023.3298391","DOIUrl":null,"url":null,"abstract":"Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning (DL)-based convolution neural networks (CNNs) has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multiscale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder–decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original \n<inline-formula> <tex-math>$3\\,\\, \\times 3$ </tex-math></inline-formula>\n convolution, the multiscale convolution kernels are used in depthwise separable convolution (DSConv) layers of the network. Furthermore, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and interchannel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly available benchmark datasets, i.e., the Crack500 (500 crack images with \n<inline-formula> <tex-math>$2000\\,\\, \\times 1500$ </tex-math></inline-formula>\n pixels) and CFD (118 crack images with \n<inline-formula> <tex-math>$480\\,\\, \\times 320$ </tex-math></inline-formula>\n pixels) datasets. The mean intersection over union (MIoU) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multiscale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with \n<inline-formula> <tex-math>$1024\\,\\, \\times 768$ </tex-math></inline-formula>\n pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at \n<uri>https://github.com/xtu502/pavement-defects</uri>\n.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2023-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10192438/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 2
Abstract
Pavement defects such as cracks, net cracks, and pit slots can cause potential traffic safety problems. The timely detection and identification play a key role in reducing the harm of various pavement defects. Particularly, the recent development in deep learning (DL)-based convolution neural networks (CNNs) has shown competitive performance in image detection and classification. To detect pavement defects automatically and improve effects, a multiscale mobile attention-based network, which we termed MANet, is proposed to perform the detection of pavement defects. The architecture of the encoder–decoder is used in MANet, where the encoder adopts the MobileNet as the backbone network to extract pavement defect features. Instead of the original
$3\,\, \times 3$
convolution, the multiscale convolution kernels are used in depthwise separable convolution (DSConv) layers of the network. Furthermore, the hybrid attention mechanism is separately incorporated into the encoder and decoder modules to infer the significance of spatial points and interchannel relationship features for the input intermediate feature maps. The proposed approach achieves state-of-the-art performance on two publicly available benchmark datasets, i.e., the Crack500 (500 crack images with
$2000\,\, \times 1500$
pixels) and CFD (118 crack images with
$480\,\, \times 320$
pixels) datasets. The mean intersection over union (MIoU) of the proposed approach on these two datasets reaches 0.7219 and 0.7788, respectively. Ablation experiments show that the multiscale convolution and hybrid attention modules can effectively help the model extract high-level feature representations and generate more accurate pavement crack segmentation results. We further test the model on locally collected pavement crack images (131 images with
$1024\,\, \times 768$
pixels) and it achieves a satisfactory result. The proposed approach realizes the MIoU of 0.6514 on the local dataset and outperforms other compared baseline methods. Experimental findings demonstrate the validity and feasibility of the proposed approach and it provides a viable solution for pavement crack detection in practical application scenarios. Our code is available at
https://github.com/xtu502/pavement-defects
.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.