Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition

International Conference on Neural Information Processing Pub Date : 2022-12-04 DOI:10.48550/arXiv.2212.01790

Rong Qin, Luwen Huangfu, Devon Hood, James Ma, Shengyue Huang

{"title":"Kernel Inversed Pyramidal Resizing Network for Efficient Pavement Distress Recognition","authors":"Rong Qin, Luwen Huangfu, Devon Hood, James Ma, Shengyue Huang","doi":"10.48550/arXiv.2212.01790","DOIUrl":null,"url":null,"abstract":"Pavement Distress Recognition (PDR) is an important step in pavement inspection and can be powered by image-based automation to expedite the process and reduce labor costs. Pavement images are often in high-resolution with a low ratio of distressed to non-distressed areas. Advanced approaches leverage these properties via dividing images into patches and explore discriminative features in the scale space. However, these approaches usually suffer from information loss during image resizing and low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing, and can be flexibly plugged into the image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid to assist the image classification network for PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs), and conducted an evaluation on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of these CNN models and show that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.","PeriodicalId":281152,"journal":{"name":"International Conference on Neural Information Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Neural Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.01790","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Pavement Distress Recognition (PDR) is an important step in pavement inspection and can be powered by image-based automation to expedite the process and reduce labor costs. Pavement images are often in high-resolution with a low ratio of distressed to non-distressed areas. Advanced approaches leverage these properties via dividing images into patches and explore discriminative features in the scale space. However, these approaches usually suffer from information loss during image resizing and low efficiency due to complex learning frameworks. In this paper, we propose a novel and efficient method for PDR. A light network named the Kernel Inversed Pyramidal Resizing Network (KIPRN) is introduced for image resizing, and can be flexibly plugged into the image classification network as a pre-network to exploit resolution and scale information. In KIPRN, pyramidal convolution and kernel inversed convolution are specifically designed to mine discriminative information across different feature granularities and scales. The mined information is passed along to the resized images to yield an informative image pyramid to assist the image classification network for PDR. We applied our method to three well-known Convolutional Neural Networks (CNNs), and conducted an evaluation on a large-scale pavement image dataset named CQU-BPDD. Extensive results demonstrate that KIPRN can generally improve the pavement distress recognition of these CNN models and show that the simple combination of KIPRN and EfficientNet-B3 significantly outperforms the state-of-the-art patch-based method in both performance and efficiency.

查看原文本刊更多论文

高效路面破损识别的核反金字塔调整网络

路面破损识别(PDR)是路面检测的重要步骤，可以通过基于图像的自动化来加快过程并降低人工成本。路面图像通常是高分辨率的，受损区域与非受损区域的比例很低。先进的方法通过将图像分成小块来利用这些属性，并探索尺度空间中的判别特征。然而，由于学习框架复杂，这些方法在图像调整过程中存在信息丢失和效率低下的问题。本文提出了一种新颖高效的PDR方法。提出了一种用于图像大小调整的轻量级网络——核反金字塔大小调整网络(Kernel inverse Pyramidal Resizing network, KIPRN)，该网络可以灵活地插入到图像分类网络中作为预网络，利用分辨率和尺度信息。在KIPRN中，金字塔卷积和核反卷积被专门设计用于挖掘不同特征粒度和尺度的判别信息。将挖掘的信息传递到调整大小的图像中，生成信息丰富的图像金字塔，以辅助PDR图像分类网络。我们将该方法应用于三种著名的卷积神经网络(cnn)，并对大规模路面图像数据集CQU-BPDD进行了评估。大量的结果表明，KIPRN可以普遍提高这些CNN模型的路面破损识别，并且表明KIPRN和effentnet - b3的简单组合在性能和效率上都明显优于最先进的基于patch的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Neural Information Processing

自引率

0.00%

发文量