{"title":"Road-pavement classification by artificial neural network model based on tire-pavement noise and road-surface image","authors":"","doi":"10.1016/j.apacoust.2024.110194","DOIUrl":null,"url":null,"abstract":"<div><p>This study focuses on an artificial neural network (ANN) model for classifying pavement types using acoustic and image data. While conventional studies often use road-surface images for pavement classification, they face challenges with image quality degradation owing to external factors, such as sunlight angle, shadows, and lighting. Therefore, in this study, tire-pavement noise, which has different noise characteristics depending on the material and surface treatment, is used independently and in conjunction with image data for ANN training. To construct the training dataset, tire-pavement noise, and road-surface images are collected from 11 highway sampling sites in South Korea. Two simultaneous measurements are used: the tire-pavement noise is collected using the On-board sound intensity (OBSI) method, and the camera captures the road-surface images. 1/3 octave SIL, spectrum, MFCC, GLCM, and HOG are extracted from the raw data, and the ANN models are trained by these features. Using the spectrum as an input feature for the ANN yields a classification accuracy of 95.18%. However, the total number of parameters in the ANN is double that of the other models. To reduce the ANN size, 1/3 octave band SIL is used for training, and the model size is halved. However, the accuracy decreases by 13.47 percentage points. To overcome this significant decrease, the 1/3 octave bands SIL and image features were used to train ANN, simultaneously. This approach increases the accuracy by 93.85%. By training the ANN using MFCC, which is commonly used as an acoustic feature in other machine learning studies, the highest classification accuracy of 96.84% is achieved. Additionally, MFCC models are affected by the number of coefficients and the signal length. To include the dominant frequency of tire-pavement noise, more than 13 coefficients are used, a number generally known to be suitable for speech recognition. Increasing the number of coefficients from 13 to 40 improves accuracy by 1.17 percentage points. The interval for slicing raw WAV files is reduced to increase the training data and classify the pavement using shorter signals without statistically significant accuracy loss. Although accuracy does not decrease until the signal lengths reach 0.5 seconds, it rapidly decreases when the signal lengths become shorter than 0.4 seconds.</p></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X24003451","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study focuses on an artificial neural network (ANN) model for classifying pavement types using acoustic and image data. While conventional studies often use road-surface images for pavement classification, they face challenges with image quality degradation owing to external factors, such as sunlight angle, shadows, and lighting. Therefore, in this study, tire-pavement noise, which has different noise characteristics depending on the material and surface treatment, is used independently and in conjunction with image data for ANN training. To construct the training dataset, tire-pavement noise, and road-surface images are collected from 11 highway sampling sites in South Korea. Two simultaneous measurements are used: the tire-pavement noise is collected using the On-board sound intensity (OBSI) method, and the camera captures the road-surface images. 1/3 octave SIL, spectrum, MFCC, GLCM, and HOG are extracted from the raw data, and the ANN models are trained by these features. Using the spectrum as an input feature for the ANN yields a classification accuracy of 95.18%. However, the total number of parameters in the ANN is double that of the other models. To reduce the ANN size, 1/3 octave band SIL is used for training, and the model size is halved. However, the accuracy decreases by 13.47 percentage points. To overcome this significant decrease, the 1/3 octave bands SIL and image features were used to train ANN, simultaneously. This approach increases the accuracy by 93.85%. By training the ANN using MFCC, which is commonly used as an acoustic feature in other machine learning studies, the highest classification accuracy of 96.84% is achieved. Additionally, MFCC models are affected by the number of coefficients and the signal length. To include the dominant frequency of tire-pavement noise, more than 13 coefficients are used, a number generally known to be suitable for speech recognition. Increasing the number of coefficients from 13 to 40 improves accuracy by 1.17 percentage points. The interval for slicing raw WAV files is reduced to increase the training data and classify the pavement using shorter signals without statistically significant accuracy loss. Although accuracy does not decrease until the signal lengths reach 0.5 seconds, it rapidly decreases when the signal lengths become shorter than 0.4 seconds.
期刊介绍:
Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense.
Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems.
Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.