{"title":"使用 MFCC 多项式混合特征的车辆音频预警系统自动评估方法","authors":"Zuoliang Wang, Qimin Xu, Zehua Chen","doi":"10.1177/09544070241227089","DOIUrl":null,"url":null,"abstract":"In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.","PeriodicalId":509770,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature\",\"authors\":\"Zuoliang Wang, Qimin Xu, Zehua Chen\",\"doi\":\"10.1177/09544070241227089\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.\",\"PeriodicalId\":509770,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/09544070241227089\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/09544070241227089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature
In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.