Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature

Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering Pub Date : 2024-01-22 DOI:10.1177/09544070241227089

Zuoliang Wang, Qimin Xu, Zehua Chen

{"title":"Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature","authors":"Zuoliang Wang, Qimin Xu, Zehua Chen","doi":"10.1177/09544070241227089","DOIUrl":null,"url":null,"abstract":"In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.","PeriodicalId":509770,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","volume":"94 25","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/09544070241227089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.

查看原文本刊更多论文

使用 MFCC 多项式混合特征的车辆音频预警系统自动评估方法

在车载音频预警系统的评估方面，目前还没有一种自动方法。此外，受车载环境噪声干扰、数量限制和仅有正向训练样本等因素影响，传统的音频模板匹配或识别方法准确率较低。为解决上述问题，本文提出了一种高效、准确、自动的车载音频预警系统评估方法。首先，采用 logmmse 频谱减法过滤车载环境中获取的评估音频的动态噪声和静态噪声。其次，利用基于短时能量的端点检测法获得降噪后的有效音频片段，从而准确获得音频预警片段的起始时间。然后，提取每个音频片段的 Mel Frequency Cepstrum Coefficient（MFCC）特征和多项式拟合特征。混合特征被视为基于隐马尔可夫模型-高斯混合模型（GMM-HMM）的音频匹配模型的输入。最后，根据端点检测设置的帧偏移和音频采样频率，可以计算出匹配音频警报的发射时间，从而为车辆音频警报系统的评估提供支持。实验结果表明，在采用动态-静态降噪和 MFCC-多项式混合特征的情况下，在只有 5 个训练样本的情况下，所提方法的平均匹配准确率达到 99.6%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering

自引率

0.00%

发文量