Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature

Zuoliang Wang, Qimin Xu, Zehua Chen
{"title":"Automatic evaluation method for vehicle audio warning system using MFCC-polynomial hybrid feature","authors":"Zuoliang Wang, Qimin Xu, Zehua Chen","doi":"10.1177/09544070241227089","DOIUrl":null,"url":null,"abstract":"In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.","PeriodicalId":509770,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/09544070241227089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the evaluation of vehicle audio warning system, there is no automatic method. Besides, due to the noise interference of in-vehicle environmental, the quantity limitation and only positive training samples, the accuracy of traditional template matching or identification methods for audio is low. To solve the above problems, an efficient, accurate, and automatic evaluation method is proposed for vehicle audio warning system. Firstly, logmmse-spectrum subtraction method is used to filter the dynamic noise and static noise of the evaluation audio acquired in the in-vehicle environment. Secondly, the end point detection based on short-time energy is used to obtain the effective audio segment after noise reduction, and the start time of the audio warning segment can be accurately obtained. Then, the Mel Frequency Cepstrum Coefficient (MFCC) feature and the polynomial fitting feature of each audio segment are extracted. The hybrid features are treated as the input of the Hidden Markov Model-Gaussian Mixture Model (GMM-HMM) based audio matching model. Finally, according to frame shift set by endpoint detection and the audio sampling frequency, the emitted time of matched audio warning can be calculated to support the evaluation of vehicle audio warning system. The experimental result shows that, with dynamic-static noise reduction and MFCC-polynomial hybrid feature, the average matching accuracy of the proposed method reaches 99.6% in the case of only five training samples.
使用 MFCC 多项式混合特征的车辆音频预警系统自动评估方法
在车载音频预警系统的评估方面,目前还没有一种自动方法。此外,受车载环境噪声干扰、数量限制和仅有正向训练样本等因素影响,传统的音频模板匹配或识别方法准确率较低。为解决上述问题,本文提出了一种高效、准确、自动的车载音频预警系统评估方法。首先,采用 logmmse 频谱减法过滤车载环境中获取的评估音频的动态噪声和静态噪声。其次,利用基于短时能量的端点检测法获得降噪后的有效音频片段,从而准确获得音频预警片段的起始时间。然后,提取每个音频片段的 Mel Frequency Cepstrum Coefficient(MFCC)特征和多项式拟合特征。混合特征被视为基于隐马尔可夫模型-高斯混合模型(GMM-HMM)的音频匹配模型的输入。最后,根据端点检测设置的帧偏移和音频采样频率,可以计算出匹配音频警报的发射时间,从而为车辆音频警报系统的评估提供支持。实验结果表明,在采用动态-静态降噪和 MFCC-多项式混合特征的情况下,在只有 5 个训练样本的情况下,所提方法的平均匹配准确率达到 99.6%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信