Speaker recognition based on dynamic MFCC parameters

Wang Yutai, Li Bo, Jiang Xiaoqing, Liu Feng, Wang Lihao
{"title":"Speaker recognition based on dynamic MFCC parameters","authors":"Wang Yutai, Li Bo, Jiang Xiaoqing, Liu Feng, Wang Lihao","doi":"10.1109/IASP.2009.5054638","DOIUrl":null,"url":null,"abstract":"The Mel-frequency cepstral coefficient is the most widely used feature in speech and speaker recognition. However, the traditional MFCC is very sensitive to noise interference, which tends to drastically degrade the performance of recognition systems because of the mismatches between training and testing. In this paper, we proposed a new speaker recognition algorithm based on the dynamic MFCC parameters. As the human auditory system can sensitively perceive the pitch changes in the speech, the algorithm, which combines the speaker information obtained by the MFCC with the pitch, can dynamically construct a set of Mel-filters according to the results of pitch detection. The Mel-filters are then used to extract the dynamic MFCC parameter, which represents the speaker's identity characteristics, and enhance accuracy of speaker recognition. The experimental results show that the method can perform well in a real environment and improve much on robustness in a noisy environment. The recognition rate in different signal-to-noise ratio conditions is obviously excelled to that of traditional MFCC with 5 to 6 percentage points higher on average.","PeriodicalId":143959,"journal":{"name":"2009 International Conference on Image Analysis and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"30","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Image Analysis and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IASP.2009.5054638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 30

Abstract

The Mel-frequency cepstral coefficient is the most widely used feature in speech and speaker recognition. However, the traditional MFCC is very sensitive to noise interference, which tends to drastically degrade the performance of recognition systems because of the mismatches between training and testing. In this paper, we proposed a new speaker recognition algorithm based on the dynamic MFCC parameters. As the human auditory system can sensitively perceive the pitch changes in the speech, the algorithm, which combines the speaker information obtained by the MFCC with the pitch, can dynamically construct a set of Mel-filters according to the results of pitch detection. The Mel-filters are then used to extract the dynamic MFCC parameter, which represents the speaker's identity characteristics, and enhance accuracy of speaker recognition. The experimental results show that the method can perform well in a real environment and improve much on robustness in a noisy environment. The recognition rate in different signal-to-noise ratio conditions is obviously excelled to that of traditional MFCC with 5 to 6 percentage points higher on average.
基于动态MFCC参数的说话人识别
mel频率倒谱系数是语音和说话人识别中应用最广泛的特征。然而,传统的MFCC对噪声干扰非常敏感,由于训练和测试之间的不匹配,往往会大大降低识别系统的性能。本文提出了一种基于动态MFCC参数的说话人识别算法。由于人的听觉系统能够敏感地感知语音中的音高变化,该算法将MFCC获得的说话人信息与音高相结合,根据音高检测结果动态构建一组mel -filter。然后利用mel滤波器提取动态MFCC参数,该参数代表说话人的身份特征,提高了说话人识别的准确性。实验结果表明,该方法在真实环境下具有良好的性能,在噪声环境下的鲁棒性有较大提高。在不同信噪比条件下的识别率明显优于传统MFCC,平均提高5 ~ 6个百分点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信