Joint MFCC-and-vector quantization based text-independent speaker recognition system

Ala Eldin Omer
{"title":"Joint MFCC-and-vector quantization based text-independent speaker recognition system","authors":"Ala Eldin Omer","doi":"10.1109/ICCCCEE.2017.7867612","DOIUrl":null,"url":null,"abstract":"Signal processing front end for extracting the feature set is an important stage in any speaker recognition system. There are many types of features that are derived differently and have good impact on the recognition rate. This paper uses one of the techniques to extract the feature set from a speech signal known as Mel Frequency Cepstrum Coefficients (MFCCs) to represent the signal parametrically for further processing. Speakers provide samples of their voices once in a training session and once in a testing session later. Subsequently, the feature coefficients {MFCCs} are calculated in both phases and the speaker is identified according to the minimum quantization distance which is calculated between the stored features in the training phase and the MFCCs of the speaker who requests to log into the system in the testing phase. The proposed recognition system was designed and implemented using three different algorithms in MATLAB. Simulation and experimental results show that the Joint MFCC-and-vector quantization algorithm achieves better performance compared to the MFCC and FFT algorithms in terms of recognition accuracy and text dependency.","PeriodicalId":227798,"journal":{"name":"2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCCEE.2017.7867612","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Signal processing front end for extracting the feature set is an important stage in any speaker recognition system. There are many types of features that are derived differently and have good impact on the recognition rate. This paper uses one of the techniques to extract the feature set from a speech signal known as Mel Frequency Cepstrum Coefficients (MFCCs) to represent the signal parametrically for further processing. Speakers provide samples of their voices once in a training session and once in a testing session later. Subsequently, the feature coefficients {MFCCs} are calculated in both phases and the speaker is identified according to the minimum quantization distance which is calculated between the stored features in the training phase and the MFCCs of the speaker who requests to log into the system in the testing phase. The proposed recognition system was designed and implemented using three different algorithms in MATLAB. Simulation and experimental results show that the Joint MFCC-and-vector quantization algorithm achieves better performance compared to the MFCC and FFT algorithms in terms of recognition accuracy and text dependency.
基于mfcc和矢量量化的文本独立说话人识别系统
信号处理前端提取特征集是任何说话人识别系统的重要环节。有许多类型的特征是不同的,对识别率有很好的影响。本文使用其中一种技术从语音信号中提取特征集,称为Mel频率倒谱系数(MFCCs),以参数化表示信号以进行进一步处理。演讲者在训练阶段提供一次他们的声音样本,然后在测试阶段提供一次。随后,计算两阶段的特征系数{mfccc},根据训练阶段存储的特征与测试阶段请求登录系统的说话人的mfccc之间计算的最小量化距离来识别说话人。在MATLAB中使用三种不同的算法设计并实现了所提出的识别系统。仿真和实验结果表明,与MFCC和FFT算法相比,MFCC和矢量量化联合算法在识别精度和文本依赖性方面取得了更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信