Joint MFCC-and-vector quantization based text-independent speaker recognition system

2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE) Pub Date : 1900-01-01 DOI:10.1109/ICCCCEE.2017.7867612

Ala Eldin Omer

{"title":"Joint MFCC-and-vector quantization based text-independent speaker recognition system","authors":"Ala Eldin Omer","doi":"10.1109/ICCCCEE.2017.7867612","DOIUrl":null,"url":null,"abstract":"Signal processing front end for extracting the feature set is an important stage in any speaker recognition system. There are many types of features that are derived differently and have good impact on the recognition rate. This paper uses one of the techniques to extract the feature set from a speech signal known as Mel Frequency Cepstrum Coefficients (MFCCs) to represent the signal parametrically for further processing. Speakers provide samples of their voices once in a training session and once in a testing session later. Subsequently, the feature coefficients {MFCCs} are calculated in both phases and the speaker is identified according to the minimum quantization distance which is calculated between the stored features in the training phase and the MFCCs of the speaker who requests to log into the system in the testing phase. The proposed recognition system was designed and implemented using three different algorithms in MATLAB. Simulation and experimental results show that the Joint MFCC-and-vector quantization algorithm achieves better performance compared to the MFCC and FFT algorithms in terms of recognition accuracy and text dependency.","PeriodicalId":227798,"journal":{"name":"2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCCEE.2017.7867612","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Signal processing front end for extracting the feature set is an important stage in any speaker recognition system. There are many types of features that are derived differently and have good impact on the recognition rate. This paper uses one of the techniques to extract the feature set from a speech signal known as Mel Frequency Cepstrum Coefficients (MFCCs) to represent the signal parametrically for further processing. Speakers provide samples of their voices once in a training session and once in a testing session later. Subsequently, the feature coefficients {MFCCs} are calculated in both phases and the speaker is identified according to the minimum quantization distance which is calculated between the stored features in the training phase and the MFCCs of the speaker who requests to log into the system in the testing phase. The proposed recognition system was designed and implemented using three different algorithms in MATLAB. Simulation and experimental results show that the Joint MFCC-and-vector quantization algorithm achieves better performance compared to the MFCC and FFT algorithms in terms of recognition accuracy and text dependency.

查看原文本刊更多论文

基于mfcc和矢量量化的文本独立说话人识别系统

信号处理前端提取特征集是任何说话人识别系统的重要环节。有许多类型的特征是不同的，对识别率有很好的影响。本文使用其中一种技术从语音信号中提取特征集，称为Mel频率倒谱系数(MFCCs)，以参数化表示信号以进行进一步处理。演讲者在训练阶段提供一次他们的声音样本，然后在测试阶段提供一次。随后，计算两阶段的特征系数{mfccc}，根据训练阶段存储的特征与测试阶段请求登录系统的说话人的mfccc之间计算的最小量化距离来识别说话人。在MATLAB中使用三种不同的算法设计并实现了所提出的识别系统。仿真和实验结果表明，与MFCC和FFT算法相比，MFCC和矢量量化联合算法在识别精度和文本依赖性方面取得了更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Communication, Control, Computing and Electronics Engineering (ICCCCEE)

自引率

0.00%

发文量