{"title":"Combination of system and source characteristics for speaker verification under limited data condition","authors":"T. R. Jayanthi Kumari, H. S. Jayanna","doi":"10.1109/CSPA.2016.7515823","DOIUrl":null,"url":null,"abstract":"There is immense potential for speaker verification system under limited data condition in several real life applications. This paper explains how the combined dissimilar characteristics of voice data improve the performance of speaker verification when training and testing data lengths are reduced (less than 15 sec). To carry out this work, Mel-Frequency Cepstral Coefficients (MFCC), Linear Prediction Cepstral Coefficient (LPCC), Linear Prediction Residual (LPR) and Linear Prediction Residual Phase (LPRP) features are considered. The features are extracted from these extraction techniques are studied individually and pooled them to acquire better verification performance. The experimental evaluation is made by different classifiers of Gaussian mixture model (GMM) and GMM-universal background model (GMM-UBM). The NIST-2003 dataset is used for conducting experiments. The combined dissimilar features provide relatively improved performance compared to all individual features. The GMM-UBM classifier comparatively gives reduced equal error rate (EER) compared to GMM.","PeriodicalId":314829,"journal":{"name":"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2016.7515823","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
There is immense potential for speaker verification system under limited data condition in several real life applications. This paper explains how the combined dissimilar characteristics of voice data improve the performance of speaker verification when training and testing data lengths are reduced (less than 15 sec). To carry out this work, Mel-Frequency Cepstral Coefficients (MFCC), Linear Prediction Cepstral Coefficient (LPCC), Linear Prediction Residual (LPR) and Linear Prediction Residual Phase (LPRP) features are considered. The features are extracted from these extraction techniques are studied individually and pooled them to acquire better verification performance. The experimental evaluation is made by different classifiers of Gaussian mixture model (GMM) and GMM-universal background model (GMM-UBM). The NIST-2003 dataset is used for conducting experiments. The combined dissimilar features provide relatively improved performance compared to all individual features. The GMM-UBM classifier comparatively gives reduced equal error rate (EER) compared to GMM.