{"title":"On the use of classifiers for text-independent speaker identification","authors":"N. P. Jawarkar, R. S. Holambe, T. Basu","doi":"10.1109/ACES.2014.6808023","DOIUrl":null,"url":null,"abstract":"In this paper we have presented the comparative study of different modelling techniques (classifiers) for the text independent speaker identification. Four classifiers, namely, Gaussian mixture models, Fuzzy min-max neural network, Self organizing map, and Vector Quantization based Probabilistic Neural Network (VQ-PNN) have been used for the study. The database containing speech utterances recorded from forty two speakers in Hindi language was used for experimentation. Mel frequency cepstral coefficients that represent short time spectrum are used as features for identification. The performance of four classifiers is analysed under clean- and noisy-speech environment for different signal to noise ratios. All the four classifiers have almost similar performance for 10 second test speech utterances under clean environment. However, GMM outperforms other three classifiers under noisy test conditions.","PeriodicalId":353124,"journal":{"name":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACES.2014.6808023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper we have presented the comparative study of different modelling techniques (classifiers) for the text independent speaker identification. Four classifiers, namely, Gaussian mixture models, Fuzzy min-max neural network, Self organizing map, and Vector Quantization based Probabilistic Neural Network (VQ-PNN) have been used for the study. The database containing speech utterances recorded from forty two speakers in Hindi language was used for experimentation. Mel frequency cepstral coefficients that represent short time spectrum are used as features for identification. The performance of four classifiers is analysed under clean- and noisy-speech environment for different signal to noise ratios. All the four classifiers have almost similar performance for 10 second test speech utterances under clean environment. However, GMM outperforms other three classifiers under noisy test conditions.