{"title":"分类器在文本无关说话人识别中的应用","authors":"N. P. Jawarkar, R. S. Holambe, T. Basu","doi":"10.1109/ACES.2014.6808023","DOIUrl":null,"url":null,"abstract":"In this paper we have presented the comparative study of different modelling techniques (classifiers) for the text independent speaker identification. Four classifiers, namely, Gaussian mixture models, Fuzzy min-max neural network, Self organizing map, and Vector Quantization based Probabilistic Neural Network (VQ-PNN) have been used for the study. The database containing speech utterances recorded from forty two speakers in Hindi language was used for experimentation. Mel frequency cepstral coefficients that represent short time spectrum are used as features for identification. The performance of four classifiers is analysed under clean- and noisy-speech environment for different signal to noise ratios. All the four classifiers have almost similar performance for 10 second test speech utterances under clean environment. However, GMM outperforms other three classifiers under noisy test conditions.","PeriodicalId":353124,"journal":{"name":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"On the use of classifiers for text-independent speaker identification\",\"authors\":\"N. P. Jawarkar, R. S. Holambe, T. Basu\",\"doi\":\"10.1109/ACES.2014.6808023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we have presented the comparative study of different modelling techniques (classifiers) for the text independent speaker identification. Four classifiers, namely, Gaussian mixture models, Fuzzy min-max neural network, Self organizing map, and Vector Quantization based Probabilistic Neural Network (VQ-PNN) have been used for the study. The database containing speech utterances recorded from forty two speakers in Hindi language was used for experimentation. Mel frequency cepstral coefficients that represent short time spectrum are used as features for identification. The performance of four classifiers is analysed under clean- and noisy-speech environment for different signal to noise ratios. All the four classifiers have almost similar performance for 10 second test speech utterances under clean environment. However, GMM outperforms other three classifiers under noisy test conditions.\",\"PeriodicalId\":353124,\"journal\":{\"name\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"volume\":\"63 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 First International Conference on Automation, Control, Energy and Systems (ACES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACES.2014.6808023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 First International Conference on Automation, Control, Energy and Systems (ACES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACES.2014.6808023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the use of classifiers for text-independent speaker identification
In this paper we have presented the comparative study of different modelling techniques (classifiers) for the text independent speaker identification. Four classifiers, namely, Gaussian mixture models, Fuzzy min-max neural network, Self organizing map, and Vector Quantization based Probabilistic Neural Network (VQ-PNN) have been used for the study. The database containing speech utterances recorded from forty two speakers in Hindi language was used for experimentation. Mel frequency cepstral coefficients that represent short time spectrum are used as features for identification. The performance of four classifiers is analysed under clean- and noisy-speech environment for different signal to noise ratios. All the four classifiers have almost similar performance for 10 second test speech utterances under clean environment. However, GMM outperforms other three classifiers under noisy test conditions.