{"title":"Source and system features for text independent speaker identification using iterative clustering approach","authors":"A. Revathi, Y. Venkataramani","doi":"10.1109/ICSIPA.2009.5478637","DOIUrl":null,"url":null,"abstract":"The main objective of this paper is to explore the effectiveness of perceptual features combined with pitch for text independent speaker recognition. The proposed combined features are captured and training models are developed by K-means clustering procedure. Speaker recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in performing speaker recognition based on minimum distance between test features and clusters. This algorithm gives the overall accuracy of 99.675% and 98.75% for the combined features and perceptual features respectively for identifying speaker among 8 speakers chosen randomly from 8 different dialect regions in “TIMIT” database. It also gives average accuracy of 96.375% and 95.625% for perceptual linear predictive cepstrum combined with pitch and perceptual linear predictive cepstrum respectively for 8 speakers chosen randomly from the same dialect region. The noteworthy feature of speaker identification algorithm is to evaluate the testing procedure on identical messages for all speakers. In this work, Fratio is computed as a theoretical measure to validate the experimental results on speaker recognition.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"400 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Signal and Image Processing Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2009.5478637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The main objective of this paper is to explore the effectiveness of perceptual features combined with pitch for text independent speaker recognition. The proposed combined features are captured and training models are developed by K-means clustering procedure. Speaker recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in performing speaker recognition based on minimum distance between test features and clusters. This algorithm gives the overall accuracy of 99.675% and 98.75% for the combined features and perceptual features respectively for identifying speaker among 8 speakers chosen randomly from 8 different dialect regions in “TIMIT” database. It also gives average accuracy of 96.375% and 95.625% for perceptual linear predictive cepstrum combined with pitch and perceptual linear predictive cepstrum respectively for 8 speakers chosen randomly from the same dialect region. The noteworthy feature of speaker identification algorithm is to evaluate the testing procedure on identical messages for all speakers. In this work, Fratio is computed as a theoretical measure to validate the experimental results on speaker recognition.