{"title":"Pitch in Speaker Recognition","authors":"Jian-wei Zhu, Shuifa Sun, Xiao-li Liu, B. Lei","doi":"10.1109/HIS.2009.14","DOIUrl":null,"url":null,"abstract":"In order to improve the speaker recognition accuracy, the pitch is applied to GMM-based speaker recognition (SR). The circular average magnitude difference function (CAMDF) method is used to extract the pitch. An endpoint detection method based on the pitch is proposed. The following four features are selected as the features of the SR: the mel-frequency cepstral coefficient (MFCC) based on the pitch, the pitch contour, the pitch first-order difference and the pitch changed rate. Experimental results show that the recognition rate using proposed endpoint detection method is improved 20% than that using the conventional method. The recognition rate of the proposed system using the selected four features is improved 5% than that of the speaker recognition system using the MFCC parameters only.","PeriodicalId":414085,"journal":{"name":"2009 Ninth International Conference on Hybrid Intelligent Systems","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Ninth International Conference on Hybrid Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HIS.2009.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
In order to improve the speaker recognition accuracy, the pitch is applied to GMM-based speaker recognition (SR). The circular average magnitude difference function (CAMDF) method is used to extract the pitch. An endpoint detection method based on the pitch is proposed. The following four features are selected as the features of the SR: the mel-frequency cepstral coefficient (MFCC) based on the pitch, the pitch contour, the pitch first-order difference and the pitch changed rate. Experimental results show that the recognition rate using proposed endpoint detection method is improved 20% than that using the conventional method. The recognition rate of the proposed system using the selected four features is improved 5% than that of the speaker recognition system using the MFCC parameters only.