A text-independent speaker recognition system based on Probabilistic Principle Component Analysis
Luan Xiao-chun, Yin Jun-xun, Hu Wei-ping
2012 3rd International Conference on System Science, Engineering Design and Manufacturing Informatization, 2012-11-12
DOI: 10.1109/ICSSEM.2012.6340721 (https://doi.org/10.1109/ICSSEM.2012.6340721)
In text-independent speaker recognition, phoneme variability between training and testing speech data severely degrades recognition performance. To alleviate this problem, the paper proposes a text-independent (TI) speaker identification method that suppresses phonetic information with a subspace method; Probabilistic Principal Component Analysis (PPCA) is used to construct the subspaces. First, the covariance matrix is obtained from a large set of training speech features, and then the projection matrix is obtained using the EM algorithm. The method assumes that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and that its complementary subspace is a “phoneme-independent subspace”. The feature vectors of the training and test speech data are projected onto the phoneme-independent subspace to obtain new feature vectors. In GMM-based TI speaker identification experiments, the new feature vectors improve the identification rate by 16.25% and 2.99% compared with conventional MFCC and PCA-based MFCC, respectively. This shows that the new feature vectors of the proposed method can efficiently capture speaker-discriminative information while suppressing other speech information.
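The projection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact construction, so the code assumes the standard covariance-driven EM updates for PPCA and takes the orthogonal complement of the learned loading matrix as the "phoneme-independent subspace". The function names `ppca_em` and `complement_basis`, the choice `q = 3`, and the synthetic data standing in for MFCC features are all hypothetical.

```python
import numpy as np


def ppca_em(S, q, n_iter=200, seed=0):
    """Fit PPCA by EM from a d x d covariance matrix S.

    Uses the closed-form EM updates written in terms of S.
    Returns the d x q loading matrix W and the noise variance sigma2.
    """
    d = S.shape[0]
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((d, q))  # random start (assumption)
    sigma2 = 1.0
    for _ in range(n_iter):
        M = W.T @ W + sigma2 * np.eye(q)
        M_inv = np.linalg.inv(M)
        SW = S @ W
        W_new = SW @ np.linalg.inv(sigma2 * np.eye(q) + M_inv @ W.T @ SW)
        sigma2 = np.trace(S - SW @ M_inv @ W_new.T) / d
        W = W_new
    return W, sigma2


def complement_basis(W):
    """Orthonormal basis of the orthogonal complement of span(W)."""
    d, q = W.shape
    Q, _ = np.linalg.qr(W, mode="complete")  # Q is d x d
    return Q[:, q:]  # the last d - q columns are orthogonal to span(W)


# Synthetic stand-in for MFCC frames (assumption): 12-dimensional features
# with 3 high-variance directions playing the role of phonetic variation.
rng = np.random.default_rng(1)
d, q, n = 12, 3, 5000
A, _ = np.linalg.qr(rng.standard_normal((d, q)))  # d x q orthonormal columns
X = 5.0 * rng.standard_normal((n, q)) @ A.T + 0.5 * rng.standard_normal((n, d))

mean = X.mean(axis=0)
S = np.cov(X, rowvar=False)   # covariance of the training features
W, sigma2 = ppca_em(S, q)     # large-variance ("phoneme-dependent") subspace
B = complement_basis(W)       # assumed "phoneme-independent" subspace
Y = (X - mean) @ B            # new (d - q)-dimensional feature vectors

print(Y.shape, float(sigma2))
```

In a full system, the same `mean` and `B` estimated on training data would be applied to test features before GMM training and scoring; the number of removed dimensions `q` is a tuning choice that the abstract does not specify.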