{"title":"使用最大似然广义回归的说话人自适应","authors":"M. H. Bahari, H. V. hamme","doi":"10.1109/ISSPA.2012.6310564","DOIUrl":null,"url":null,"abstract":"In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Speaker adaptation using Maximum Likelihood General Regression\",\"authors\":\"M. H. Bahari, H. V. hamme\",\"doi\":\"10.1109/ISSPA.2012.6310564\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.\",\"PeriodicalId\":248763,\"journal\":{\"name\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSPA.2012.6310564\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speaker adaptation using Maximum Likelihood General Regression
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.