{"title":"Speaker adaptation using Maximum Likelihood General Regression","authors":"M. H. Bahari, H. V. hamme","doi":"10.1109/ISSPA.2012.6310564","DOIUrl":null,"url":null,"abstract":"In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, a new method called Maximum Likelihood General Regression (MLGR) is introduced for speaker adaptation. Gaussian means of a speaker independent (SI) model are adapted to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. MLGR performs a non-linear regression between ML estimates of the means and the SI means using General Regression Neural Network. The proposed method is evaluated on the Wall Street Journal database. Evaluation results show that the suggested scheme outperforms different conventional approaches in the case of short adaptation utterances. We also mathematically prove that the Gaussian means of the adapted model using the MLGR converges to their ML estimates in the case of long adaptation utterances.