Aggregate a posteriori linear regression adaptation

IEEE Trans. Speech Audio Process. Pub Date : 2006-12-01 DOI:10.1109/TSA.2005.860847

Jen-Tzung Chien, Chih-Hsien Huang

{"title":"Aggregate a posteriori linear regression adaptation","authors":"Jen-Tzung Chien, Chih-Hsien Huang","doi":"10.1109/TSA.2005.860847","DOIUrl":null,"url":null,"abstract":"We present a new discriminative linear regression adaptation algorithm for hidden Markov model (HMM) based speech recognition. The cluster-dependent regression matrices are estimated from speaker-specific adaptation data through maximizing the aggregate a posteriori probability, which can be expressed in a form of classification error function adopting the logarithm of posterior distribution as the discriminant function. Accordingly, the aggregate a posteriori linear regression (AAPLR) is developed for discriminative adaptation where the classification errors of adaptation data are minimized. Because the prior distribution of regression matrix is involved, AAPLR is geared with the Bayesian learning capability. We demonstrate that the difference between AAPLR discriminative adaptation and maximum a posteriori linear regression (MAPLR) adaptation is due to the treatment of the evidence. Different from minimum classification error linear regression (MCELR), AAPLR has closed-form solution to fulfil rapid adaptation. Experimental results reveal that AAPLR speaker adaptation does improve speech recognition performance with moderate computational cost compared to maximum likelihood linear regression (MLLR), MAPLR, MCELR and conditional maximum likelihood linear regression (CMLLR). These results are verified for supervised adaptation as well as unsupervised adaptation for different numbers of adaptation data.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"12 5 1","pages":"797-807"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Trans. Speech Audio Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSA.2005.860847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

We present a new discriminative linear regression adaptation algorithm for hidden Markov model (HMM) based speech recognition. The cluster-dependent regression matrices are estimated from speaker-specific adaptation data through maximizing the aggregate a posteriori probability, which can be expressed in a form of classification error function adopting the logarithm of posterior distribution as the discriminant function. Accordingly, the aggregate a posteriori linear regression (AAPLR) is developed for discriminative adaptation where the classification errors of adaptation data are minimized. Because the prior distribution of regression matrix is involved, AAPLR is geared with the Bayesian learning capability. We demonstrate that the difference between AAPLR discriminative adaptation and maximum a posteriori linear regression (MAPLR) adaptation is due to the treatment of the evidence. Different from minimum classification error linear regression (MCELR), AAPLR has closed-form solution to fulfil rapid adaptation. Experimental results reveal that AAPLR speaker adaptation does improve speech recognition performance with moderate computational cost compared to maximum likelihood linear regression (MLLR), MAPLR, MCELR and conditional maximum likelihood linear regression (CMLLR). These results are verified for supervised adaptation as well as unsupervised adaptation for different numbers of adaptation data.

查看原文本刊更多论文

集合后验线性回归自适应

提出了一种新的基于隐马尔可夫模型的语音识别判别线性回归自适应算法。从特定说话人的适应数据中，通过最大化集合的后验概率来估计聚类相关的回归矩阵，该后验概率可以表示为采用后验分布的对数作为判别函数的分类误差函数。在此基础上，提出了一种用于判别自适应的聚类后先验线性回归(AAPLR)方法，使自适应数据的分类误差最小化。由于涉及到回归矩阵的先验分布，AAPLR与贝叶斯学习能力相结合。我们证明了AAPLR判别适应与最大后检线性回归(MAPLR)适应之间的差异是由于对证据的处理。与最小分类误差线性回归(MCELR)不同，AAPLR具有封闭解，可以实现快速自适应。实验结果表明，与最大似然线性回归(MLLR)、MAPLR、mclr和条件最大似然线性回归(CMLLR)相比，AAPLR自适应方法在计算成本中等的情况下提高了语音识别性能。对不同数量的自适应数据进行了监督自适应和无监督自适应的验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Trans. Speech Audio Process.

自引率

0.00%

发文量