{"title":"Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation","authors":"Yajie Miao, Florian Metze, A. Waibel","doi":"10.1109/ICASSP.2013.6639208","DOIUrl":null,"url":null,"abstract":"Eigenspace MLLR is effective for fast adaptation when the amount of adaptation data is limited, e.g., less than 5s. The general motivation is to represent the MLLR transform as a linear combination of basis matrices. In this paper, we present a framework to estimate a speaker-independent discriminative transform over the combination coefficients. This discriminative basis coefficients transform (DBCT) is learned by optimizing discriminative criteria over all the training speakers. During recognition, the ML basis coefficients for each testing speaker are firstly found, on which DBCT is applied to give the final MLLR transform discrimination ability. Experiments show that DBCT results in consistent WER reduction in unsupervised adaptation, compared with both standard ML and discriminatively trained transforms.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.2013.6639208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 5
Abstract
Eigenspace MLLR is effective for fast adaptation when the amount of adaptation data is limited, e.g., less than 5 seconds. The underlying idea is to represent the MLLR transform as a linear combination of basis matrices. In this paper, we present a framework for estimating a speaker-independent discriminative transform over the combination coefficients. This discriminative basis coefficients transform (DBCT) is learned by optimizing discriminative criteria over all the training speakers. During recognition, the ML basis coefficients for each test speaker are first estimated, and DBCT is then applied to them to give the final MLLR transform discriminative power. Experiments show that DBCT yields consistent WER reductions in unsupervised adaptation, compared with both standard ML and discriminatively trained transforms.
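As a rough sketch of the setup the abstract describes (the notation below is assumed for illustration, not taken from the paper): the speaker's MLLR transform is a weighted sum of basis matrices, and DBCT acts as a speaker-independent map, assumed linear here, applied to the ML coefficient vector before recomposing the final transform.

```latex
% Minimal sketch under assumed notation; W_s is speaker s's MLLR transform,
% B_i the eigenspace basis matrices, d_s the combination coefficients, T the DBCT.
\documentclass{article}
\usepackage{amsmath}
\begin{document}
\begin{align}
  W_s &= \textstyle\sum_{i=1}^{K} d_{s,i}\, B_i
    && \text{eigenspace MLLR: linear combination of basis matrices} \\
  \hat{d}_s &= T\, d_s^{\mathrm{ML}}
    && \text{DBCT: speaker-independent transform of the ML coefficients} \\
  \hat{W}_s &= \textstyle\sum_{i=1}^{K} \hat{d}_{s,i}\, B_i
    && \text{final MLLR transform used at recognition time}
\end{align}
\end{document}
```

In this reading, $T$ is trained once over all training speakers with a discriminative criterion, while the per-speaker coefficients $d_s^{\mathrm{ML}}$ are still estimated by maximum likelihood on the (small amount of) adaptation data.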