Discriminative adaptive training with VTS and JUD

2009 IEEE Workshop on Automatic Speech Recognition & Understanding Pub Date : 2009-12-01 DOI:10.1109/ASRU.2009.5373266

F. Flego, M. Gales

引用次数: 23

Abstract

Adaptive training is a powerful approach for building speech recognition systems on non-homogeneous training data. Recently approaches based on predictive model-based compensation schemes, such as Joint Uncertainty Decoding (JUD) and Vector Taylor Series (VTS), have been proposed. This paper reviews these model-based compensation schemes and relates them to factor-analysis style systems. Forms of Maximum Likelihood (ML) adaptive training with these approaches are described, based on both second-order optimisation schemes and Expectation Maximisation (EM). However, discriminative training is used in many state-of-the-art speech recognition. Hence, this paper proposes discriminative adaptive training with predictive model-compensation approaches for noise robust speech recognition. This training approach is applied to both JUD and VTS compensation with minimum phone error training. A large scale multi-environment training configuration is used and the systems evaluated on a range of in-car collected data tasks.

查看原文本刊更多论文

基于VTS和JUD的判别适应性训练

自适应训练是在非同构训练数据上构建语音识别系统的一种有效方法。近年来，人们提出了基于预测模型的补偿方案，如联合不确定性解码(JUD)和矢量泰勒级数(VTS)。本文综述了这些基于模型的薪酬方案，并将它们与因子分析风格的系统联系起来。基于二阶优化方案和期望最大化(EM)，描述了使用这些方法的最大似然(ML)自适应训练的形式。然而，判别训练被用于许多最先进的语音识别。因此，本文提出了基于预测模型补偿的判别自适应训练方法用于噪声鲁棒语音识别。该训练方法同时应用于JUD和VTS补偿，并实现了最小的电话误差训练。使用了大规模的多环境训练配置，并对一系列车内收集的数据任务进行了系统评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Workshop on Automatic Speech Recognition & Understanding

自引率

0.00%

发文量