A discriminative training method incorporating pronunciation variations for dysarthric automatic speech recognition

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) Pub Date : 2016-12-01 DOI:10.1109/APSIPA.2016.7820840

Woo Kyeong Seong, Nam Kyun Kim, H. Ha, H. Kim

引用次数: 2

Abstract

While dysarthric speech recognition can be a convenient interface for dysarthric speakers, it is hard to collect enough speech data to overcome the underestimation problem of acoustic models. In addition, there are lots of pronunciation variations in the collected database due to the paralysis of the articulator of dysarthric speakers. Thus, a discriminative training method is proposed for improving the performance of such resource-limited dysarthric speech recognition. The proposed method is applied to subspace Gaussian mixture modeling by incorporating pronunciation variations into a conventional minimum phone error discriminative training method.

查看原文本刊更多论文

一种结合发音变化的辨别性训练方法用于困难语音自动识别

虽然困难语音识别可以为困难语音说话者提供方便的接口，但很难收集足够的语音数据来克服声学模型的低估问题。此外，由于发音困难的说话者的发音麻痹，在收集到的数据库中存在大量的发音变异。因此，我们提出了一种判别训练方法来提高这种资源有限的困难语音识别的性能。该方法将语音变化与传统的最小电话误差判别训练方法相结合，应用于子空间高斯混合建模。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

自引率

0.00%

发文量