Combining NDHMM and phonetic feature detection for speech recognition

2015 23rd European Signal Processing Conference (EUSIPCO) Pub Date : 2015-12-28 DOI:10.1109/EUSIPCO.2015.7362667

T. Svendsen, Jarle Bauck Hamar

引用次数: 1

Abstract

Non-negative HMM (N-HMM) [1] is a model that is well suited for modeling a mixture of e.g. audio signals, but does not have the ability to generalize to model unseen data. Non-negative durational HMM (NdHMM) has recently been proposed [2] as a modification to N-HMM that can allow for generalization, and thus make the approach suitable for automatic speech recognition. A detector-based approach to speech recognition has been studied by several researchers as an alternative to the traditional HMM approach. A bank of phonetic feature detectors will produce phonetic feature posteriors, which fit well with the non-negativity constraint of NdHMM. We review the NdHMM approach proposed in [2] and propose to extend this approach by combining NdHMM with a phonetic feature detection front-end in a tandem-like system. Experimental results of the proposed approach are presented.

查看原文本刊更多论文

结合NDHMM和语音特征检测进行语音识别

非负HMM (N-HMM)[1]是一种非常适合建模混合信号(如音频信号)的模型，但不具备推广到建模未知数据的能力。非负持续HMM (NdHMM)最近被提出[2]，作为N-HMM的修改，可以允许泛化，从而使该方法适用于自动语音识别。一些研究者研究了一种基于检测器的语音识别方法，作为传统HMM方法的替代方法。一组语音特征检测器生成语音特征后验，该后验很好地满足了NdHMM的非否定约束。我们回顾了[2]中提出的NdHMM方法，并建议通过将NdHMM与类似串联系统中的语音特征检测前端相结合来扩展该方法。给出了该方法的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 23rd European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量