A comparison of trajectory and mixture modeling in segment-based word recognition

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1993-04-27 DOI:10.1109/ICASSP.1993.319303

Ashvin Kannan, Mari Ostendorf

引用次数: 19

Abstract

A mechanism for implementing mixtures at a phone-subsegment (microsegment) level for continuous word recognition based on the stochastic segment model (SMM) is presented. The issues that are involved in tradeoffs between the trajectory and mixture modeling in segment-based word recognition are investigated. Experimental results are reported on DAPRA's speaker-independent Resource management corpus. The results obtained suggest that there is a tradeoff in using mixture models and trajectory models, associated with the level of detail of the modeling unit. The results support the use of whole segment models in the context-dependent case, and microsegment-level (and possibly segment-level) mixtures rather than frame-level mixtures.<>

查看原文本刊更多论文

基于分段词识别的轨迹建模与混合建模的比较

提出了一种基于随机段模型(SMM)在电话子段(微段)水平上实现连续词识别的混合机制。研究了基于分段的词识别中轨迹建模与混合建模之间的权衡问题。报道了DAPRA独立于说话人的资源管理语料库的实验结果。所获得的结果表明，在使用混合模型和轨迹模型时存在权衡，这与建模单元的详细程度有关。结果支持在上下文相关的情况下使用全段模型，以及微段级(可能还有段级)混合而不是帧级混合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量