An improved recursive algorithm for automatic alignment of complex long audio

2009 IEEE International Conference on Network Infrastructure and Digital Content Pub Date : 2009-12-31 DOI:10.1109/ICNIDC.2009.5360838

He Kejia, Liu Gang, Tang Jie, Guo Jun

引用次数: 0

Abstract

In this paper we present an approach for automatic alignment of long audio data with varied acoustic conditions to their corresponding transcripts in an effective manner. Accurate time-aligned transcripts provide easier access to audio materials by aiding applications such as the indexing, summarizing and retrieving of audio segments. Accurate time alignments are also necessary for labeling the training data for a speech recognizer's acoustic model. We provide an improved recursive technique of speech recognition with a gradually self-adaptive language model and acoustic model.

查看原文本刊更多论文

复杂长音频自动对齐的改进递归算法

在本文中，我们提出了一种有效的方法来自动校准具有不同声学条件的长音频数据与其相应的转录本。准确的与时间一致的转录本通过帮助诸如索引、总结和检索音频片段等应用程序，使音频材料更容易访问。准确的时间对齐对于标记语音识别器声学模型的训练数据也是必要的。我们提出了一种改进的递归语音识别技术，该技术采用逐步自适应的语言模型和声学模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE International Conference on Network Infrastructure and Digital Content

自引率

0.00%

发文量