Robust syllable segmentation and its application to syllable-centric continuous speech recognition

2010 National Conference On Communications (NCC) Pub Date : 2010-03-15 DOI:10.1109/NCC.2010.5430189

Rajesh Janakiraman, J. Kumar, H. Murthy

引用次数: 23

Abstract

The focus of this paper is two-fold: (a) to develop a knowledge-based robust syllable segmentation algorithm and (b) to establish the importance of accurate segmentation in both the training and testing phases of a speech recognition system. A robust segmentation algorithm for segmenting the speech signal into syllables is first developed. This uses a non-statistical technique that is based on group delay(GD) segmentation and Vowel Onset point(VOP) detection. The transcription corresponding to the utterance is syllabified using rules. This produces an annotation for the train data. The annotated train data is then used to train a syllable-based speech recognition system. The test signal is also segmented using the proposed algorithm. This segmentation information is then incorporated into the linguistic search space to reduce both computational complexity and word error rate(WER). WER's of 4.4% and 21.2% are reported on the TIMIT and NTIMIT databases respectively.

查看原文本刊更多论文

鲁棒音节分割及其在以音节为中心的连续语音识别中的应用

本文的重点是两个方面:(a)开发一种基于知识的鲁棒音节分词算法;(b)建立准确分词在语音识别系统的训练和测试阶段的重要性。首先提出了一种将语音信号分割成音节的鲁棒分割算法。这使用了一种基于群延迟(GD)分割和元音起始点(VOP)检测的非统计技术。与话语相对应的转录用规则进行音节化。这将为列车数据生成一个注释。然后使用标注的训练数据来训练基于音节的语音识别系统。利用该算法对测试信号进行了分割。然后将这些分词信息合并到语言搜索空间中，以降低计算复杂度和单词错误率(WER)。在TIMIT和NTIMIT数据库中报告的WER值分别为4.4%和21.2%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 National Conference On Communications (NCC)

自引率

0.00%

发文量