Advances in Arabic broadcast news transcription at RWTH

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI:10.1109/ASRU.2007.4430154

David Rybach, Stefan Hahn, C. Gollan, R. Schlüter, H. Ney

引用次数: 30

Abstract

This paper describes the RWTH speech recognition system for Arabic. Several design aspects of the system, including cross-adaptation, multiple system design and combination, are analyzed. We summarize the semi-automatic lexicon generation for Arabic using a statistical approach to grapheme-to-phoneme conversion and pronunciation statistics. Furthermore, a novel ASR-based audio segmentation algorithm is presented. Finally, we discuss practical approaches for parallelized acoustic training and memory efficient lattice rescoring. Systematic results are reported on recent GALE evaluation corpora.

查看原文本刊更多论文

工业大学阿拉伯语广播新闻转录的进展

本文介绍了RWTH阿拉伯语语音识别系统。分析了系统的交叉适应、多系统设计和组合等几个设计方面的问题。我们总结了阿拉伯语半自动词汇生成使用的统计方法，字形音素转换和发音统计。在此基础上，提出了一种新的基于asr的音频分割算法。最后，我们讨论了并行声学训练和高效记忆点阵重记的实际方法。系统地报道了最近的GALE评价语料库的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

自引率

0.00%

发文量