Language independent and language adaptive large vocabulary speech recognition

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI:10.21437/ICSLP.1998-751

Tanja Schultz, A. Waibel

引用次数: 90

Abstract

This paper describes the design of a multilingual speech recognizer using an LVCSR dictation database which has been collected under the project GlobalPhone. This project at the University of Karlsruhe investigates LVCSR systems in 15 languages of the world, namely Arabic, Chinese, Croatian, English, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish, Swedish, Tamil, and Turkish. Based on a global phoneme set we built different multilingual speech recognition systems for five of the 15 languages. Context dependent phoneme models are created data-driven by introducing questions about language and language groups to our polyphone clustering procedure. We apply the resulting multilingual models to unseen languages and present several recognition results in language independent and language adaptive setups.

查看原文本刊更多论文

语言独立和语言自适应的大词汇量语音识别

本文介绍了一种基于LVCSR听写数据库的多语言语音识别器的设计，该数据库是在GlobalPhone项目下收集的。卡尔斯鲁厄大学的这个项目研究了世界上15种语言的LVCSR系统，即阿拉伯语、中文、克罗地亚语、英语、法语、德语、意大利语、日语、韩语、葡萄牙语、俄语、西班牙语、瑞典语、泰米尔语和土耳其语。基于全球音素集，我们为15种语言中的5种构建了不同的多语言语音识别系统。上下文相关的音素模型是通过在我们的多音素聚类过程中引入关于语言和语言群的问题来创建数据驱动的。我们将所得到的多语言模型应用于未见过的语言，并在语言独立和语言自适应设置中给出了几个识别结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

5th International Conference on Spoken Language Processing (ICSLP 1998)

自引率

0.00%

发文量