Language identification through large vocabulary continuous speech recognition

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-15 DOI:10.1109/CHINSL.2004.1409583

Boon Pang Lim, Haizhou Li, Yu Chen

引用次数: 3

Abstract

In recent years, automatic language identification has become an increasingly important component in practical spoken language systems, and much attention has been devoted to various competing approaches. In this paper, we are concerned with the automatic identification of languages that may be highly similar in nature, such as the various dialects of Chinese. Our approach differs from many recent successful systems by exploiting a fusion of feature scores readily available from a large vocabulary speech recognition system. We show that such features are able to distinguish among the similar sounding dialects of Chinese, and experiments on a nine language corpus show promising performance on a three way identification task.

查看原文本刊更多论文

语言识别通过大词汇量连续语音识别

近年来，自动语言识别已成为实用口语系统中越来越重要的组成部分，各种相互竞争的方法引起了人们的广泛关注。在本文中，我们关注的是可能在性质上高度相似的语言的自动识别，例如汉语的各种方言。我们的方法与最近许多成功的系统不同，它利用了从大词汇量语音识别系统中随时可用的特征分数的融合。我们证明了这些特征能够区分发音相似的汉语方言，并且在九种语言语料库上的实验表明，这些特征在三向识别任务上表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2004 International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量