A study on Mandarin broadcast news speech recognition

2004 International Symposium on Chinese Spoken Language Processing Pub Date : 2004-12-01 DOI:10.1109/CHINSL.2004.1409635

C. L. Chen, Yih-Ru Wang, Sin-Horng Chen

引用次数: 0

Abstract

In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MATBN database. It considers the acoustic modeling for Mandarin base-syllables, particles, and paralinguistic phenomena. It also considers environment-dependent acoustic modeling for three recording environments: studio anchors, outdoor reporters, and outdoor interviewees. Moreover, it incorporates a bigram language model with adaptation, using the data in MATBN. Syllable recognition rates of 89.64, 84.42 and 61.62% were achieved for the three environments of anchors, reporters and interviewees, respectively.

查看原文本刊更多论文

普通话广播新闻语音识别研究

本文利用MATBN数据库构建了一个基本的普通话广播新闻语音识别系统。它考虑了普通话基本音节、小调和副语言现象的声学建模。它还考虑了三种录音环境的环境相关声学建模:演播室主播、户外记者和户外受访者。此外，它还结合了一个具有自适应功能的双元语言模型，使用了MATBN中的数据。主播、记者、受访者三种环境下的音节识别率分别为89.64%、84.42%、61.62%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2004 International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量