Mongolian speech corpus for text-to-speech development

2011 International Conference on Speech Database and Assessments (Oriental COCOSDA). Pub Date: 2011-11-28. DOI: 10.1109/ICSDA.2011.6085994

C. Hansakunbuntheung, A. Thangthai, N. Thatphithakkul, Altangerel Chagnaa

Citations: 3

Abstract

This paper presents a first attempt to develop a Mongolian speech corpus designed for data-driven speech synthesis in Mongolia. The aim of the corpus is to support the development of a high-quality Mongolian TTS system that blind users can use with a screen reader. The corpus contains nearly 6 hours of Mongolian phones. It provides Cyrillic text transcriptions and the corresponding phonetic transcriptions with stress marking. It also provides context information, including phone context, stress levels, and syntactic position within the word, phrase, and utterance, for modeling the acoustics and characteristics of speech for synthesis.


