TTS - VLSP 2021: The Thunder Text-To-Speech System

VNU Journal of Science: Computer Science and Communication Engineering Pub Date : 2022-06-30 DOI:10.25073/2588-1086/vnucsce.342

N. Ngoc Anh, Nguyen Tien Thanh, Le Dang Linh

引用次数: 1

Abstract

This paper describes our speech synthesis system participating in the Vietnamese Text-To-Speech track of the 2021 VLSP evaluation campaign. The goal of this challenge is to build a synthetic voice from a provided spontaneous speech corpus in Vietnamese. In this paper, we propose our implementation of FastSpeech2 model on spontaneous speech. We used a special strategy with spontaneous datasets using the TTS system. We present our utilization in generating mel-spectrograms from given texts and then synthesize speech from generated mel-spectrograms using a separately trained vocoder. In evaluation, our team achieved 3.943 mean score in MOS in-domain test, 3.3 in MOS out-domain test, and 85.00% SUS, which indicates the effectiveness of the proposed system.

查看原文本刊更多论文

TTS - VLSP 2021:迅雷文本转语音系统

本文描述了我们的语音合成系统参与2021年VLSP评估活动的越南文本到语音轨道。这个挑战的目标是从提供的越南语自发语音语料库中构建一个合成语音。在本文中，我们提出了在自发语音上实现FastSpeech2模型。我们使用TTS系统对自发数据集使用了一种特殊的策略。我们介绍了从给定文本生成梅尔谱图的应用，然后使用单独训练的声码器从生成的梅尔谱图合成语音。在评估中，我们的团队在MOS域内测试中获得了3.943分的平均分，在MOS域外测试中获得了3.3分，SUS达到了85.00%，表明我们提出的系统是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

VNU Journal of Science: Computer Science and Communication Engineering

自引率

0.00%

发文量