Reactive and continuous control of HMM-based speech synthesis

2012 IEEE Spoken Language Technology Workshop (SLT) Pub Date : 2012-12-01 DOI:10.1109/SLT.2012.6424231

M. Astrinaki, N. D'Alessandro, B. Picart, Thomas Drugman, T. Dutoit

引用次数: 26

Abstract

In this paper, we present a modified version of HTS, called performative HTS or pHTS. The objective of pHTS is to enhance the control ability and reactivity of HTS. pHTS reduces the phonetic context used for training the models and generates the speech parameters within a 2-label window. Speech waveforms are generated on-the-fly and the models can be re-actively modified, impacting the synthesized speech with a delay of only one phoneme. It is shown that HTS and pHTS have comparable output quality. We use this new system to achieve reactive model interpolation and conduct a new test where articulation degree is modified within the sentence.

查看原文本刊更多论文

基于hmm的语音合成反应和连续控制

在本文中，我们提出了一个改进的HTS版本，称为高性能HTS或pHTS。pHTS的目标是提高HTS的控制能力和反应性。pHTS减少了用于训练模型的语音上下文，并在一个2标签窗口内生成语音参数。语音波形是实时生成的，模型可以被主动修改，影响合成语音只有一个音素的延迟。结果表明，HTS和pHTS具有相当的输出质量。我们使用这个新系统来实现反应模型插值，并进行了一个新的测试，其中在句子内修改了发音度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE Spoken Language Technology Workshop (SLT)

自引率

0.00%

发文量