Construction of the acoustic inventory for a Greek text-to-speech concatenative synthesis system

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100) Pub Date : 2000-06-05 DOI:10.1109/ICASSP.2000.859113

C. Christogiannis, T. Varvarigou, Agatha Zappa, Yiannis Vamvakoulas, Chilin Shih, A. Arvaniti

引用次数: 4

Abstract

The development of the Greek text-to-speech (TTS) system by NTUA is based on the method of concatenative synthesis and follows the Bell Labs approach to this technique. Concatenative synthesis is one of the simplest methods for speech synthesis and at the same time bypasses most of the problems encountered by articulatory and formant synthesis techniques. The method relies on designing and creating the acoustic inventory of the language by taking real recorded speech, cutting it into segments and concatenating these segments back together during synthesis. The design and implementation of the acoustic database is a key factor for the performance of the synthesizer, since all the possible phone-to-phone transitions must be considered in order to minimize abrupt discontinuities and thus maximize the naturalness of the synthesized utterances.

查看原文本刊更多论文

希腊文-语音串联合成系统声学库的构建

NTUA开发的希腊语文本到语音(TTS)系统是基于连接合成的方法，并遵循贝尔实验室的方法来实现这一技术。连接合成是最简单的语音合成方法之一，同时也绕过了发音合成和形成峰合成技术所遇到的大部分问题。该方法依赖于设计和创建语言的声学库存，通过将真实录制的语音切割成片段，并在合成期间将这些片段连接在一起。声学数据库的设计和实现是合成器性能的关键因素，因为必须考虑所有可能的电话到电话转换，以尽量减少突然的不连续性，从而最大限度地提高合成话语的自然性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)

自引率

0.00%

发文量