C. Christogiannis, T. Varvarigou, Agatha Zappa, Yiannis Vamvakoulas, Chilin Shih, A. Arvaniti
{"title":"Construction of the acoustic inventory for a Greek text-to-speech concatenative synthesis system","authors":"C. Christogiannis, T. Varvarigou, Agatha Zappa, Yiannis Vamvakoulas, Chilin Shih, A. Arvaniti","doi":"10.1109/ICASSP.2000.859113","DOIUrl":null,"url":null,"abstract":"The development of the Greek text-to-speech (TTS) system by NTUA is based on the method of concatenative synthesis and follows the Bell Labs approach to this technique. Concatenative synthesis is one of the simplest methods for speech synthesis and at the same time bypasses most of the problems encountered by articulatory and formant synthesis techniques. The method relies on designing and creating the acoustic inventory of the language by taking real recorded speech, cutting it into segments and concatenating these segments back together during synthesis. The design and implementation of the acoustic database is a key factor for the performance of the synthesizer, since all the possible phone-to-phone transitions must be considered in order to minimize abrupt discontinuities and thus maximize the naturalness of the synthesized utterances.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.2000.859113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The development of the Greek text-to-speech (TTS) system by NTUA is based on the method of concatenative synthesis and follows the Bell Labs approach to this technique. Concatenative synthesis is one of the simplest methods for speech synthesis and at the same time bypasses most of the problems encountered by articulatory and formant synthesis techniques. The method relies on designing and creating the acoustic inventory of the language by taking real recorded speech, cutting it into segments and concatenating these segments back together during synthesis. The design and implementation of the acoustic database is a key factor for the performance of the synthesizer, since all the possible phone-to-phone transitions must be considered in order to minimize abrupt discontinuities and thus maximize the naturalness of the synthesized utterances.