HMM-based TTS System Framework
Saly Keo, Soky Kak, Y. Shiga, H. Kato, H. Kawai
2019 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr), 2019-05-01
DOI: https://doi.org/10.1109/CIFEr.2019.8759128
Citations: 0
Abstract
This research focuses on the use of a hidden Markov model (HMM) to build a Khmer text-to-speech (TTS) system. Although the system is based on an HMM statistical model, language-specific functions were newly designed and developed to cope with the orthographic and grammatical nature of Khmer, including word segmentation, grapheme-to-phoneme conversion, and definitions of full-context labels and question sets. In total, four thousand phonemically balanced Khmer sentences were read aloud by an adult male speaker of Khmer and then used to train a model for Khmer TTS. The system has been incorporated into VoiceTra, a multilingual speech-to-speech translation app developed and maintained by NICT. The app is publicly released for mobile devices and is available for download from both the App Store and the Google Play store.
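
The language-specific front-end steps named in the abstract (word segmentation, grapheme-to-phoneme conversion, and context labels) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say which algorithms were used, so greedy longest-match segmentation and lexicon lookup are assumptions chosen for brevity. The toy lexicon, the function names, and the simplified `left-current+right` label format are all hypothetical. Full-context labels in HMM-based synthesis typically encode much more context than the neighbouring phonemes shown here.

```python
from typing import Dict, List

# Hypothetical toy lexicon mapping unspaced spellings to phoneme lists.
# These are placeholder strings, not real Khmer orthography or phonology.
TOY_LEXICON: Dict[str, List[str]] = {
    "ka": ["k", "a"],
    "kam": ["k", "a", "m"],
    "po": ["p", "o"],
    "ti": ["t", "i"],
}


def segment(text: str, lexicon: Dict[str, List[str]]) -> List[str]:
    """Split unspaced text into words by greedy longest match (an assumption)."""
    max_len = max(len(word) for word in lexicon)
    words: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if candidate in lexicon:
                words.append(candidate)
                i += length
                break
        else:
            raise ValueError(f"no lexicon entry matches at position {i}")
    return words


def to_phonemes(words: List[str], lexicon: Dict[str, List[str]]) -> List[str]:
    """Grapheme-to-phoneme conversion by plain lexicon lookup (an assumption)."""
    return [phoneme for word in words for phoneme in lexicon[word]]


def context_labels(phonemes: List[str]) -> List[str]:
    """Build simplified 'left-current+right' labels, padded with 'sil'."""
    padded = ["sil"] + phonemes + ["sil"]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


words = segment("kampoti", TOY_LEXICON)
phonemes = to_phonemes(words, TOY_LEXICON)
labels = context_labels(phonemes)
print(words)
print(labels)
```

In this sketch the longest match wins, so the input `kampoti` begins with the word `kam` rather than `ka`. A production system would likely need to handle out-of-vocabulary words and ambiguous segmentations, which this example does not attempt.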