{"title":"Arabic HMM-based speech synthesis","authors":"K. M. Khalil, Cherif Adnan","doi":"10.1109/ICEESA.2013.6578437","DOIUrl":null,"url":null,"abstract":"This paper describes the Arabic system synthesis on hidden Markov models (HTS). Our developed synthesis system uses phonemes as HMM synthesis unit, Arabic database was developed for the first test. The main objective is to maintain the consolidated text coherence which is interpreted by concatenating HMM phoneme. In our experiments, spectral properties were represented by Mel cepstrum coefficients. For the waveform synthesis, a noise or pulse excited corresponding MLSA filter was utilized. Besides that basic setup, a high-quality analysis/synthesis system STRAIGHT was employed for more sophisticated speech representation. This method has several advantages. As it is parametric, it is possible to play on the HMM parameters, change the producer voice characteristics. The developed model improves the speech synthesis, naturalness and intelligibility quality in the Arabic language environment.","PeriodicalId":212631,"journal":{"name":"2013 International Conference on Electrical Engineering and Software Applications","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Electrical Engineering and Software Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEESA.2013.6578437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23
Abstract
This paper describes the Arabic system synthesis on hidden Markov models (HTS). Our developed synthesis system uses phonemes as HMM synthesis unit, Arabic database was developed for the first test. The main objective is to maintain the consolidated text coherence which is interpreted by concatenating HMM phoneme. In our experiments, spectral properties were represented by Mel cepstrum coefficients. For the waveform synthesis, a noise or pulse excited corresponding MLSA filter was utilized. Besides that basic setup, a high-quality analysis/synthesis system STRAIGHT was employed for more sophisticated speech representation. This method has several advantages. As it is parametric, it is possible to play on the HMM parameters, change the producer voice characteristics. The developed model improves the speech synthesis, naturalness and intelligibility quality in the Arabic language environment.