A. Deka, Priyankoo Sarmah, K. Samudravijaya, S. Prasanna
{"title":"Development of Assamese Text-to-speech System using Deep Neural Network","authors":"A. Deka, Priyankoo Sarmah, K. Samudravijaya, S. Prasanna","doi":"10.1109/NCC.2019.8732262","DOIUrl":null,"url":null,"abstract":"This paper describes the development of a text-to-speech system for Assamese language, using Deep Neural Network (DNN). The system is trained with speech data, collected by a consortium, that is available free of cost for academic use. The DNN based method eliminates the need for a grapheme to phoneme conversion; rather, it synthesizes speech directly from the UTF-8 based Assamese script. The results of objective and subjective evaluations confirm that the Assamese speech synthesized using DNN approach is better than the ones synthesized using the traditional hidden Markov model based text-to-speech system.","PeriodicalId":6870,"journal":{"name":"2019 National Conference on Communications (NCC)","volume":"181 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2019.8732262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
This paper describes the development of a text-to-speech system for Assamese language, using Deep Neural Network (DNN). The system is trained with speech data, collected by a consortium, that is available free of cost for academic use. The DNN based method eliminates the need for a grapheme to phoneme conversion; rather, it synthesizes speech directly from the UTF-8 based Assamese script. The results of objective and subjective evaluations confirm that the Assamese speech synthesized using DNN approach is better than the ones synthesized using the traditional hidden Markov model based text-to-speech system.