{"title":"ESSA (Enhanced speech synthesis approach) for Building Punjabi Voice Model","authors":"S. Gill, Gurgeet Kaur Sandhu","doi":"10.1109/Indo-TaiwanICAN48429.2020.9181352","DOIUrl":null,"url":null,"abstract":"This Paper presents the text to speech synthesis model using Random Forest Technique along with mixed excitation approach for decision making. Base model is developed by extracting the various voice features types (segment features, phoneme identity etc.) in statistical parametric synthesis approach, which is further enhanced with Random Forest criteria to redevelop the voice model. Twenty cluster trees are generated in Random forest from which one best is selected and used to create a voice model.In this paper for each developed text to speech model, the Mel-cepstral distortion scores are evaluated for comparative study.","PeriodicalId":171125,"journal":{"name":"2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Indo-TaiwanICAN48429.2020.9181352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This Paper presents the text to speech synthesis model using Random Forest Technique along with mixed excitation approach for decision making. Base model is developed by extracting the various voice features types (segment features, phoneme identity etc.) in statistical parametric synthesis approach, which is further enhanced with Random Forest criteria to redevelop the voice model. Twenty cluster trees are generated in Random forest from which one best is selected and used to create a voice model.In this paper for each developed text to speech model, the Mel-cepstral distortion scores are evaluated for comparative study.