ESSA (Enhanced speech synthesis approach) for Building Punjabi Voice Model

S. Gill, Gurgeet Kaur Sandhu
{"title":"ESSA (Enhanced speech synthesis approach) for Building Punjabi Voice Model","authors":"S. Gill, Gurgeet Kaur Sandhu","doi":"10.1109/Indo-TaiwanICAN48429.2020.9181352","DOIUrl":null,"url":null,"abstract":"This Paper presents the text to speech synthesis model using Random Forest Technique along with mixed excitation approach for decision making. Base model is developed by extracting the various voice features types (segment features, phoneme identity etc.) in statistical parametric synthesis approach, which is further enhanced with Random Forest criteria to redevelop the voice model. Twenty cluster trees are generated in Random forest from which one best is selected and used to create a voice model.In this paper for each developed text to speech model, the Mel-cepstral distortion scores are evaluated for comparative study.","PeriodicalId":171125,"journal":{"name":"2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Indo-TaiwanICAN48429.2020.9181352","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This Paper presents the text to speech synthesis model using Random Forest Technique along with mixed excitation approach for decision making. Base model is developed by extracting the various voice features types (segment features, phoneme identity etc.) in statistical parametric synthesis approach, which is further enhanced with Random Forest criteria to redevelop the voice model. Twenty cluster trees are generated in Random forest from which one best is selected and used to create a voice model.In this paper for each developed text to speech model, the Mel-cepstral distortion scores are evaluated for comparative study.
建立旁遮普语语音模型的增强语音合成方法
本文介绍了利用随机森林技术和混合激励方法进行决策的语音合成模型。在统计参数合成方法中提取各种语音特征类型(音段特征、音素同一性等)建立基本模型,再用随机森林准则对其进行增强,重新建立语音模型。在随机森林中生成20棵聚类树,从中选出最优的一棵用于创建语音模型。在本文中,对每个开发的文本到语音模型,mel -倒谱失真评分进行了比较研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信