ESSA (Enhanced speech synthesis approach) for Building Punjabi Voice Model

2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN) Pub Date : 2020-02-01 DOI:10.1109/Indo-TaiwanICAN48429.2020.9181352

S. Gill, Gurgeet Kaur Sandhu

引用次数: 0

Abstract

This Paper presents the text to speech synthesis model using Random Forest Technique along with mixed excitation approach for decision making. Base model is developed by extracting the various voice features types (segment features, phoneme identity etc.) in statistical parametric synthesis approach, which is further enhanced with Random Forest criteria to redevelop the voice model. Twenty cluster trees are generated in Random forest from which one best is selected and used to create a voice model.In this paper for each developed text to speech model, the Mel-cepstral distortion scores are evaluated for comparative study.

查看原文本刊更多论文

建立旁遮普语语音模型的增强语音合成方法

本文介绍了利用随机森林技术和混合激励方法进行决策的语音合成模型。在统计参数合成方法中提取各种语音特征类型(音段特征、音素同一性等)建立基本模型，再用随机森林准则对其进行增强，重新建立语音模型。在随机森林中生成20棵聚类树，从中选出最优的一棵用于创建语音模型。在本文中，对每个开发的文本到语音模型，mel -倒谱失真评分进行了比较研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 Indo – Taiwan 2nd International Conference on Computing, Analytics and Networks (Indo-Taiwan ICAN)

自引率

0.00%

发文量