基于分数的波形拼接和最优耦合平滑技术的文本到语音合成

International Journal of Recent Technology and Engineering Pub Date : 2020-05-30 DOI:10.35940/ijrte.a2530.059120

{"title":"基于分数的波形拼接和最优耦合平滑技术的文本到语音合成","authors":"","doi":"10.35940/ijrte.a2530.059120","DOIUrl":null,"url":null,"abstract":"Text to Speech System is a Speech Synthesis application that converts a text to speech. The current project focuses on developing a TTS System for the Tamil Language with the Synthesis Technique as Unit Selection Synthesis. Letter Level Segmentation of an input text helps in the reduction of corpus size compared to Syllable Level Segmentation. The segmented units are retrieved with respect to Unicode values, concatenated and the synthesized speech is produced. Intelligibility and Naturalness of the spoken word can be improved using the Smoothing Techniques. Optimal Coupling Smoothing Technique is implemented for the smooth transition in between the concatenated speech segments to create continuous Speech output like human voice. Fraction based Waveform Concatenation method is used to produce the intelligible speech segments as output from the pre-recorded speech database.","PeriodicalId":220909,"journal":{"name":"International Journal of Recent Technology and Engineering","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Text to Speech Synthesis using Fraction Based Waveform Concatenation and Optimal Coupling Smoothing Technique\",\"authors\":\"\",\"doi\":\"10.35940/ijrte.a2530.059120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text to Speech System is a Speech Synthesis application that converts a text to speech. The current project focuses on developing a TTS System for the Tamil Language with the Synthesis Technique as Unit Selection Synthesis. Letter Level Segmentation of an input text helps in the reduction of corpus size compared to Syllable Level Segmentation. The segmented units are retrieved with respect to Unicode values, concatenated and the synthesized speech is produced. Intelligibility and Naturalness of the spoken word can be improved using the Smoothing Techniques. Optimal Coupling Smoothing Technique is implemented for the smooth transition in between the concatenated speech segments to create continuous Speech output like human voice. Fraction based Waveform Concatenation method is used to produce the intelligible speech segments as output from the pre-recorded speech database.\",\"PeriodicalId\":220909,\"journal\":{\"name\":\"International Journal of Recent Technology and Engineering\",\"volume\":\"177 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Recent Technology and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35940/ijrte.a2530.059120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Recent Technology and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/ijrte.a2530.059120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

文本到语音系统是一个语音合成应用程序，将文本转换为语音。目前的项目侧重于开发泰米尔语的TTS系统，将综合技术作为单元选择综合。与音节级分词相比，输入文本的字母级分词有助于减少语料库大小。根据Unicode值检索分段单元，并将其连接起来，生成合成语音。使用平滑技术可以提高口语的可理解性和自然度。实现了最优耦合平滑技术，实现了连接语音段之间的平滑过渡，从而产生像人类语音一样的连续语音输出。采用基于分数的波形拼接方法，从预先录制的语音数据库中产生可理解的语音片段作为输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Text to Speech Synthesis using Fraction Based Waveform Concatenation and Optimal Coupling Smoothing Technique

Text to Speech System is a Speech Synthesis application that converts a text to speech. The current project focuses on developing a TTS System for the Tamil Language with the Synthesis Technique as Unit Selection Synthesis. Letter Level Segmentation of an input text helps in the reduction of corpus size compared to Syllable Level Segmentation. The segmented units are retrieved with respect to Unicode values, concatenated and the synthesized speech is produced. Intelligibility and Naturalness of the spoken word can be improved using the Smoothing Techniques. Optimal Coupling Smoothing Technique is implemented for the smooth transition in between the concatenated speech segments to create continuous Speech output like human voice. Fraction based Waveform Concatenation method is used to produce the intelligible speech segments as output from the pre-recorded speech database.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Recent Technology and Engineering

自引率

0.00%

发文量