{"title":"Syllable Specific Unit Selection Cost Functions for Text-to-Speech Synthesis","authors":"N. Narendra, K. S. Rao","doi":"10.1145/2382434.2382435","DOIUrl":null,"url":null,"abstract":"This paper presents the design and development of syllable specific unit selection cost functions for improving the quality of text-to-speech synthesis. Appropriate unit selection cost functions, namely concatenation cost and target cost, are proposed for syllable based synthesis. Concatenation costs are defined based on the type of segments present at the syllable joins. Proposed concatenation costs have shown significant reduction in perceptual discontinuity at syllable joins. Three-stage target cost formulation is proposed for selecting appropriate units from database. Subjective evaluation has shown improvement in the quality of speech at each stage.","PeriodicalId":412532,"journal":{"name":"ACM Trans. Speech Lang. Process.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Trans. Speech Lang. Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2382434.2382435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
This paper presents the design and development of syllable specific unit selection cost functions for improving the quality of text-to-speech synthesis. Appropriate unit selection cost functions, namely concatenation cost and target cost, are proposed for syllable based synthesis. Concatenation costs are defined based on the type of segments present at the syllable joins. Proposed concatenation costs have shown significant reduction in perceptual discontinuity at syllable joins. Three-stage target cost formulation is proposed for selecting appropriate units from database. Subjective evaluation has shown improvement in the quality of speech at each stage.