Information Focus Synthesis Based on Question Answer Chain
Jing Wan, Han Ren
2009 International Conference on Asian Language Processing, published 2009-12-07
DOI: 10.1109/IALP.2009.16 (https://doi.org/10.1109/IALP.2009.16)
Abstract
Although speech synthesis technology has advanced considerably over the past ten years, there is still room for improvement. This paper describes a technique based on the joint treatment of information structure, syntax, and prosody, which yields noticeable improvements over existing speech synthesis systems. As an important parameter of prosodic processing in Mandarin, the prosodic distribution of information focus is central to perceived naturalness, speech understanding, and information acquisition. Because the mapping among information structure, syntax, and prosody is complex, we present an efficient method for retrieving information focus in order to improve the naturalness of synthesized speech. We use a question-answer chain to extract information foci and to determine how they move. We then apply feature classification and predictive prosody modeling to the F0 and duration of each focus to obtain a feature module. Using this feature module should significantly increase the accuracy and naturalness of speech synthesis. The rest of this paper is organized as follows. Section 2 summarizes previously proposed theory for information focus extraction and derives a new method. Experiments are described in Section 3, and experimental results are presented in Section 4. Concluding remarks are given in the final section.
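The abstract does not specify how the question-answer chain identifies a focus, so the following is only a toy sketch of the general idea, not the authors' method. It assumes the common heuristic that the focus of an answer is the material not already given in the question, and it tracks the token positions of that material so that focus movement across a chain of questions becomes visible. The names `tokenize`, `extract_focus`, and `focus_chain` are hypothetical, and simple English word splitting stands in for Mandarin word segmentation.

```python
import re
from typing import List, Tuple

Focus = List[Tuple[int, str]]  # (token position in the answer, token)


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into word tokens (assumed stand-in for segmentation)."""
    return re.findall(r"\w+", text.lower())


def extract_focus(question: str, answer: str) -> Focus:
    """Return answer tokens absent from the question, with their positions.

    Toy heuristic: tokens already present in the question count as given
    information; the remaining answer tokens are treated as the focus.
    """
    given = set(tokenize(question))
    return [(i, tok) for i, tok in enumerate(tokenize(answer)) if tok not in given]


def focus_chain(pairs: List[Tuple[str, str]]) -> List[Focus]:
    """Apply extract_focus to each (question, answer) pair in a chain."""
    return [extract_focus(q, a) for q, a in pairs]


# Hypothetical chain: the same answer sentence under two different questions.
chain = [
    ("Who is in the library?", "Wang is in the library."),
    ("Where is Wang?", "Wang is in the library."),
]
for (question, _), focus in zip(chain, focus_chain(chain)):
    print(question, "->", focus)
```

In this sketch the same answer sentence receives a different focus depending on the question, moving from the subject position to the locative phrase. A real system would then predict F0 and duration for the focused positions, a step this example does not attempt.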