基于统计修正的hmm语音合成后滤波技术

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI:10.1109/ISCSLP.2012.6423456

Zhengqi Wen, J. Tao, Hao Che

{"title":"基于统计修正的hmm语音合成后滤波技术","authors":"Zhengqi Wen, J. Tao, Hao Che","doi":"10.1109/ISCSLP.2012.6423456","DOIUrl":null,"url":null,"abstract":"The speech generated from hidden Markov model (HMM)-based speech synthesis systems (HTS) is suffered from over-smoothing problem which is due to statistical modeling. This paper will focus on post-filtering technique based on statistical modification for the generated speech parameters. The marginal statistics of parameters' trajectory, such as mean, variance, skewness and kurtosis are adjusted according to the values generated from the HTS system. This technique is compared with global variance (GV)-based speech generation algorithm. The listening test showed that the post-filtering technique considering the mean and variance could generate almost equal result with GV model. When further considering the modification of skewness and kurtosis, the quality of generated speech has been improved.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Statistical modification based post-filtering technique for HMM-based speech synthesis\",\"authors\":\"Zhengqi Wen, J. Tao, Hao Che\",\"doi\":\"10.1109/ISCSLP.2012.6423456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The speech generated from hidden Markov model (HMM)-based speech synthesis systems (HTS) is suffered from over-smoothing problem which is due to statistical modeling. This paper will focus on post-filtering technique based on statistical modification for the generated speech parameters. The marginal statistics of parameters' trajectory, such as mean, variance, skewness and kurtosis are adjusted according to the values generated from the HTS system. This technique is compared with global variance (GV)-based speech generation algorithm. The listening test showed that the post-filtering technique considering the mean and variance could generate almost equal result with GV model. When further considering the modification of skewness and kurtosis, the quality of generated speech has been improved.\",\"PeriodicalId\":186099,\"journal\":{\"name\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"volume\":\"85 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 8th International Symposium on Chinese Spoken Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCSLP.2012.6423456\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

基于隐马尔可夫模型(HMM)的语音合成系统(HTS)由于统计建模而产生的语音存在过平滑问题。本文将重点研究基于统计修正的语音参数后滤波技术。根据HTS系统生成的值调整参数轨迹的边际统计量，如均值、方差、偏度和峰度。并与基于全局方差(GV)的语音生成算法进行了比较。听力测试表明，考虑均值和方差的后滤波技术可以得到与GV模型几乎相等的结果。进一步考虑对偏度和峰度的修正，提高了生成语音的质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Statistical modification based post-filtering technique for HMM-based speech synthesis

The speech generated from hidden Markov model (HMM)-based speech synthesis systems (HTS) is suffered from over-smoothing problem which is due to statistical modeling. This paper will focus on post-filtering technique based on statistical modification for the generated speech parameters. The marginal statistics of parameters' trajectory, such as mean, variance, skewness and kurtosis are adjusted according to the values generated from the HTS system. This technique is compared with global variance (GV)-based speech generation algorithm. The listening test showed that the post-filtering technique considering the mean and variance could generate almost equal result with GV model. When further considering the modification of skewness and kurtosis, the quality of generated speech has been improved.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量