A Maximum Entropy Markov Model for Prediction of Prosodic Phrase Boundaries in Chinese TTS
Ziping Zhao, Tingjian Zhao, Yaoting Zhu
2007 IEEE International Conference on Granular Computing (GRC 2007), published 2007-11-02
DOI: 10.1109/GrC.2007.66 (https://doi.org/10.1109/GrC.2007.66)
Citations: 7
Abstract
Hierarchical prosodic structure generation is a key component of a speech synthesis system. A major prosodic feature of Mandarin Chinese speech is the grouping of words into prosodic phrases. This paper proposes a method based on the maximum entropy Markov model (MEMM) for predicting prosodic phrase boundaries in unrestricted Chinese text. The MEMM, described in detail, effectively combines state transition probabilities with the conditional probabilities of states, which are estimated by the maximum entropy (ME) principle. The new model is compared with a maximum entropy model on prosodic phrase break prediction. Experiments show that, using the same feature set, the MEMM improves overall performance, with gains in both precision and recall.
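To make the idea of combining transition and per-state conditional probabilities concrete, the following is a minimal sketch of MEMM decoding, not the authors' implementation. It assumes a two-label scheme (B for a prosodic phrase boundary after a word, N for no boundary), hand-set weights instead of weights trained by maximum entropy estimation, and hypothetical feature names such as "before_punct" and "len=short"; none of these come from the paper. Each local distribution P(s | previous state, observation) is a log-linear (maximum entropy style) model, and Viterbi search finds the most probable label sequence.

```python
import math
from typing import Dict, List, Sequence

# Hypothetical label set: B = boundary after the word, N = no boundary.
STATES = ("B", "N")


def local_log_probs(prev_state: str, obs: Sequence[str],
                    weights: Dict[str, float]) -> Dict[str, float]:
    """Return log P(s | prev_state, obs) for every state s.

    The distribution is log-linear: each active feature is paired with the
    candidate state, the matching weights are summed, and the scores are
    normalised with a softmax.
    """
    scores = {}
    for s in STATES:
        active = [f"prev={prev_state}|s={s}"] + [f"{f}|s={s}" for f in obs]
        scores[s] = sum(weights.get(name, 0.0) for name in active)
    log_z = math.log(sum(math.exp(v) for v in scores.values()))
    return {s: scores[s] - log_z for s in STATES}


def viterbi(observations: Sequence[Sequence[str]],
            weights: Dict[str, float], start: str = "N") -> List[str]:
    """Most probable state sequence under the MEMM (log-space Viterbi)."""
    if not observations:
        return []
    # delta[s]: best log-probability of any path ending in state s.
    delta = local_log_probs(start, observations[0], weights)
    back_pointers = []
    for obs in observations[1:]:
        # One local distribution per possible previous state.
        local = {p: local_log_probs(p, obs, weights) for p in STATES}
        new_delta, pointer = {}, {}
        for s in STATES:
            best_prev = max(STATES, key=lambda p: delta[p] + local[p][s])
            new_delta[s] = delta[best_prev] + local[best_prev][s]
            pointer[s] = best_prev
        delta = new_delta
        back_pointers.append(pointer)
    # Trace back from the best final state.
    path = [max(STATES, key=lambda s: delta[s])]
    for pointer in reversed(back_pointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path


# Illustrative weights and features (assumptions, not values from the paper).
example_weights = {
    "before_punct|s=B": 4.0,  # punctuation follows: favour a boundary
    "len=short|s=N": 2.0,     # short word: favour no boundary
    "prev=B|s=B": -1.0,       # discourage two boundaries in a row
}
example_observations = [["len=short"], ["before_punct"], ["len=short"]]
example_path = viterbi(example_observations, example_weights)
print(example_path)
```

In this sketch the dependence on the previous state enters through the "prev=...|s=..." features, which is what distinguishes the MEMM from a maximum entropy classifier that labels each word independently. Dropping those features reduces the decoder to independent per-word decisions.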