{"title":"Mandarin syllable signal synthesis using an HNM based scheme","authors":"H. Gu, Yen-Zuo Zhou","doi":"10.1109/ICALIP.2008.4590153","DOIUrl":null,"url":null,"abstract":"In this paper, HNM (harmonic plus noise model) is enhanced and used to design a scheme for synthesizing Mandarin syllable signals. Each syllable is recorded once only and used to synthesize syllable signals with diverse prosodic characteristics without suffering significant signal-quality degradation. For a control point on the synthetic syllable's time axis, two corresponding analysis frames' HNM parameters are interpolated to derive the HNM parameters for the control point. Furthermore, for pitch-contour tuning, another timbre-reserving interpolation is performed for the HNM parameters on a control point. Then, signal samples are synthesized with the HNM synthesis equations rewritten here. According to the result of the perception tests, the HNM based scheme proposed here can indeed be used to synthesize syllable signals with consistent timbre and high signal clarity.","PeriodicalId":175885,"journal":{"name":"2008 International Conference on Audio, Language and Image Processing","volume":"44 15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Audio, Language and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2008.4590153","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, HNM (harmonic plus noise model) is enhanced and used to design a scheme for synthesizing Mandarin syllable signals. Each syllable is recorded once only and used to synthesize syllable signals with diverse prosodic characteristics without suffering significant signal-quality degradation. For a control point on the synthetic syllable's time axis, two corresponding analysis frames' HNM parameters are interpolated to derive the HNM parameters for the control point. Furthermore, for pitch-contour tuning, another timbre-reserving interpolation is performed for the HNM parameters on a control point. Then, signal samples are synthesized with the HNM synthesis equations rewritten here. According to the result of the perception tests, the HNM based scheme proposed here can indeed be used to synthesize syllable signals with consistent timbre and high signal clarity.