A Study on Minimum Spectral Error Analysis of Speech
Takuma Hayasaka, Takashi Nose, A. Ito
2020 IEEE 9th Global Conference on Consumer Electronics (GCCE), published 2020-10-13
DOI: 10.1109/GCCE50665.2020.9291840 (https://doi.org/10.1109/GCCE50665.2020.9291840)
Citation count: 0
Abstract
Conventional source-filter vocoders such as WORLD synthesize speech quickly, but errors in speech parameter extraction degrade the quality of the synthetic speech. This paper therefore proposes minimum spectral error analysis, a speech analysis method that extracts speech parameters by Analysis-by-Synthesis (A-b-S), to improve the quality of speech synthesized by WORLD. The speech parameters are updated to minimize the error between the amplitude spectra of natural and synthetic speech. To perform this analysis, we developed a procedure that calculates the amplitude spectrum of synthetic speech from the speech parameters. A preliminary experiment shows that this calculation procedure was constructed successfully.
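The analysis-by-synthesis idea described above can be illustrated with a small sketch. This is not the authors' calculation process and it does not use WORLD: it assumes a toy source-filter model in which the synthetic amplitude spectrum is a fixed excitation amplitude multiplied by a spectral envelope, and it assumes a squared log-amplitude error updated by plain gradient descent. The names `synth_amplitude`, `spectral_error`, and `minimize_spectral_error`, as well as the step size and iteration count, are hypothetical choices made for illustration only.

```python
import numpy as np


def synth_amplitude(log_env, excitation_amp):
    """Toy source-filter model (an assumption, not WORLD):
    amplitude spectrum = excitation amplitude * spectral envelope."""
    return excitation_amp * np.exp(log_env)


def spectral_error(log_env, excitation_amp, target_amp):
    """Half the summed squared difference between log-amplitude spectra."""
    diff = np.log(synth_amplitude(log_env, excitation_amp)) - np.log(target_amp)
    return 0.5 * float(np.sum(diff ** 2))


def minimize_spectral_error(target_amp, excitation_amp, init_log_env,
                            lr=0.5, n_iter=50):
    """Analysis-by-synthesis loop: synthesize, compare, update parameters.

    For this toy model the gradient of the error with respect to each
    log-envelope bin equals the log-amplitude difference in that bin.
    """
    log_env = np.array(init_log_env, dtype=float)
    for _ in range(n_iter):
        # Synthesis step: amplitude spectrum from the current parameters.
        synth = synth_amplitude(log_env, excitation_amp)
        # Comparison step: per-bin log-amplitude error (also the gradient).
        grad = np.log(synth) - np.log(target_amp)
        # Update step: move the parameters against the gradient.
        log_env -= lr * grad
    return log_env
```

In this sketch the "natural" spectrum is simply a target array supplied by the caller. A real system would obtain it from recorded speech and would replace `synth_amplitude` with the vocoder's own mapping from speech parameters to the amplitude spectrum.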