Hideki Kawahara, M. Morise, Toru Takahashi, T. Irino, Hideki Banno, O. Fujimura
{"title":"声事件表示的群延迟及其在语音非周期分析中的应用","authors":"Hideki Kawahara, M. Morise, Toru Takahashi, T. Irino, Hideki Banno, O. Fujimura","doi":"10.5281/ZENODO.40659","DOIUrl":null,"url":null,"abstract":"A new framework is proposed for representing acoustic events based on bandwise durations derived from a group delay function and bandwise aperiodicity indices. The goal is to provide an efficient and detailed source information for a high-quality speech manipulation system, STRAIGHT. The proposed representation enables event based processing of speech parameters and provides means to fill the gap between waveform based methods and VOCODERs in a perceptually relevant manner. Simulations using a pulse plus noise source and a time varying filter demonstrated that the proposed method provides accurate estimates of the source aperiodicity. Application of the proposed method to STRAIGHT illustrated that it enables significant reduction in storage size and improves reproduced sound quality.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Group delay for acoustic event representation and its application for speech aperiodicity analysis\",\"authors\":\"Hideki Kawahara, M. Morise, Toru Takahashi, T. Irino, Hideki Banno, O. Fujimura\",\"doi\":\"10.5281/ZENODO.40659\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new framework is proposed for representing acoustic events based on bandwise durations derived from a group delay function and bandwise aperiodicity indices. The goal is to provide an efficient and detailed source information for a high-quality speech manipulation system, STRAIGHT. The proposed representation enables event based processing of speech parameters and provides means to fill the gap between waveform based methods and VOCODERs in a perceptually relevant manner. Simulations using a pulse plus noise source and a time varying filter demonstrated that the proposed method provides accurate estimates of the source aperiodicity. Application of the proposed method to STRAIGHT illustrated that it enables significant reduction in storage size and improves reproduced sound quality.\",\"PeriodicalId\":176384,\"journal\":{\"name\":\"2007 15th European Signal Processing Conference\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 15th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.40659\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 15th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.40659","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Group delay for acoustic event representation and its application for speech aperiodicity analysis
A new framework is proposed for representing acoustic events based on bandwise durations derived from a group delay function and bandwise aperiodicity indices. The goal is to provide an efficient and detailed source information for a high-quality speech manipulation system, STRAIGHT. The proposed representation enables event based processing of speech parameters and provides means to fill the gap between waveform based methods and VOCODERs in a perceptually relevant manner. Simulations using a pulse plus noise source and a time varying filter demonstrated that the proposed method provides accurate estimates of the source aperiodicity. Application of the proposed method to STRAIGHT illustrated that it enables significant reduction in storage size and improves reproduced sound quality.