{"title":"Enhancing waveform interpolative coding with weighted REW parametric quantization","authors":"O. Gottesman, A. Gersho","doi":"10.1109/SCFT.2000.878391","DOIUrl":null,"url":null,"abstract":"This paper presents an efficient quantization technique for the rapidly-evolving waveforms in waveform interpolative (WI) coders. The scheme, based on a parametrization of the rapidly-evolving waveform (REW) magnitude, and analysis-by-synthesis (AbS) vector quantization (VQ) of the REW parameters, allows both higher temporal and spectral resolution of the REW. A perceptually weighted distortion measure takes advantage of spectral and temporal masking and leads to improved reconstructed speech quality, most notably in mixed voiced and unvoiced speech segments. The technique is an important component of the enhanced waveform interpolative (EWI) speech coder at 2.8 kbps that achieves a subjective quality slightly better than that of G.723.1 at 6.3 kbps.","PeriodicalId":359453,"journal":{"name":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCFT.2000.878391","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper presents an efficient quantization technique for the rapidly-evolving waveforms in waveform interpolative (WI) coders. The scheme, based on a parametrization of the rapidly-evolving waveform (REW) magnitude, and analysis-by-synthesis (AbS) vector quantization (VQ) of the REW parameters, allows both higher temporal and spectral resolution of the REW. A perceptually weighted distortion measure takes advantage of spectral and temporal masking and leads to improved reconstructed speech quality, most notably in mixed voiced and unvoiced speech segments. The technique is an important component of the enhanced waveform interpolative (EWI) speech coder at 2.8 kbps that achieves a subjective quality slightly better than that of G.723.1 at 6.3 kbps.