{"title":"基于语音残差信息的激励建模","authors":"P. Lupini, V. Cuperman","doi":"10.1109/ICASSP.1992.225904","DOIUrl":null,"url":null,"abstract":"Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<<ETX>>","PeriodicalId":163713,"journal":{"name":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Excitation modeling based on speech residual information\",\"authors\":\"P. Lupini, V. Cuperman\",\"doi\":\"10.1109/ICASSP.1992.225904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<<ETX>>\",\"PeriodicalId\":163713,\"journal\":{\"name\":\"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1992-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1992.225904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1992.225904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Excitation modeling based on speech residual information
Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<>