基于语音残差信息的激励建模

[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1992-03-23 DOI:10.1109/ICASSP.1992.225904

P. Lupini, V. Cuperman

{"title":"基于语音残差信息的激励建模","authors":"P. Lupini, V. Cuperman","doi":"10.1109/ICASSP.1992.225904","DOIUrl":null,"url":null,"abstract":"Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<<ETX>>","PeriodicalId":163713,"journal":{"name":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Excitation modeling based on speech residual information\",\"authors\":\"P. Lupini, V. Cuperman\",\"doi\":\"10.1109/ICASSP.1992.225904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<<ETX>>\",\"PeriodicalId\":163713,\"journal\":{\"name\":\"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1992-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1992.225904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1992.225904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

基于码激励线性预测(CELP)的语音编解码器传统上使用自适应短期滤波器、自适应码本(长期滤波器)和固定(随机)激励码本。基于未量化残差的特点，研究了用自适应码本代替固定随机码本的可能性。在一个典型的4kb /s的CELP编解码器中，作者利用未量化残差的谱幅值和相位实验估计了激励码本自适应所能获得的性能改进的上界。结果表明，仅基于光谱幅度(包括基于分形的码本)的自适应方法不太可能产生显著的改善。另一方面，基于频谱相位信息的自适应在提高CELP语音质量方面显示出巨大的潜力。作者还介绍了一项初步试验的结果，该试验旨在研究量化噪声对激励码本的相位自适应的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Excitation modeling based on speech residual information

Speech codecs based on code excited linear prediction (CELP) traditionally use an adaptive short-term filter, an adaptive codebook (long-term filter), and a fixed (stochastic) excitation codebook. The authors examined the possibility of replacing the fixed stochastic codebook by an adaptive codebook with adaptation based on the characteristics of the unquantized residual. In a typical 4-kb/s CELP codec, the authors use the spectral magnitude and phase of the unquantized residual to experimentally estimate an upper bound on the performance improvement which could be obtained by excitation codebook adaptation. The results suggest that adaptation methods based only on the spectral magnitude (including fractal-based codebooks) are unlikely to result in significant improvement. Adaptation based on the spectral phase information, on the other, shows a significant potential for improving CELP speech quality. The authors also present results of a preliminary test designed to investigate the effect of quantization noise on phased-based adaptation of excitation codebooks.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量