在ADPCM技术中使用Volterra滤波器进行语音编码:一个全面的研究

European Transactions on Telecommunications Pub Date : 2011-03-01 DOI:10.1002/ett.1440

G. Alipoor, M. Savoji

{"title":"在ADPCM技术中使用Volterra滤波器进行语音编码:一个全面的研究","authors":"G. Alipoor, M. Savoji","doi":"10.1002/ett.1440","DOIUrl":null,"url":null,"abstract":"Although linear filters are useful in a various applications in the context of speech processing, there are several evidences for existence of nonlinearity in speech signals. Our main aim is to launch a comprehensive investigation into the exploitation of nonlinear Volterra filters in the context of the ADPCM-based speech coding technique, using two methods of forward prediction, based on the LS criterion, and backward prediction, based on both LMS and RLS adaptation algorithms. In any case, after solving some innate problems, for example, ill-conditioning and instability, schemes for optimum exploitation of nonlinear prediction are developed and simulation results are provided, tested with several performance criteria. With forward prediction a scheme is developed to detect and flag those frames for which, after stabilizing, including the quadratic predictor is beneficial. Scalar and vector quantisation methods are used for quantising the residual signal and the filter parameters, respectively. The results show that using this scheme a negligible improvement (up to 0.62 dB in the SNR) can be achieved, in spite of the increase in bit rate and complexity. With backward prediction two frame-based schemes are developed in which for each frame, after examining a set of quadratic filters, the best filter in the sense of the best quality of the reconstructed speech is selected. The ultimate schemes result in an improvement of up to 1.5 dB in the overall SNR of the reconstructed speech at the cost of a slight increase in the bit-rate, a short delay and a demanding increase in the complexity. Copyright © 2010 John Wiley & Sons, Ltd.","PeriodicalId":50473,"journal":{"name":"European Transactions on Telecommunications","volume":"4 1","pages":"81-92"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Employing Volterra filters in the ADPCM technique for speech coding: a comprehensive investigation\",\"authors\":\"G. Alipoor, M. Savoji\",\"doi\":\"10.1002/ett.1440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although linear filters are useful in a various applications in the context of speech processing, there are several evidences for existence of nonlinearity in speech signals. Our main aim is to launch a comprehensive investigation into the exploitation of nonlinear Volterra filters in the context of the ADPCM-based speech coding technique, using two methods of forward prediction, based on the LS criterion, and backward prediction, based on both LMS and RLS adaptation algorithms. In any case, after solving some innate problems, for example, ill-conditioning and instability, schemes for optimum exploitation of nonlinear prediction are developed and simulation results are provided, tested with several performance criteria. With forward prediction a scheme is developed to detect and flag those frames for which, after stabilizing, including the quadratic predictor is beneficial. Scalar and vector quantisation methods are used for quantising the residual signal and the filter parameters, respectively. The results show that using this scheme a negligible improvement (up to 0.62 dB in the SNR) can be achieved, in spite of the increase in bit rate and complexity. With backward prediction two frame-based schemes are developed in which for each frame, after examining a set of quadratic filters, the best filter in the sense of the best quality of the reconstructed speech is selected. The ultimate schemes result in an improvement of up to 1.5 dB in the overall SNR of the reconstructed speech at the cost of a slight increase in the bit-rate, a short delay and a demanding increase in the complexity. Copyright © 2010 John Wiley & Sons, Ltd.\",\"PeriodicalId\":50473,\"journal\":{\"name\":\"European Transactions on Telecommunications\",\"volume\":\"4 1\",\"pages\":\"81-92\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Transactions on Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/ett.1440\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Transactions on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/ett.1440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

尽管线性滤波器在语音处理的各种应用中都很有用，但有一些证据表明语音信号中存在非线性。我们的主要目的是全面研究非线性Volterra滤波器在基于adpcm的语音编码技术中的应用，采用两种方法，基于LS准则的前向预测和基于LMS和RLS自适应算法的后向预测。在任何情况下，在解决了一些固有问题(如病态和不稳定性)之后，开发了非线性预测的最佳利用方案，并提供了仿真结果，并用几个性能标准进行了测试。对于前向预测，开发了一种方案来检测和标记那些在稳定后，包含二次预测器是有益的帧。分别采用标量量化和矢量量化方法对残差信号和滤波器参数进行量化。结果表明，尽管比特率和复杂度增加，但使用该方案可以实现可忽略不计的改进(信噪比高达0.62 dB)。对于后向预测，提出了两种基于帧的方案，其中对每一帧，在检查一组二次滤波器后，选择重构语音质量最佳的最佳滤波器。最终方案的结果是重建语音的整体信噪比提高了1.5 dB，但代价是比特率略有增加，延迟较短，复杂性增加。版权所有©2010 John Wiley & Sons, Ltd

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Employing Volterra filters in the ADPCM technique for speech coding: a comprehensive investigation

Although linear filters are useful in a various applications in the context of speech processing, there are several evidences for existence of nonlinearity in speech signals. Our main aim is to launch a comprehensive investigation into the exploitation of nonlinear Volterra filters in the context of the ADPCM-based speech coding technique, using two methods of forward prediction, based on the LS criterion, and backward prediction, based on both LMS and RLS adaptation algorithms. In any case, after solving some innate problems, for example, ill-conditioning and instability, schemes for optimum exploitation of nonlinear prediction are developed and simulation results are provided, tested with several performance criteria. With forward prediction a scheme is developed to detect and flag those frames for which, after stabilizing, including the quadratic predictor is beneficial. Scalar and vector quantisation methods are used for quantising the residual signal and the filter parameters, respectively. The results show that using this scheme a negligible improvement (up to 0.62 dB in the SNR) can be achieved, in spite of the increase in bit rate and complexity. With backward prediction two frame-based schemes are developed in which for each frame, after examining a set of quadratic filters, the best filter in the sense of the best quality of the reconstructed speech is selected. The ultimate schemes result in an improvement of up to 1.5 dB in the overall SNR of the reconstructed speech at the cost of a slight increase in the bit-rate, a short delay and a demanding increase in the complexity. Copyright © 2010 John Wiley & Sons, Ltd.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

European Transactions on Telecommunications 工程技术-电信学

自引率

0.00%

发文量

审稿时长

9 months