Multisensor very lowbit rate speech coding using segment quantization

2008 IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 2008-05-12 DOI:10.1109/ICASSP.2008.4518530

A. McCree, K. Brady, T. Quatieri

引用次数: 12

Abstract

We present two approaches to noise robust very low bit rate speech coding using wideband MELP analysis/synthesis. Both methods exploit multiple acoustic and non-acoustic input sensors, using our previously-presented dynamic waveform fusion algorithm to simultaneously perform waveform fusion, noise suppression, and cross-channel noise cancellation. One coder uses a 600 bps scalable phonetic vocoder, with a phonetic speech recognizer followed by joint predictive vector quantization of the error in wideband MELP parameters. The second coder operates at 300 bps with fixed 80 ms segments, using novel variable-rate multistage matrix quantization techniques. Formal test results show that both coders achieve equivalent intelligibility to the 2.4 kbps NATO standard MELPe coder in harsh acoustic noise environments, at much lower bit rates, with only modest quality loss.

查看原文本刊更多论文

多传感器非常低比特率语音编码使用分段量化

我们提出了两种使用宽带MELP分析/合成实现噪声鲁棒的超低比特率语音编码的方法。这两种方法都利用多个声学和非声学输入传感器，使用我们之前提出的动态波形融合算法同时进行波形融合、噪声抑制和跨通道噪声消除。一个编码器使用600 bps可扩展的语音声码器，带语音识别器，然后对宽带MELP参数中的误差进行联合预测向量量化。第二个编码器以300bps的速度运行，使用新颖的可变速率多级矩阵量化技术，固定80ms段。正式测试结果表明，在恶劣的噪声环境下，这两种编码器都能以更低的比特率实现与2.4 kbps北约标准MELPe编码器相当的清晰度，并且只有适度的质量损失。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

自引率

0.00%

发文量