A robust variable-rate speech coder

1995 International Conference on Acoustics, Speech, and Signal Processing | Pub Date: 1995-05-09 | DOI: 10.1109/ICASSP.1995.479520

A. Shen, Benjamim Tang, A. Alwan, G. Pottie

Citations: 9

Abstract

The goal of this study is to develop a robust, high-quality speech coder for wireless communication. The proposed coder is a perceptually based, variable-rate subband coder. The perceptual metric, which is based on the signal-to-mask ratio computed over short-time frames of the input signal, ensures that encoding is optimized for the human listener. An adaptive bit allocation scheme is employed, and the subband energies are then quantized using a Max-Lloyd quantizer. The coder is fully scalable: increasing the bit rate improves the quality of the encoded speech. Subjective listening tests with quiet and noisy input signals indicate that the proposed coder produces high-quality speech when operating at 12 kbps or higher. In error-free conditions, the coder performs comparably to the QCELP and GSM coders. For speech in background noise, however, the coder at 12 kbps significantly outperforms QCELP, and for music it outperforms both QCELP and GSM.
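The abstract does not spell out the bit allocation rule, so the following is only a minimal sketch of one common way a signal-to-mask ratio (SMR) can drive adaptive bit allocation across subbands. It is not the authors' algorithm. The function name `allocate_bits`, the greedy strategy, the 6.02 dB-per-bit rule of thumb, the per-band cap, and the early stop once all quantization noise is masked are assumptions made for illustration.

```python
def allocate_bits(smr_db, total_bits, max_bits_per_band=8, db_per_bit=6.02):
    """Greedy perceptual bit allocation (illustrative assumption, not the paper's method).

    smr_db: per-subband signal-to-mask ratios in dB for one frame.
    total_bits: bit budget for the frame.
    Returns a list with the number of bits assigned to each subband.
    """
    bits = [0] * len(smr_db)
    # Noise-to-mask ratio (NMR) = SMR - SNR. With zero bits the quantizer
    # SNR is taken to be 0 dB, so NMR starts out equal to SMR.
    nmr = list(smr_db)
    for _ in range(total_bits):
        candidates = [i for i in range(len(bits)) if bits[i] < max_bits_per_band]
        if not candidates:
            break
        # Pick the band whose quantization noise is most audible.
        worst = max(candidates, key=lambda i: nmr[i])
        if nmr[worst] <= 0:
            # All remaining noise is at or below the masking threshold, so the
            # frame can use fewer bits (one assumed source of a variable rate).
            break
        bits[worst] += 1
        nmr[worst] -= db_per_bit  # each extra bit buys roughly 6 dB of SNR
    return bits


if __name__ == "__main__":
    frame_smr = [20.0, 5.0, -3.0, 12.0]  # hypothetical SMR values in dB
    print(allocate_bits(frame_smr, total_bits=6))   # [3, 1, 0, 2]
    print(allocate_bits(frame_smr, total_bits=20))  # [4, 1, 0, 2]
```

With a generous budget, the second call stops after 7 bits because every band's noise is already masked, which shows how a perceptual criterion can let the rate vary from frame to frame. A subband whose SMR is negative receives no bits at all.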


