Speech compression by vector quantization of epochs

ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359). Pub Date: 1999-08-22. DOI: 10.1109/ISSPA.1999.818220

Peter Veprek, A. B. Bradley

Citations: 3

Abstract

A pitch epoch is a fundamental unit of voiced speech. This paper introduces a speech compression method based on vector quantization of epochs. Pitch determination, epoch marking, the vector quantization procedure, and a technique for epoch extrapolation are described. The compression method is then evaluated and briefly compared to other waveform coders. Quality is measured objectively by the segmental signal-to-noise ratio (SNRseg), and the results are tabulated. The (automatic) epoch vector quantization yields the following SNRseg values: 10.03 dB at 12.0 kbps, 11.35 dB at 21.3 kbps, 13.31 dB at 39.7 kbps, 18.32 dB at 76.4 kbps, and 62.69 dB at 112.9 kbps.
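The abstract reports quality as a segmental signal-to-noise ratio but does not state how it was computed. The sketch below shows one common definition, the mean of per-frame SNR values in dB, and is not taken from the paper. The function name `segmental_snr`, the 160-sample frame length, the `eps` guard, and the choice to skip silent frames are all assumptions made for illustration.

```python
import math


def segmental_snr(reference, decoded, frame_len=160, eps=1e-12):
    """Mean per-frame SNR in dB over full, non-silent frames.

    Assumed definition (not from the paper): for each frame,
    SNR = 10 * log10(signal_energy / noise_energy), where the noise
    is the sample-wise difference between reference and decoded.
    """
    if len(reference) != len(decoded):
        raise ValueError("signals must have the same length")

    frame_snrs = []
    # Step through non-overlapping full frames; a trailing partial frame is ignored.
    for start in range(0, len(reference) - frame_len + 1, frame_len):
        ref = reference[start:start + frame_len]
        dec = decoded[start:start + frame_len]
        signal_energy = sum(x * x for x in ref)
        noise_energy = sum((x - y) ** 2 for x, y in zip(ref, dec))
        if signal_energy <= eps:
            continue  # assumption: silent frames are excluded from the average
        # eps keeps the ratio finite when a frame is reproduced exactly.
        frame_snrs.append(10.0 * math.log10(signal_energy / (noise_energy + eps)))

    if not frame_snrs:
        raise ValueError("no non-silent full frames to evaluate")
    return sum(frame_snrs) / len(frame_snrs)


# Usage: a 100 Hz tone sampled at 8 kHz, "decoded" with a 10% amplitude error.
reference = [1000.0 * math.sin(2 * math.pi * 100 * n / 8000) for n in range(800)]
decoded = [0.9 * x for x in reference]
print(f"SNRseg = {segmental_snr(reference, decoded):.2f} dB")
```

Because the averaging is done on dB values per frame, quiet frames weigh as much as loud ones, which is the usual reason for preferring this measure over a single whole-signal SNR for speech.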



