Parametric models of the magnitude/phase spectrum for harmonic speech coding

ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1988-04-11 DOI:10.1109/ICASSP.1988.196596

D. Thomson

引用次数: 12

Abstract

A method is described for representing magnitude and phase in a sinusoidal transform coder. Instead of transmitting individual sinusoids, the entire speech spectrum is transmitted. The synthesizer estimates the frequency, amplitude, and phase of each harmonic from the spectrum. Relatively high-quality speech in the 4.8-9.6 kb/s range is obtained by modeling the magnitude/phase spectrum with a combination of pole-zero analysis, phase prediction and vector quantization. A window subtraction method ensures proper synthesis of unvoiced speech. The system is robust since it does not depend on pitch estimates or voicing decisions.<>

查看原文本刊更多论文

谐波语音编码的幅度/相位谱参数模型

描述了一种在正弦变换编码器中表示幅度和相位的方法。传输的不是单个的正弦波，而是整个语音频谱。合成器从频谱中估计每个谐波的频率、幅度和相位。结合极零分析、相位预测和矢量量化对幅相谱进行建模，得到了4.8 ~ 9.6 kb/s范围内的高质量语音。窗口减法可确保正确合成不发音语音。该系统是稳健的，因为它不依赖于音高估计或声音决定

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量