Classification and spectral extrapolation based packet reconstruction for low-delay speech coding

1994 IEEE GLOBECOM. Communications: The Global Bridge Pub Date : 1994-11-28 DOI:10.1109/GLOCOM.1994.512714

A. Husain, V. Cuperman

引用次数: 4

Abstract

A common aspect of speech transmission through packetized networks is the need to consider discarded (missing) packets as a result of error detection or network overload. The missing packets and the possible mistracking that results in the speech decoder lead to significant quality degradation. In this paper, we examine recovery techniques based on speech classification and spectral extrapolation. The recovery system extrapolates independently the excitation signal and the short-term synthesis filter using an extrapolation strategy based on speech classification (voiced, unvoiced, transition, silence). The extrapolation of the short-term filter uses a least-squares fading memory polynomial filter applied to reflection coefficients. Objective and subjective quality evaluations of the recovery system applied to the LD-CELP G.728 standard for random and burst frame erasures are presented. The results indicate that the system is robust up to a frame erasure rate of 10%. Very little degradation in quality was observed at erasure rates up to 3% for random frame erasures.

查看原文本刊更多论文

基于分类和频谱外推的低延迟语音编码分组重构

通过分组网络进行语音传输的一个常见方面是需要考虑由于错误检测或网络过载而丢弃(丢失)的数据包。丢失的数据包和可能的误跟踪导致语音解码器的质量显著下降。在本文中，我们研究了基于语音分类和频谱外推的恢复技术。恢复系统使用基于语音分类(浊音、不浊音、过渡、静音)的外推策略独立外推激励信号和短期合成滤波器。短期滤波器的外推使用最小二乘衰落记忆多项式滤波器应用于反射系数。对应用于LD-CELP G.728标准的随机和突发帧擦除恢复系统进行了客观和主观的质量评价。结果表明，该系统的鲁棒性可达10%的帧擦除率。对于随机帧擦除，当擦除率高达3%时，质量几乎没有下降。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

1994 IEEE GLOBECOM. Communications: The Global Bridge

自引率

0.00%

发文量