Classification and spectral extrapolation based packet reconstruction for low-delay speech coding

A. Husain, V. Cuperman
1994 IEEE GLOBECOM. Communications: The Global Bridge, published 1994-11-28
DOI: 10.1109/GLOCOM.1994.512714 (https://doi.org/10.1109/GLOCOM.1994.512714)
Citations: 4

Abstract

Speech transmission over packetized networks must cope with packets that are discarded (missing) because of error detection or network overload. The missing packets, and the mistracking they can cause in the speech decoder, lead to significant quality degradation. In this paper, we examine recovery techniques based on speech classification and spectral extrapolation. The recovery system extrapolates the excitation signal and the short-term synthesis filter independently, using an extrapolation strategy based on speech classification (voiced, unvoiced, transition, silence). The short-term filter is extrapolated with a least-squares fading-memory polynomial filter applied to the reflection coefficients. We present objective and subjective quality evaluations of the recovery system applied to the LD-CELP G.728 standard under random and burst frame erasures. The results indicate that the system is robust up to a frame erasure rate of 10%. For random frame erasures, very little quality degradation was observed at erasure rates up to 3%.
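
The abstract does not give the polynomial degree, the memory factor, or how the extrapolated coefficients are kept valid, so the following is only a rough sketch of what a least-squares fading-memory extrapolation of one reflection-coefficient track might look like. It assumes a first-degree (straight-line) fit, exponentially decaying weights `theta**k` for a sample `k` frames old, and a clamp to magnitude below 1 so the synthesis filter stays stable. The function name, `theta`, `steps`, and `limit` are all hypothetical and are not taken from the paper.

```python
def extrapolate_fading_memory(history, theta=0.8, steps=1, limit=0.999):
    """Extrapolate one reflection-coefficient track past its last sample.

    Fits a straight line by weighted least squares, where a sample that is
    k frames old gets weight theta**k (assumed form of the fading memory),
    then evaluates the line at the next `steps` frames.
    """
    if not history:
        raise ValueError("history must contain at least one sample")
    if not 0.0 < theta <= 1.0:
        raise ValueError("theta must be in (0, 1]")

    def clamp(value):
        # Reflection coefficients must stay inside (-1, 1) for stability.
        return max(-limit, min(limit, value))

    if len(history) == 1:
        # A single sample cannot define a slope; hold the last value.
        return [clamp(history[0])] * steps

    # Accumulate the weighted sums of the normal equations.
    # Time axis: the newest sample is at t = 0, older ones at t = -1, -2, ...
    s0 = s1 = s2 = y0 = y1 = 0.0
    for k, y in enumerate(reversed(history)):
        t = -float(k)
        w = theta ** k
        s0 += w
        s1 += w * t
        s2 += w * t * t
        y0 += w * y
        y1 += w * t * y

    det = s0 * s2 - s1 * s1
    intercept = (s2 * y0 - s1 * y1) / det
    slope = (s0 * y1 - s1 * y0) / det

    # Evaluate the fitted line at the erased frames t = 1, 2, ..., steps.
    return [clamp(intercept + slope * t) for t in range(1, steps + 1)]


if __name__ == "__main__":
    # Hypothetical track of one reflection coefficient over four good frames.
    track = [0.10, 0.20, 0.30, 0.40]
    print(extrapolate_fading_memory(track, theta=0.8, steps=2))
```

A smaller `theta` makes the fit follow the most recent frames more closely, while `theta = 1` reduces to an ordinary least-squares line over the whole history. In a decoder, one such track would be kept per reflection coefficient, and the extrapolated values would stand in for the filter parameters of erased frames.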