Intra-flow loss recovery and control for VoIP

MULTIMEDIA '01 Pub Date : 2001-10-01 DOI:10.1145/500141.500206

H. Sanneck, Nguyen Tuong, L. Le, A. Wolisz, G. Carle

{"title":"Intra-flow loss recovery and control for VoIP","authors":"H. Sanneck, Nguyen Tuong, L. Le, A. Wolisz, G. Carle","doi":"10.1145/500141.500206","DOIUrl":null,"url":null,"abstract":"\"Best effort\" packet-switched networks, like the Internet, do not offer a reliable transmission of packets to applications with real-time constraints such as voice. Thus, the loss of packets impairs the application-level utility. For voice this utility impairment is twofold: on one hand, even short bursts of lost packets may decrease significantly the ability of the receiver to conceal the packet loss and the speech signal playout is interrupted. On the other hand, some packets may be particular sensitive to loss as they carry more important information in terms of user perception than other packets.We first develop an end-to-end model based on loss run-lengths with which we can describe the loss distribution within a flow. These packet-level metrics are then linked to user-level objective speech quality metrics. Using this framework, we find that for low-compressing sample-based codecs (PCM) with loss concealment isolated packet losses can be concealed well, whereas burst losses have a higher perceptual impact. For high-compressing frame-based codecs (G.729) on one hand the impact of loss is amplified through error propagation caused by the decoder filter memories, though on the other hand such coding schemes help to perform loss concealment by extrapolation of decoder state. Contrary to sample-based codecs we show that the concealment performance may \"break\" at transitions within the speech signal however.We then propose mechanisms which differentiate between packets within a voice data flow to minimize the impact of packet loss. We designate these methods as \"intra-flow\" loss recovery and control. At the end-to-end level, identification of packets sensitive to loss (sender) as well as loss concealment (receiver) takes place. Hop-by-hop support schemes then allow to (statistically) trade the loss of one packet, which is considered more important, against another one of the same flow which is of lower importance. As both packets require the same cost in terms of network transmission, a gain in user perception is obtainable. We show that significant speech quality improvements can bem achieved and additional data and delay overhead can be avoided while still maintaining a network service which is virtually identical to best effort in the long term.","PeriodicalId":416848,"journal":{"name":"MULTIMEDIA '01","volume":"202 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MULTIMEDIA '01","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/500141.500206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 35

Abstract

"Best effort" packet-switched networks, like the Internet, do not offer a reliable transmission of packets to applications with real-time constraints such as voice. Thus, the loss of packets impairs the application-level utility. For voice this utility impairment is twofold: on one hand, even short bursts of lost packets may decrease significantly the ability of the receiver to conceal the packet loss and the speech signal playout is interrupted. On the other hand, some packets may be particular sensitive to loss as they carry more important information in terms of user perception than other packets.We first develop an end-to-end model based on loss run-lengths with which we can describe the loss distribution within a flow. These packet-level metrics are then linked to user-level objective speech quality metrics. Using this framework, we find that for low-compressing sample-based codecs (PCM) with loss concealment isolated packet losses can be concealed well, whereas burst losses have a higher perceptual impact. For high-compressing frame-based codecs (G.729) on one hand the impact of loss is amplified through error propagation caused by the decoder filter memories, though on the other hand such coding schemes help to perform loss concealment by extrapolation of decoder state. Contrary to sample-based codecs we show that the concealment performance may "break" at transitions within the speech signal however.We then propose mechanisms which differentiate between packets within a voice data flow to minimize the impact of packet loss. We designate these methods as "intra-flow" loss recovery and control. At the end-to-end level, identification of packets sensitive to loss (sender) as well as loss concealment (receiver) takes place. Hop-by-hop support schemes then allow to (statistically) trade the loss of one packet, which is considered more important, against another one of the same flow which is of lower importance. As both packets require the same cost in terms of network transmission, a gain in user perception is obtainable. We show that significant speech quality improvements can bem achieved and additional data and delay overhead can be avoided while still maintaining a network service which is virtually identical to best effort in the long term.

查看原文本刊更多论文

VoIP的流内损失恢复和控制

“尽最大努力”的分组交换网络，如Internet，不能向具有实时限制(如语音)的应用程序提供可靠的分组传输。因此，数据包的丢失会损害应用程序级实用程序。对于语音来说，这种实用程序的损害是双重的:一方面，即使是短时间的数据包丢失也可能显著降低接收器隐藏数据包丢失的能力，从而中断语音信号的播放。另一方面，有些数据包可能对丢失特别敏感，因为就用户感知而言，它们比其他数据包携带更重要的信息。首先，我们开发了一个基于损失运行长度的端到端模型，我们可以用它来描述流中的损失分布。然后将这些分组级指标与用户级客观语音质量指标联系起来。使用该框架，我们发现对于具有丢失隐藏的低压缩基于样本的编解码器(PCM)，可以很好地隐藏孤立的数据包丢失，而突发丢失具有更高的感知影响。对于基于高压缩帧的编解码器(G.729)，一方面，由于解码器滤波器存储器引起的错误传播，损耗的影响被放大，尽管另一方面，这种编码方案有助于通过外推解码器状态来执行损耗隐藏。与基于样本的编解码器相反，我们表明隐藏性能可能在语音信号的转换中“中断”。然后，我们提出了在语音数据流中区分数据包的机制，以尽量减少数据包丢失的影响。我们将这些方法称为“流内”损失恢复和控制。在端到端级别，对丢失敏感的数据包(发送方)和丢失隐藏(接收方)进行识别。逐跳支持方案然后允许(统计上)交易一个数据包的丢失，这被认为是更重要的，与另一个相同流的重要性较低的。由于两个数据包在网络传输方面需要相同的成本，因此可以获得用户感知的增益。我们表明可以实现显著的语音质量改进，并且可以避免额外的数据和延迟开销，同时仍然保持与长期最佳努力几乎相同的网络服务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

MULTIMEDIA '01

自引率

0.00%

发文量