Twitch’s CDN as an Open Population Ecosystem

Proceedings of the 16th Asian Internet Engineering Conference Pub Date : 2021-12-14 DOI:10.1145/3497777.3498551

Wei-Shiang Wung, Guan-Ting Ting, Ruey-Tzer Hsu, Cheng Hsu, Yu-Chien Tsai, Caleb Wang, Yuan-Tai Liu, Hsi Chen, Polly Huang

{"title":"Twitch’s CDN as an Open Population Ecosystem","authors":"Wei-Shiang Wung, Guan-Ting Ting, Ruey-Tzer Hsu, Cheng Hsu, Yu-Chien Tsai, Caleb Wang, Yuan-Tai Liu, Hsi Chen, Polly Huang","doi":"10.1145/3497777.3498551","DOIUrl":null,"url":null,"abstract":"The quality and continuity of the video services such as Twitch depend on the scale and well-being of their content distribution networks (CDNs). Each CDN may consist of 1000s of servers, physically feeding the videos to the clients. Opting for a better understanding, researchers have attempted to measure and analyze the CDNs of popular video services [10, 11, 12, 19]. These works are, however, one-time effort. Given the widespread use of Twitch, we find continuous survey of its CDN an important subject of study. The challenge lies in the cost of performing the Internet-scale scans – the probing traffic. The larger the CDNs and the more frequent the scans are, the higher the overhead. Instead of performing full scans repeatedly, we envision a cost-effective alternative that samples and estimates the CDN size (i.e., the number of servers). Only when the size change is significant, does the system trigger a full scan. To this end and inspired by Capture-Mark-Recapture (CMR), a methodology widely used in Ecology to estimate animal population with little human effort, we propose two mechanisms to estimate the CDN size with lightweight traffic. Using a data set collected in Nov 2019, we find a 7.25% average estimation error. Provided an estimation error bound, we can identify as well the best parameter combination to minimize the probing traffic.","PeriodicalId":248679,"journal":{"name":"Proceedings of the 16th Asian Internet Engineering Conference","volume":"85 3-4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 16th Asian Internet Engineering Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3497777.3498551","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The quality and continuity of the video services such as Twitch depend on the scale and well-being of their content distribution networks (CDNs). Each CDN may consist of 1000s of servers, physically feeding the videos to the clients. Opting for a better understanding, researchers have attempted to measure and analyze the CDNs of popular video services [10, 11, 12, 19]. These works are, however, one-time effort. Given the widespread use of Twitch, we find continuous survey of its CDN an important subject of study. The challenge lies in the cost of performing the Internet-scale scans – the probing traffic. The larger the CDNs and the more frequent the scans are, the higher the overhead. Instead of performing full scans repeatedly, we envision a cost-effective alternative that samples and estimates the CDN size (i.e., the number of servers). Only when the size change is significant, does the system trigger a full scan. To this end and inspired by Capture-Mark-Recapture (CMR), a methodology widely used in Ecology to estimate animal population with little human effort, we propose two mechanisms to estimate the CDN size with lightweight traffic. Using a data set collected in Nov 2019, we find a 7.25% average estimation error. Provided an estimation error bound, we can identify as well the best parameter combination to minimize the probing traffic.

查看原文本刊更多论文

Twitch的CDN是一个开放的人口生态系统

Twitch等视频服务的质量和连续性取决于其内容分发网络(cdn)的规模和健康状况。每个CDN可能由数千台服务器组成，将视频物理馈送到客户端。为了更好地理解，研究人员试图测量和分析流行视频服务的cdn[10,11,12,19]。然而，这些工作是一次性的。鉴于Twitch的广泛使用，我们发现持续调查其CDN是一个重要的研究课题。挑战在于执行互联网规模扫描的成本——探测流量。cdn越大，扫描越频繁，开销就越高。与其重复执行完整扫描，我们设想了一种经济有效的替代方案，即采样和估计CDN大小(即服务器数量)。只有当大小发生重大变化时，系统才触发一次完整扫描。为此，受捕获-标记-再捕获(CMR)(一种在生态学中广泛用于估算动物种群的方法)的启发，我们提出了两种机制来估算轻量级流量的CDN大小。使用2019年11月收集的数据集，我们发现平均估计误差为7.25%。在给出估计误差范围的情况下，我们还可以确定最佳的参数组合，从而使探测流量最小化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 16th Asian Internet Engineering Conference

自引率

0.00%

发文量