Derivation Procedure of Coefficients of Metadata-based Model for Adaptive Bitrate Streaming Services

IF 0.6 4区计算机科学 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC

IEICE Transactions on Communications Pub Date : 2021-01-01 DOI:10.1587/TRANSCOM.2020CQP0002

Kazuhisa Yamagishi, N. Egi, Noriko Yoshimura, Pierre R. Lebreton

{"title":"Derivation Procedure of Coefficients of Metadata-based Model for Adaptive Bitrate Streaming Services","authors":"Kazuhisa Yamagishi, N. Egi, Noriko Yoshimura, Pierre R. Lebreton","doi":"10.1587/TRANSCOM.2020CQP0002","DOIUrl":null,"url":null,"abstract":"Since the quality of video streaming services is degraded due to the encoding, network congestion, and lack of a playout buffer, the normality of services needs to be monitored by gathering the quality measured at the end clients. When measuring quality at the end clients, the computational power should be sufficiently low, the bitstream information cannot be accessed for the quality estimation due to the encryption, and reference video cannot be used at the end clients. Therefore, metadata-based models have been developed and standardized that take metadata such as the resolution, framerate, and bitrate, and stalling information as input and calculate the quality. However, calculated quality for linear TV and video on demand (VoD) services cannot be compared because metadata-based models cannot calculate the impacts of codec strategies (e.g., H.264/AVC, H.265/HEVC, and AV1) and configurations (e.g., 1-pass encoding for linear TV or 2-pass encoding for VoD) on the quality. To take into account the impact of codec strategies and configurations, coefficients of metadatabased model need to be optimized per codec strategy and configuration using subjective quality. However, extensive subjective assessment tests are difficult to frequently conduct because they are expensive and time consuming and need to be conducted by video quality experts. Therefore, to monitor the quality of several types of video streaming services (e.g., linear TV and VoD) and to compare these qualities, a derivation procedure is proposed for obtaining coefficients of metadata-based models using a fullreference model. To validate the procedure, extensive subjective assessment tests were conducted. Finally, it is shown that three metadata-based models (i.e., the P.1203.1 mode 0 model, extended P.1203.1 mode 0 model, and model proposed by Yamagishi et al.) based on the proposed procedure using the video multimethod assessment fusion (VMAF) algorithm estimate quality accurately in terms of root mean squared error. key words: adaptive bitrate streaming, codec configuration, full-reference model, metadata-based model","PeriodicalId":50385,"journal":{"name":"IEICE Transactions on Communications","volume":"1 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1587/TRANSCOM.2020CQP0002","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Since the quality of video streaming services is degraded due to the encoding, network congestion, and lack of a playout buffer, the normality of services needs to be monitored by gathering the quality measured at the end clients. When measuring quality at the end clients, the computational power should be sufficiently low, the bitstream information cannot be accessed for the quality estimation due to the encryption, and reference video cannot be used at the end clients. Therefore, metadata-based models have been developed and standardized that take metadata such as the resolution, framerate, and bitrate, and stalling information as input and calculate the quality. However, calculated quality for linear TV and video on demand (VoD) services cannot be compared because metadata-based models cannot calculate the impacts of codec strategies (e.g., H.264/AVC, H.265/HEVC, and AV1) and configurations (e.g., 1-pass encoding for linear TV or 2-pass encoding for VoD) on the quality. To take into account the impact of codec strategies and configurations, coefficients of metadatabased model need to be optimized per codec strategy and configuration using subjective quality. However, extensive subjective assessment tests are difficult to frequently conduct because they are expensive and time consuming and need to be conducted by video quality experts. Therefore, to monitor the quality of several types of video streaming services (e.g., linear TV and VoD) and to compare these qualities, a derivation procedure is proposed for obtaining coefficients of metadata-based models using a fullreference model. To validate the procedure, extensive subjective assessment tests were conducted. Finally, it is shown that three metadata-based models (i.e., the P.1203.1 mode 0 model, extended P.1203.1 mode 0 model, and model proposed by Yamagishi et al.) based on the proposed procedure using the video multimethod assessment fusion (VMAF) algorithm estimate quality accurately in terms of root mean squared error. key words: adaptive bitrate streaming, codec configuration, full-reference model, metadata-based model

查看原文本刊更多论文

自适应比特率流服务元数据模型系数推导过程

由于编码、网络拥塞和缺乏播放缓冲区导致视频流服务的质量下降，因此需要通过收集在终端客户端测量的质量来监控服务的正常运行。在终端客户端进行质量测量时，计算能力应足够低，由于加密，无法访问码流信息进行质量估计，并且不能在终端客户端使用参考视频。因此，基于元数据的模型被开发和标准化，该模型将分辨率、帧率、比特率和延迟信息等元数据作为输入并计算质量。然而，线性电视和视频点播(VoD)服务的计算质量无法进行比较，因为基于元数据的模型无法计算编解码器策略(例如H.264/AVC、H.265/HEVC和AV1)和配置(例如线性电视的1通编码或VoD的2通编码)对质量的影响。为了考虑编解码器策略和配置的影响，需要利用主观质量对每个编解码器策略和配置的元数据库模型系数进行优化。然而，广泛的主观评估测试很难经常进行，因为它们既昂贵又耗时，而且需要由视频质量专家进行。因此，为了监测几种类型的视频流服务(如线性电视和VoD)的质量并比较这些质量，提出了一个推导过程，用于使用全参考模型获得基于元数据的模型的系数。为了验证该程序，进行了广泛的主观评估测试。最后，利用视频多方法评估融合(VMAF)算法，基于所提出过程的三个基于元数据的模型(即P.1203.1模式0模型、扩展的P.1203.1模式0模型和Yamagishi等人提出的模型)能够准确地从均方根误差方面估计质量。关键词:自适应比特率流，编解码器配置，全参考模型，元数据模型

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEICE Transactions on Communications 工程技术-电信学

CiteScore

1.40

自引率

28.60%

发文量

101

审稿时长

3.7 months

期刊介绍： The IEICE Transactions on Communications is an all-electronic journal published occasionally by the Institute of Electronics, Information and Communication Engineers (IEICE) and edited by the Communications Society in IEICE. The IEICE Transactions on Communications publishes original, peer-reviewed papers that embrace the entire field of communications, including: - Fundamental Theories for Communications - Energy in Electronics Communications - Transmission Systems and Transmission Equipment for Communications - Optical Fiber for Communications - Fiber-Optic Transmission for Communications - Network System - Network - Internet - Network Management/Operation - Antennas and Propagation - Electromagnetic Compatibility (EMC) - Wireless Communication Technologies - Terrestrial Wireless Communication/Broadcasting Technologies - Satellite Communications - Sensing - Navigation, Guidance and Control Systems - Space Utilization Systems for Communications - Multimedia Systems for Communication