Proceedings of the 12th ACM Multimedia Systems Conference: Latest Papers (Page 3)

COSMOS on Steroids: a Cheap Detector for Cheapfakes

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3479968

Tankut Akgul, T. Civelek, Deniz Ugur, A. Begen

Citations: 11

uvgVenctester: Open-Source Test Automation Framework for Comprehensive Video Encoder Benchmarking

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478445

Joose Sainio, Alexandre Mercat, Jarno Vanne

Abstract: The agile and efficient development of modern video encoders calls for automated testing methodologies. This paper presents the first-of-its-kind open-source test automation framework called uvgVenctester (github.com/ultravideo/uvgVenctester) that is designed for comprehensive performance and conformance testing of video encoders with the desired set of test video sequences. Our framework comes with built-in support for the popular AVC, HEVC, VVC, VP9, and AV1 video coding formats and the state-of-the-art HM, Kvazaar, x265, VTM, VVenC, SVT-VP9, and SVT-AV1 video encoders. Furthermore, there are no technical limitations to adopting other formats or encoders. Developers can evaluate the encoder of interest under three primary usage scenarios: 1) conformance testing of the encoded bitstream; 2) rate-distortion-complexity comparison with other encoders; and 3) systematic exploration of encoding parameters. The framework provides commonly used analysis tools to quantify encoding quality, speed, and bitrate with a versatile set of absolute and comparative results such as Bjøntegaard Delta (BD)-Rate for the PSNR, SSIM, and VMAF quality metrics. The supported output formats include CSV, graph, and comparison table, so the results are available in both human- and machine-readable formats. To the best of our knowledge, the proposed framework is currently the most comprehensive and modular open-source software toolset for video encoder benchmarking.
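The BD-Rate figure mentioned in the abstract above summarizes the average bitrate difference between two encoders at equal quality. The following is a minimal sketch of that idea, not uvgVenctester's own code: it assumes each rate-distortion curve is a list of (bitrate, quality) pairs with strictly increasing quality, and it swaps the usual cubic curve fit for piecewise-linear interpolation of log-bitrate to stay dependency-free. The function names are hypothetical.

```python
import math

def _interp(x, xs, ys):
    """Piecewise-linear interpolation of y at x over strictly increasing xs."""
    x = min(max(x, xs[0]), xs[-1])  # guard against float round-off at the ends
    for i in range(len(xs) - 1):
        if xs[i] <= x <= xs[i + 1]:
            t = (x - xs[i]) / (xs[i + 1] - xs[i])
            return ys[i] + t * (ys[i + 1] - ys[i])
    raise ValueError("curve needs at least two points")

def _mean_log_rate(curve, lo, hi, steps=1000):
    """Average ln(bitrate) over the quality interval [lo, hi] (trapezoid rule)."""
    pts = sorted(curve, key=lambda p: p[1])  # sort by quality
    quality = [q for _, q in pts]
    log_rate = [math.log(r) for r, _ in pts]
    h = (hi - lo) / steps
    area = 0.0
    for i in range(steps):
        a = lo + i * h
        b = lo + (i + 1) * h
        area += 0.5 * h * (_interp(a, quality, log_rate) + _interp(b, quality, log_rate))
    return area / (hi - lo)

def bd_rate_percent(anchor, test):
    """Approximate BD-rate of `test` relative to `anchor`, in percent.

    Negative values mean `test` needs less bitrate for the same quality.
    Simplification (assumption): linear interpolation instead of a cubic fit.
    """
    lo = max(min(q for _, q in anchor), min(q for _, q in test))
    hi = min(max(q for _, q in anchor), max(q for _, q in test))
    if hi <= lo:
        raise ValueError("quality ranges do not overlap")
    diff = _mean_log_rate(test, lo, hi) - _mean_log_rate(anchor, lo, hi)
    return (math.exp(diff) - 1.0) * 100.0
```

As a sanity check, a test encoder that reaches every quality point at exactly half the anchor's bitrate should come out at roughly -50 percent, and a curve compared against itself at 0.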

Citations: 6

In-Network Scalable Video Adaption Using Big Packet Protocol

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478440

S. Clayman, M. Sayıt

Citations: 3

PePa Ping Dataset: Comprehensive Contextualization of Periodic Passive Ping in Wireless Networks

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478456

Diego Madariaga, Lucas Torrealba, Javier Madariaga, Javier Bustos-Jiménez, B. Bustos

Abstract: Among all Internet Quality of Service (QoS) indicators, round-trip time (RTT), jitter, and packet loss have been thoroughly studied due to their great impact on overall network performance and on the Quality of Experience (QoE) perceived by users. With that in mind, we generated a real-world dataset with a comprehensive contextualization of these important quality indicators by passively monitoring the network in user space. To generate this dataset, we first developed a novel Periodic Passive Ping (PePa Ping) methodology for Android devices. Unlike other work, PePa Ping periodically obtains the RTT, jitter, and number of lost packets of all TCP connections. This passive approach relies on a local VPN server residing inside the client device that manages all Internet traffic and obtains QoS information about the established connections. The collected QoS indicators are provided directly by the Linux kernel, and they are therefore exceptionally close to the real QoS values experienced by users' devices. Additionally, the PePa Ping application continuously measured other indicators related to each individual network flow, the state of the device, and the state of the Internet connection (either WiFi or mobile). With all the collected information, each network flow can be precisely linked to a set of environmental data that provides a comprehensive contextualization of each individual connection.
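The abstract above states that the RTT and jitter values come from the Linux kernel. Linux exposes a smoothed RTT and an RTT variance per TCP socket (the `tcpi_rtt` and `tcpi_rttvar` fields of `struct tcp_info`). As an illustration of what such kernel-side values represent, here is a minimal sketch of the RFC 6298 estimator that TCP stacks build on. Whether PePa Ping reports exactly these quantities is an assumption, the kernel's fixed-point arithmetic differs in detail, and the function name is hypothetical.

```python
def rtt_estimator(samples_ms, alpha=1 / 8, beta=1 / 4):
    """Return (smoothed RTT, RTT variance) after feeding RTT samples in order.

    Follows RFC 6298: the first sample initializes SRTT = R and RTTVAR = R / 2;
    every later sample updates RTTVAR first (using the old SRTT), then SRTT.
    Returns (None, None) when no samples are given.
    """
    srtt = None
    rttvar = None
    for r in samples_ms:
        if srtt is None:
            srtt, rttvar = r, r / 2
        else:
            rttvar = (1 - beta) * rttvar + beta * abs(srtt - r)
            srtt = (1 - alpha) * srtt + alpha * r
    return srtt, rttvar
```

On a perfectly stable path the variance term decays toward zero while the smoothed RTT stays put, which is why the variance is a reasonable stand-in for jitter in this sketch.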

Citations: 1

Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478468

Ekrem Çetinkaya

Citations: 0

A Hybrid Receiver-side Congestion Control Scheme for Web Real-time Communication

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3479970

Bo Wang, Yuan Zhang, Si-Ze Qian, Zipeng Pan, Yuhong Xie

Abstract: Web real-time communication (WebRTC) employs congestion control to ensure the quality of experience (QoE). Unlike congestion control schemes for TCP, WebRTC keeps a low-level playback buffer that treats excessively delayed packets as losses, which makes congestion control for WebRTC more challenging. Existing heuristic schemes estimate the network conditions based on hand-crafted rules that may be suboptimal, leading to under-utilization or over-utilization of link capacity in many cases. On the other hand, the existing learning-based schemes train a model that acts in a large action space, which makes it hard to converge to a stable state and yields poor performance under unpredictable network conditions. In this paper, we propose a hybrid receiver-side congestion control (HRCC) framework, which combines a heuristic congestion control scheme with an RL-Agent that periodically generates a gain coefficient to tune the bandwidth estimated by the heuristic scheme. Extensive simulation experiments demonstrate that the HRCC's RL-Agent effectively tunes the bandwidth estimate of the heuristic scheme. The hybrid scheme achieves higher bandwidth utilization than the fully heuristic scheme with similar queuing delay and packet loss, and it outperforms the fully RL-based scheme in overall performance.
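The hybrid idea in the abstract above, a heuristic estimate scaled by a periodically refreshed gain coefficient, can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the multiplicative form, the clamping bounds, the update period, and the `policy` callable standing in for the RL-Agent are all hypothetical.

```python
def tuned_estimate(heuristic_bps, gain, floor_bps=50_000, ceil_bps=20_000_000):
    """Scale the heuristic bandwidth estimate by the gain, then clamp it.

    The bounds are illustrative defaults, not values from the paper.
    """
    return min(max(heuristic_bps * gain, floor_bps), ceil_bps)

def run_hybrid(heuristic_estimates, policy, period=5):
    """Apply a gain that is refreshed only every `period` steps.

    `policy(estimate, previous_gain)` is a stand-in for the RL-Agent and
    returns the new gain coefficient; between refreshes the last gain is reused.
    """
    gain = 1.0
    tuned = []
    for step, estimate in enumerate(heuristic_estimates):
        if step % period == 0:
            gain = policy(estimate, gain)
        tuned.append(tuned_estimate(estimate, gain))
    return tuned
```

Keeping the learned component down to a single scalar is what shrinks the action space relative to a fully learned controller; the heuristic still does the per-interval estimation.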

Citations: 8

Standards-based Streaming Analytics and its Visualization

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478438

S. Pham, M. Avelino, D. Silhavy, Troung-Sinh An, S. Arbanowski

Citations: 6

4DLFVD

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478450

Xinjue Hu, Chen-chao Wang, Yuxuan Pan, Yunming Liu, Yumei Wang, Yu Liu, Lin Zhang, S. Shirmohammadi

Abstract: We present a 4D Light Field (LF) video dataset, collected by a custom-made camera matrix, to be used for designing and testing algorithms and systems for LF video coding, processing, and streaming. Compared to existing LF datasets, ours provides LF videos, as opposed to only images, and at higher frame resolution, a higher number of viewpoints, and/or higher frame rate, offering the highest visual quality among LF video datasets. To achieve this, we built a 10 x 10 LF capture matrix composed of 100 cameras, each with a 1920 x 1056 resolution. We used this matrix to record videos under real and varying illumination and scene dynamics conditions. The dataset contains a total of nine groups of LF videos: eight groups collected with a fixed camera matrix position and orientation recording indoor potted plants, furniture, etc., and the last group collected by rotating around an outdoor environment with roadside vehicles, pedestrians, etc. Each group of LF videos consists of 100 video streams encoded with H.265/HEVC. Scene changes vary from static to slightly dynamic to highly dynamic, providing a good level of diversity. As an example, we present the results of a depth estimation method and show that our dataset can be used for applications such as object detection, 3D modeling, and others.
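To make the scale of the capture setup in the abstract above concrete, the sketch below maps a stream index to a position in the 10 x 10 camera matrix and counts the pixels in one light-field frame. The row-major ordering of the 100 streams is an assumption for illustration; the dataset's actual file layout may differ, and the function names are hypothetical.

```python
ROWS, COLS = 10, 10          # camera matrix size stated in the abstract
WIDTH, HEIGHT = 1920, 1056   # per-camera resolution stated in the abstract

def stream_to_grid(index):
    """Map a stream index in [0, 99] to (row, col), assuming row-major order."""
    if not 0 <= index < ROWS * COLS:
        raise IndexError("stream index out of range")
    return divmod(index, COLS)

def pixels_per_lf_frame():
    """Total pixels captured across all cameras for one time instant."""
    return ROWS * COLS * WIDTH * HEIGHT

print(stream_to_grid(37))      # (3, 7)
print(pixels_per_lf_frame())   # 202752000
```

Roughly 203 million pixels per time instant is about 100 times a single Full HD view, which is why each group is distributed as 100 separately encoded H.265/HEVC streams rather than one video.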

Citations: 10

HYPERAKTIV

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478454

S. Hicks, A. Stautland, Ole Bernt Fasmer, Wenche Førland, H. Hammer, P. Halvorsen, K. Mjeldheim, K. Oedegaard, B. Osnes, Vigdis Elin Giæver Syrstad, M. Riegler, P. Jakobsen

Abstract: Machine learning research within healthcare frequently lacks the public data needed to be fully reproducible and comparable. Datasets are often restricted due to privacy concerns and legal requirements that come with patient-related data. Consequently, many algorithms and models get published on the same topic without a standard benchmark to measure against. Therefore, this paper presents HYPERAKTIV, a public dataset containing health, activity, and heart rate data from patients diagnosed with attention deficit hyperactivity disorder, better known as ADHD. The dataset consists of data collected from 51 patients with ADHD and 52 clinical controls. In addition to the activity and heart rate data, we also include a series of patient attributes such as their age, sex, and information about their mental state, as well as output data from a computerized neuropsychological test. Together with the presented dataset, we also provide baseline experiments using traditional machine learning algorithms to predict ADHD based on the included activity data. We hope that this dataset can be used as a starting point for computer scientists who want to contribute to the field of mental health, and as a common benchmark for future work in ADHD analysis.

Citations: 1

CDN and SDN Support and Player Interaction for HTTP Adaptive Video Streaming

Proceedings of the 12th ACM Multimedia Systems Conference Pub Date : 2021-06-24 DOI: 10.1145/3458305.3478464

R. Farahani

Citations: 10