Rotary: A Resource Arbitration Framework for Progressive Iterative Analytics

2023 IEEE 39th International Conference on Data Engineering (ICDE) Pub Date : 2023-04-01 DOI:10.1109/ICDE55515.2023.00166

Rui Liu, Aaron J. Elmore, M. Franklin, S. Krishnan

{"title":"Rotary: A Resource Arbitration Framework for Progressive Iterative Analytics","authors":"Rui Liu, Aaron J. Elmore, M. Franklin, S. Krishnan","doi":"10.1109/ICDE55515.2023.00166","DOIUrl":null,"url":null,"abstract":"Increasingly modern computing applications employ progressive iterative analytics, as best exemplified by two prevalent cases, approximate query processing (AQP) and deep learning training (DLT). In comparison to classic computing applications that only return the results after processing all the input data, progressive iterative analytics keep providing approximate or partial results to users by performing computations on a subset of the entire dataset until either the users are satisfied with the results, or the predefined completion criteria are achieved. Typically, progressive iterative analytic jobs have various completion criteria, produce diminishing returns, and process data at different rates, which necessitates a novel resource arbitration that can continuously prioritize the progressive iterative analytic jobs and determine if/when to reallocate and preempt the resources. We propose and design a resource arbitration framework, Rotary, and implement two resource arbitration systems, Rotary-AQP and Rotary-DLT, for approximate query processing and deep learning training. We build a TPC-H based AQP workload and a survey-based DLT workload to evaluate the two systems, respectively. The evaluation results demonstrate that Rotary-AQP and Rotary-DLT outperform the state-of-the-art systems and confirm the generality and practicality of the proposed resource arbitration framework.","PeriodicalId":434744,"journal":{"name":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE55515.2023.00166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Increasingly modern computing applications employ progressive iterative analytics, as best exemplified by two prevalent cases, approximate query processing (AQP) and deep learning training (DLT). In comparison to classic computing applications that only return the results after processing all the input data, progressive iterative analytics keep providing approximate or partial results to users by performing computations on a subset of the entire dataset until either the users are satisfied with the results, or the predefined completion criteria are achieved. Typically, progressive iterative analytic jobs have various completion criteria, produce diminishing returns, and process data at different rates, which necessitates a novel resource arbitration that can continuously prioritize the progressive iterative analytic jobs and determine if/when to reallocate and preempt the resources. We propose and design a resource arbitration framework, Rotary, and implement two resource arbitration systems, Rotary-AQP and Rotary-DLT, for approximate query processing and deep learning training. We build a TPC-H based AQP workload and a survey-based DLT workload to evaluate the two systems, respectively. The evaluation results demonstrate that Rotary-AQP and Rotary-DLT outperform the state-of-the-art systems and confirm the generality and practicality of the proposed resource arbitration framework.

查看原文本刊更多论文

扶轮:渐进式迭代分析的资源仲裁框架

越来越多的现代计算应用采用渐进式迭代分析，两种流行的案例，近似查询处理(AQP)和深度学习训练(DLT)就是最好的例子。与只在处理完所有输入数据后才返回结果的经典计算应用程序相比，渐进式迭代分析通过对整个数据集的一个子集执行计算，不断向用户提供近似或部分结果，直到用户对结果感到满意，或者达到预定义的完成标准。通常，渐进式迭代分析作业具有不同的完成标准，产生递减的收益，并以不同的速率处理数据，这就需要一种新的资源仲裁，它可以持续地优先考虑渐进式迭代分析作业，并确定是否/何时重新分配和抢占资源。我们提出并设计了一个资源仲裁框架Rotary，并实现了两个资源仲裁系统Rotary- aqp和Rotary- dlt，用于近似查询处理和深度学习训练。我们分别构建了一个基于TPC-H的AQP工作负载和一个基于调查的DLT工作负载来评估这两个系统。评估结果表明，Rotary-AQP和Rotary-DLT优于最先进的系统，并证实了拟议资源仲裁框架的通用性和实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE 39th International Conference on Data Engineering (ICDE)

自引率

0.00%

发文量