The Preemptive Resource Allocation Problem

Foundations of Software Technology and Theoretical Computer Science Pub Date : 2018-11-18 DOI:10.4230/LIPIcs.FSTTCS.2019.26

Kanthi Kiran Sarpatwar, B. Schieber, H. Shachnai

{"title":"The Preemptive Resource Allocation Problem","authors":"Kanthi Kiran Sarpatwar, B. Schieber, H. Shachnai","doi":"10.4230/LIPIcs.FSTTCS.2019.26","DOIUrl":null,"url":null,"abstract":"We revisit a classical scheduling model to incorporate modern trends in data center networks and cloud services. Addressing some key challenges in the allocation of shared resources to user requests (jobs) in such settings, we consider the following variants of the classic {\\em resource allocation problem} (\\textsf{RAP}). The input to our problems is a set $J$ of jobs and a set $M$ of homogeneous hosts, each has an available amount of some resource. A job is associated with a release time, a due date, a weight, and a given length, as well as its resource requirement. A \\emph{feasible} schedule is an allocation of the resource to a subset of the jobs, satisfying the job release times/due dates as well as the resource constraints. A crucial distinction between classic {\\textsf{RAP}} and our problems is that we allow preemption and migration of jobs, motivated by virtualization techniques. \nWe consider two natural objectives: {\\em throughput maximization} (\\textsf{MaxT}), which seeks a maximum weight subset of the jobs that can be feasibly scheduled on the hosts in $M$, and {\\em resource minimization} (\\textsf{MinR}), that is finding the minimum number of (homogeneous) hosts needed to feasibly schedule all jobs. Both problems are known to be NP-hard. \nWe first present a $\\Omega(1)$-approximation algorithm for \\textsf{MaxT} instances where time-windows form a laminar family of intervals. We then extend the algorithm to handle instances with arbitrary time-windows, assuming there is sufficient slack for each job to be completed. \nFor \\textsf{MinR} we study a more general setting with $d$ resources and derive an $O(\\log d)$-approximation for any fixed $d \\geq 1$, under the assumption that time-windows are not too small. This assumption can be removed leading to a slightly worse ratio of $O(\\log d\\log^* T)$, where $T$ is the maximum due date of any job.","PeriodicalId":175000,"journal":{"name":"Foundations of Software Technology and Theoretical Computer Science","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Foundations of Software Technology and Theoretical Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.FSTTCS.2019.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

We revisit a classical scheduling model to incorporate modern trends in data center networks and cloud services. Addressing some key challenges in the allocation of shared resources to user requests (jobs) in such settings, we consider the following variants of the classic {\em resource allocation problem} (\textsf{RAP}). The input to our problems is a set $J$ of jobs and a set $M$ of homogeneous hosts, each has an available amount of some resource. A job is associated with a release time, a due date, a weight, and a given length, as well as its resource requirement. A \emph{feasible} schedule is an allocation of the resource to a subset of the jobs, satisfying the job release times/due dates as well as the resource constraints. A crucial distinction between classic {\textsf{RAP}} and our problems is that we allow preemption and migration of jobs, motivated by virtualization techniques. We consider two natural objectives: {\em throughput maximization} (\textsf{MaxT}), which seeks a maximum weight subset of the jobs that can be feasibly scheduled on the hosts in $M$, and {\em resource minimization} (\textsf{MinR}), that is finding the minimum number of (homogeneous) hosts needed to feasibly schedule all jobs. Both problems are known to be NP-hard. We first present a $\Omega(1)$-approximation algorithm for \textsf{MaxT} instances where time-windows form a laminar family of intervals. We then extend the algorithm to handle instances with arbitrary time-windows, assuming there is sufficient slack for each job to be completed. For \textsf{MinR} we study a more general setting with $d$ resources and derive an $O(\log d)$-approximation for any fixed $d \geq 1$, under the assumption that time-windows are not too small. This assumption can be removed leading to a slightly worse ratio of $O(\log d\log^* T)$, where $T$ is the maximum due date of any job.

查看原文本刊更多论文

抢占式资源分配问题

我们回顾了一个经典的调度模型，以结合数据中心网络和云服务的现代趋势。为了解决在这种设置中向用户请求(作业)分配共享资源的一些关键挑战，我们考虑了经典{\em资源分配问题}\textsf{(RAP)}的以下变体。我们的问题的输入是一组$J$作业和一组$M$同质主机，每个主机都有可用的资源。作业与发布时间、截止日期、权重和给定长度以及资源需求相关联。\emph{可行}的调度是将资源分配给作业的子集，满足作业的发布时间/到期日期以及资源约束。经典{\textsf{RAP}}和我们的问题之间的一个关键区别是，我们允许由虚拟化技术驱动的作业的抢占和迁移。我们考虑两个自然目标:{\em吞吐量最大化}\textsf{(MaxT)}，它寻求可以在$M$中的主机上可行地调度的作业的最大权重子集，以及{\em资源最小化}\textsf{(MinR)}，即找到可行地调度所有作业所需的(同质)主机的最小数量。这两个问题都是np困难的。我们首先提出了一个$\Omega(1)$ -近似算法的\textsf{MaxT}实例，其中时间窗形成层流的区间族。然后，我们扩展该算法来处理具有任意时间窗的实例，假设每个作业都有足够的空闲时间来完成。对于\textsf{MinR}，我们使用$d$资源研究更一般的设置，并在假设时间窗不太小的情况下，推导出任何固定$d \geq 1$的$O(\log d)$ -近似。如果去掉这个假设，就会得到一个稍微差一点的比率$O(\log d\log^* T)$，其中$T$是任何工作的最大到期日。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Foundations of Software Technology and Theoretical Computer Science

自引率

0.00%

发文量