{"title":"A Survey of Fundamental Principles and Analysis for Job Scheduling on Warehouse-Scale Computers","authors":"Kevin Exton, Maria Rodriguez Read","doi":"10.1145/3766543","DOIUrl":null,"url":null,"abstract":"Over the last ten years, the search for efficient scheduling algorithms that can cope with heterogeneous workload demands on large (warehouse) scale computers has reached a feverish tempo. We focus on examining scheduling techniques for highly parallelizable jobs on warehouse-scale computers through the lens of basic results in relevant fundamental theories. The objective of this survey is to connect the disparate scheduling ideas and approaches in the literature under a loose framework of mathematical results that can be used to compare superficially different scheduling methodologies under a common goal. As the mathematical problem is NP-Hard in general, we do not emphasize rigorous mathematical proof, rather, we advocate for the use of mathematical results to guide intuition. We provide readers with some basic tools to use in navigating the fragmented research around job scheduling for distributed applications. We also highlight some common misunderstandings of fundamental theory in the literature that are skewing results and may be limiting research progress.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"16 1","pages":""},"PeriodicalIF":28.0000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Computing Surveys","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3766543","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Over the last ten years, the search for efficient scheduling algorithms that can cope with heterogeneous workload demands on large (warehouse) scale computers has reached a feverish tempo. We focus on examining scheduling techniques for highly parallelizable jobs on warehouse-scale computers through the lens of basic results in relevant fundamental theories. The objective of this survey is to connect the disparate scheduling ideas and approaches in the literature under a loose framework of mathematical results that can be used to compare superficially different scheduling methodologies under a common goal. As the mathematical problem is NP-Hard in general, we do not emphasize rigorous mathematical proof, rather, we advocate for the use of mathematical results to guide intuition. We provide readers with some basic tools to use in navigating the fragmented research around job scheduling for distributed applications. We also highlight some common misunderstandings of fundamental theory in the literature that are skewing results and may be limiting research progress.
期刊介绍:
ACM Computing Surveys is an academic journal that focuses on publishing surveys and tutorials on various areas of computing research and practice. The journal aims to provide comprehensive and easily understandable articles that guide readers through the literature and help them understand topics outside their specialties. In terms of impact, CSUR has a high reputation with a 2022 Impact Factor of 16.6. It is ranked 3rd out of 111 journals in the field of Computer Science Theory & Methods.
ACM Computing Surveys is indexed and abstracted in various services, including AI2 Semantic Scholar, Baidu, Clarivate/ISI: JCR, CNKI, DeepDyve, DTU, EBSCO: EDS/HOST, and IET Inspec, among others.