{"title":"最小化云计算加速系统的平均作业完成时间","authors":"Ke Li, Qiang Yang, Shunrui Xiong, P. Fan","doi":"10.1109/CCIS57298.2022.10016316","DOIUrl":null,"url":null,"abstract":"With the development of computation intensive applications, such as deep neural network inference and deep packet inspection, the conventional computation resources are exhausted by these computing tasks, which results in large application response time. To improve the user experience, more and more providers deploy accelerators in their computing clusters. Accordingly, there is a problem arising: how should we schedule the non-preemptive jobs such that the average job completion time can be minimized. To answer this question, we first formulate the problem to be a mathematical programming model. Based on solid analysis, we find that the problem we need to solve is NP-hard. Due to the hardness of this problem, we propose a (6 – 2/M)-approximation algorithm to solve it efficiently, where M is the number of accelerator servers in the system. Through extensive simulations, we find that the proposed algorithm outperforms the conventional scheduling algorithms, FIFO and Shortest Job First (SJF), by 24.24% and 29.07%, respectively.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Minimizing the Average Job Completion Time for Acceleration Systems in Cloud Computing\",\"authors\":\"Ke Li, Qiang Yang, Shunrui Xiong, P. Fan\",\"doi\":\"10.1109/CCIS57298.2022.10016316\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of computation intensive applications, such as deep neural network inference and deep packet inspection, the conventional computation resources are exhausted by these computing tasks, which results in large application response time. To improve the user experience, more and more providers deploy accelerators in their computing clusters. Accordingly, there is a problem arising: how should we schedule the non-preemptive jobs such that the average job completion time can be minimized. To answer this question, we first formulate the problem to be a mathematical programming model. Based on solid analysis, we find that the problem we need to solve is NP-hard. Due to the hardness of this problem, we propose a (6 – 2/M)-approximation algorithm to solve it efficiently, where M is the number of accelerator servers in the system. Through extensive simulations, we find that the proposed algorithm outperforms the conventional scheduling algorithms, FIFO and Shortest Job First (SJF), by 24.24% and 29.07%, respectively.\",\"PeriodicalId\":374660,\"journal\":{\"name\":\"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCIS57298.2022.10016316\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS57298.2022.10016316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Minimizing the Average Job Completion Time for Acceleration Systems in Cloud Computing
With the development of computation intensive applications, such as deep neural network inference and deep packet inspection, the conventional computation resources are exhausted by these computing tasks, which results in large application response time. To improve the user experience, more and more providers deploy accelerators in their computing clusters. Accordingly, there is a problem arising: how should we schedule the non-preemptive jobs such that the average job completion time can be minimized. To answer this question, we first formulate the problem to be a mathematical programming model. Based on solid analysis, we find that the problem we need to solve is NP-hard. Due to the hardness of this problem, we propose a (6 – 2/M)-approximation algorithm to solve it efficiently, where M is the number of accelerator servers in the system. Through extensive simulations, we find that the proposed algorithm outperforms the conventional scheduling algorithms, FIFO and Shortest Job First (SJF), by 24.24% and 29.07%, respectively.