A Simulation for Forecasting Compute Resource Usage

Proceedings of the 2021 3rd Asia Pacific Information Technology Conference Pub Date : 2021-01-15 DOI:10.1145/3449365.3449370

Saurabh Adhikari, C. Plewnia, C. Netramai, H. Lichter

{"title":"A Simulation for Forecasting Compute Resource Usage","authors":"Saurabh Adhikari, C. Plewnia, C. Netramai, H. Lichter","doi":"10.1145/3449365.3449370","DOIUrl":null,"url":null,"abstract":"The usage of compute resources by data processing jobs may change over time, requiring careful resource planning when an organization operates these resources itself in an on-premise private cloud. Ideally, the currently available resources always match the need of jobs executed on them. This way the resources would neither be overutilized, which is usually undesirable as the jobs might take longer, nor underutilized, which causes unnecessary costs for unused resources. When an organization decides to extend its private cloud resources, it can still take months until the servers are bought, delivered, and installed. Thus, the resources have to be planned carefully in advance. Estimating the future resource needs is difficult and influenced by many factors. In our experience, creating the estimate is often a manual process supported by self-designed spreadsheets; these spreadsheets are maintained by a single person from time to time and might even be replaced completely if someone else assumes that person's responsibility. However, this approach does not lead to transparent and verifiable forecasts that enable collaboration and learning from past decisions. This paper addresses the problem of generating a transparent and verifiable compute resource usage forecast by proposing a simulation approach. It requires a user to model an estimate of the future workload development of the data processing jobs as well as the current compute resource setup. The simulation can then be run to identify possible future resource bottlenecks. This can be repeated for different scenarios, including situations of failing resources as well as the addition of resources to compensate for bottlenecks and failures. We further provide a first qualitative case study of this approach that demonstrates its potential.","PeriodicalId":188200,"journal":{"name":"Proceedings of the 2021 3rd Asia Pacific Information Technology Conference","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 3rd Asia Pacific Information Technology Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3449365.3449370","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The usage of compute resources by data processing jobs may change over time, requiring careful resource planning when an organization operates these resources itself in an on-premise private cloud. Ideally, the currently available resources always match the need of jobs executed on them. This way the resources would neither be overutilized, which is usually undesirable as the jobs might take longer, nor underutilized, which causes unnecessary costs for unused resources. When an organization decides to extend its private cloud resources, it can still take months until the servers are bought, delivered, and installed. Thus, the resources have to be planned carefully in advance. Estimating the future resource needs is difficult and influenced by many factors. In our experience, creating the estimate is often a manual process supported by self-designed spreadsheets; these spreadsheets are maintained by a single person from time to time and might even be replaced completely if someone else assumes that person's responsibility. However, this approach does not lead to transparent and verifiable forecasts that enable collaboration and learning from past decisions. This paper addresses the problem of generating a transparent and verifiable compute resource usage forecast by proposing a simulation approach. It requires a user to model an estimate of the future workload development of the data processing jobs as well as the current compute resource setup. The simulation can then be run to identify possible future resource bottlenecks. This can be repeated for different scenarios, including situations of failing resources as well as the addition of resources to compensate for bottlenecks and failures. We further provide a first qualitative case study of this approach that demonstrates its potential.

查看原文本刊更多论文

一种预测计算资源使用的仿真方法

数据处理作业对计算资源的使用可能会随着时间的推移而变化，当组织在内部部署私有云中操作这些资源时，需要仔细规划资源。理想情况下，当前可用的资源总是与在其上执行的作业的需求相匹配。通过这种方式，资源既不会被过度利用(这通常是不希望的，因为作业可能需要更长的时间)，也不会被未充分利用(这会导致未使用资源的不必要成本)。当组织决定扩展其私有云资源时，购买、交付和安装服务器仍然需要几个月的时间。因此，必须事先仔细规划资源。估计未来的资源需求是困难的，而且受到许多因素的影响。根据我们的经验，创建评估通常是一个由自己设计的电子表格支持的手动过程;这些电子表格不时由一个人维护，如果其他人承担了这个人的责任，甚至可能被完全取代。然而，这种方法不能导致透明和可验证的预测，从而使协作和从过去的决策中学习成为可能。本文提出了一种仿真方法，解决了生成透明且可验证的计算资源使用预测的问题。它要求用户对数据处理作业的未来工作负载开发以及当前的计算资源设置进行建模。然后可以运行模拟以确定未来可能出现的资源瓶颈。这可以在不同的场景中重复，包括资源失败的情况，以及为补偿瓶颈和故障而添加的资源。我们进一步提供了该方法的第一个定性案例研究，以证明其潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2021 3rd Asia Pacific Information Technology Conference

自引率

0.00%

发文量