Profiling and Predicting Application Performance on the Cloud
Matt Baughman, Ryan Chard, Logan T. Ward, Jason Pitt, K. Chard, Ian T. Foster
2018 IEEE/ACM 11th International Conference on Utility and Cloud Computing (UCC), December 2018. DOI: 10.1109/UCC.2018.00011
Citations: 15
Abstract
Cloud providers continue to expand and diversify their collections of leasable resources to meet the needs of an increasingly wide range of applications. While this flexibility is a key benefit of the cloud, it also creates a complex landscape in which users are faced with many resource choices for a given application. Suboptimal selections can both degrade performance and increase costs. Given the rapidly evolving pool of resources, it is infeasible for users alone to select instance types; instead, automated methods are needed to simplify and guide resource provisioning. Here we present a method for the automatic prediction of application performance on arbitrary cloud instances. We combine offline and online profiling approaches, using historical data gathered from non-cloud environments and targeted profiling runs on cloud environments to create a composite application model that can predict run times on a given cloud instance type for a given input data size. We demonstrate an average error of 17.2% across nine applications used in production bioinformatics workflows. Finally, we evaluate an experiment design approach to explore the trade-off between the cost of profiling and the accuracy of our models. Using this approach, with no prior knowledge, we show that four selectively chosen profiling experiments yield a model whose performance is within 30% of one trained using all instance types.
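The abstract describes a composite model that combines an offline scaling profile (historical runs on non-cloud machines) with online calibration (a few targeted runs on cloud instances). The sketch below illustrates one way that split could work; the class name, the linear runtime-vs-input-size model, and the median-ratio calibration are all illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of an offline-fit, online-calibrated runtime predictor.
# Assumes runtime scales roughly linearly with input size on a reference
# machine, and that each instance type differs by a constant speed factor.
import numpy as np
from sklearn.linear_model import LinearRegression

class CompositeRuntimeModel:
    def __init__(self):
        self.offline = LinearRegression()  # runtime vs. input size (offline data)
        self.instance_factor = {}          # per-instance-type calibration factor

    def fit_offline(self, input_sizes, runtimes):
        # Learn how runtime scales with input size from historical,
        # non-cloud runs on a reference machine.
        X = np.asarray(input_sizes, dtype=float).reshape(-1, 1)
        self.offline.fit(X, np.asarray(runtimes, dtype=float))

    def calibrate(self, instance_type, input_sizes, runtimes):
        # Use a small number of targeted cloud profiling runs to estimate
        # how much faster or slower this instance type is than the
        # reference machine (median of observed/predicted ratios).
        X = np.asarray(input_sizes, dtype=float).reshape(-1, 1)
        ratios = np.asarray(runtimes, dtype=float) / self.offline.predict(X)
        self.instance_factor[instance_type] = float(np.median(ratios))

    def predict(self, instance_type, input_size):
        # Composite prediction: offline scaling model times the online
        # per-instance calibration factor (1.0 if never calibrated).
        base = float(self.offline.predict(np.array([[float(input_size)]]))[0])
        return base * self.instance_factor.get(instance_type, 1.0)
```

In practice the offline regressor would be chosen to match each application's observed scaling (the paper's nine bioinformatics applications need not all scale linearly); the point of the sketch is the split between an offline profile and cheap online calibration.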
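The abstract's closing result, reaching within 30% of a fully trained model with only four profiling experiments, suggests an experiment-design step that picks which instance types to profile under a fixed budget. The paper does not specify the selection rule here; the sketch below shows one plausible, prior-knowledge-free approach, farthest-point sampling over instance hardware specs, with all names and features being assumptions for illustration.

```python
# Hedged sketch: choose a small profiling budget by picking instance
# types whose specs are maximally spread out, so a model trained on
# those few runs covers the whole instance pool.
import numpy as np

def select_experiments(instance_specs, budget=4):
    """instance_specs: dict mapping instance type -> (vCPUs, memory_GiB)."""
    names = list(instance_specs)
    X = np.array([instance_specs[n] for n in names], dtype=float)
    X = (X - X.mean(axis=0)) / X.std(axis=0)  # normalize spec scales
    chosen = [0]                              # seed with an arbitrary type
    while len(chosen) < budget:
        # Distance from each candidate to its nearest already-chosen type.
        dists = np.min(
            [np.linalg.norm(X - X[c], axis=1) for c in chosen], axis=0
        )
        chosen.append(int(np.argmax(dists)))  # add the farthest candidate
    return [names[i] for i in chosen]

# Example with hypothetical AWS-like instance specs:
print(select_experiments({
    "m5.large": (2, 8), "m5.4xlarge": (16, 64), "c5.xlarge": (4, 8),
    "r5.2xlarge": (8, 64), "t3.medium": (2, 4),
}))
```

The trade-off the paper evaluates is then direct: each added experiment costs a profiling run on another instance type but shrinks the gap to a model trained on all of them.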