OMBM-ML:保证服务质量和提高服务器利用率的高效内存带宽管理

2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W) Pub Date : 2018-09-01 DOI:10.1109/FAS-W.2018.00028

Min Jeesoo, Sung Hanul, Eom Hyeonsang

{"title":"OMBM-ML:保证服务质量和提高服务器利用率的高效内存带宽管理","authors":"Min Jeesoo, Sung Hanul, Eom Hyeonsang","doi":"10.1109/FAS-W.2018.00028","DOIUrl":null,"url":null,"abstract":"As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resourse utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8x on average.","PeriodicalId":164903,"journal":{"name":"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"OMBM-ML: An Efficient Memory Bandwidth Management for Ensuring QoS and Improving Server Utilization\",\"authors\":\"Min Jeesoo, Sung Hanul, Eom Hyeonsang\",\"doi\":\"10.1109/FAS-W.2018.00028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resourse utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8x on average.\",\"PeriodicalId\":164903,\"journal\":{\"name\":\"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FAS-W.2018.00028\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FAS-W.2018.00028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着云数据中心的急剧增长，由于维护和硬件资源的成本优势，各种应用程序被转移到云数据中心。但是，延迟关键型工作负载在完全实现成本效益方面存在一些问题。延迟关键型工作负载应该以稳定的方式显示延迟，以预测延迟，以严格满足qos。但是，如果它们与其他工作负载一起执行以节省成本，则由于争用与协同定位工作负载共享的硬件资源，它们会遇到QoS冲突。为了保证qos和提高硬件资源利用率，我们提出了一种基于机器学习的有效预测模型的内存带宽管理方法。预测模型根据REP决策树估计将分配给延迟关键工作负载的内存带宽量。为了构建这个模型，我们首先收集数据并用数据训练模型。生成的模型可以估计满足延迟关键型工作负载的SLO所需的内存带宽量，而不管并置了什么批处理工作负载。使用我们的方法可以实现高达99%的SLO保证，并将服务器利用率平均提高到6.8倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

OMBM-ML: An Efficient Memory Bandwidth Management for Ensuring QoS and Improving Server Utilization

As cloud data centers are dramatically growing, various applications are moved to cloud data centers owing to cost benefits for maintenance and hardware resources. However, latency-critical workloads among them suffer from some problems to fully achieve the cost effectiveness. The latency-critical workloads should show latencies in a stable manner, to be predicted, for strictly meeting QoSs. However, if they are executed with other workloads to save the cost, they experience QoS violation due to the contention for the hardware resources shared with co-location workloads. In order to guarantee QoSs and to improve the hardware resourse utilization, we proposed a memory bandwidth management method with an effective prediction model using machine learning. The prediction model estimates the amount of memory bandwidth that will be allocated to the latency-critical workload based on a REP decision tree. To construct this model, we first collect data and train the model with the data. The generated model can estimate the amount of memory bandwidth for meeting the SLO of the latency-critical workload no matter what batch processing workloads are collocated. The use of our approach achieves up to 99% SLO assurance and improves the server utilization up to 6.8x on average.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE 3rd International Workshops on Foundations and Applications of Self* Systems (FAS*W)

自引率

0.00%

发文量