Themis: Fair Memory Subsystem Resource Sharing with Differentiated QoS in Public Clouds

Proceedings of the 51st International Conference on Parallel Processing Pub Date : 2022-08-29 DOI:10.1145/3545008.3545064

Wenda Tang, Senbo Fu, Y. Ke, Qian Peng, Feng Gao

{"title":"Themis: Fair Memory Subsystem Resource Sharing with Differentiated QoS in Public Clouds","authors":"Wenda Tang, Senbo Fu, Y. Ke, Qian Peng, Feng Gao","doi":"10.1145/3545008.3545064","DOIUrl":null,"url":null,"abstract":"To reduce the increasing cost of building and operating cloud data centers, cloud providers are seeking various mechanisms to achieve higher resource effectiveness. For example, cloud operators are leveraging dynamic resource management techniques to consolidate a higher density of application workloads into commodity physical servers to maximize server resource utilization. However, higher workload density is a major source of performance interference problems in multi-tenant clouds. Existing performance isolation techniques such as dedicated CPU cores for specific workloads are not enough as there are still common resource (e.g., last-level cache and memory bandwidth in memory subsystem) on the processor that are shared among all CPUs on the same NUMA node. While prior work has proposed a variety of resource partitioning techniques, it still remains unexplored to characterize the impact of memory subsystem resource partitioning for the consolidated workloads with different priorities and investigate software support to dynamically manage memory subsystem resource sharing in a real-time manner. To bridge the gap, we propose Themis, a feedback-based controller that enables a priority-aware and fairness-aware memory subsystem resource management strategy to guarantee the performance of high-priority workloads while maintaining fairness across all colocated workloads in high-density clouds. Themis is evaluated with multiple typical cloud applications in our data center environment. The results show that Themis improves the performance of various workloads by up to 3.15%, and fairness by more than 70% in memory subsystem resource allocation compared to existing state-of-the-art work.","PeriodicalId":360504,"journal":{"name":"Proceedings of the 51st International Conference on Parallel Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 51st International Conference on Parallel Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3545008.3545064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

To reduce the increasing cost of building and operating cloud data centers, cloud providers are seeking various mechanisms to achieve higher resource effectiveness. For example, cloud operators are leveraging dynamic resource management techniques to consolidate a higher density of application workloads into commodity physical servers to maximize server resource utilization. However, higher workload density is a major source of performance interference problems in multi-tenant clouds. Existing performance isolation techniques such as dedicated CPU cores for specific workloads are not enough as there are still common resource (e.g., last-level cache and memory bandwidth in memory subsystem) on the processor that are shared among all CPUs on the same NUMA node. While prior work has proposed a variety of resource partitioning techniques, it still remains unexplored to characterize the impact of memory subsystem resource partitioning for the consolidated workloads with different priorities and investigate software support to dynamically manage memory subsystem resource sharing in a real-time manner. To bridge the gap, we propose Themis, a feedback-based controller that enables a priority-aware and fairness-aware memory subsystem resource management strategy to guarantee the performance of high-priority workloads while maintaining fairness across all colocated workloads in high-density clouds. Themis is evaluated with multiple typical cloud applications in our data center environment. The results show that Themis improves the performance of various workloads by up to 3.15%, and fairness by more than 70% in memory subsystem resource allocation compared to existing state-of-the-art work.

查看原文本刊更多论文

主题:公共云中具有差异化QoS的公平内存子系统资源共享

为了降低构建和运营云数据中心的成本，云提供商正在寻求各种机制来实现更高的资源效率。例如，云运营商正在利用动态资源管理技术，将更高密度的应用程序工作负载整合到商品物理服务器中，以最大限度地利用服务器资源。然而，更高的工作负载密度是多租户云中性能干扰问题的主要来源。现有的性能隔离技术，如针对特定工作负载的专用CPU内核是不够的，因为处理器上仍然存在公共资源(例如，内存子系统中的最后一级缓存和内存带宽)，这些资源在同一NUMA节点上的所有CPU之间共享。虽然以前的工作已经提出了各种各样的资源分区技术，但它仍然没有探索表征内存子系统资源分区对具有不同优先级的合并工作负载的影响，并研究软件支持以实时方式动态管理内存子系统资源共享。为了弥补这一差距，我们提出了Themis，一个基于反馈的控制器，它支持优先级感知和公平感知的内存子系统资源管理策略，以保证高优先级工作负载的性能，同时保持高密度云中所有并发工作负载的公平性。Themis在我们的数据中心环境中用多个典型的云应用程序进行了评估。结果表明，与现有的最先进的工作相比，Themis将各种工作负载的性能提高了3.15%，内存子系统资源分配的公平性提高了70%以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 51st International Conference on Parallel Processing

自引率

0.00%

发文量