DCAPS: dynamic cache allocation with partial sharing

Proceedings of the Thirteenth EuroSys Conference Pub Date : 2018-04-23 DOI:10.1145/3190508.3190511

Yaocheng Xiang, Xiaolin Wang, Zihui Huang, Zeyu Wang, Yingwei Luo, Zhenlin Wang

{"title":"DCAPS: dynamic cache allocation with partial sharing","authors":"Yaocheng Xiang, Xiaolin Wang, Zihui Huang, Zeyu Wang, Yingwei Luo, Zhenlin Wang","doi":"10.1145/3190508.3190511","DOIUrl":null,"url":null,"abstract":"In a multicore system, effective management of shared last level cache (LLC), such as hardware/software cache partitioning, has attracted significant research attention. Some eminent progress is that Intel introduced Cache Allocation Technology (CAT) to its commodity processors recently. CAT implements way partitioning and provides software interface to control cache allocation. Unfortunately, CAT can only allocate at way level, which does not scale well for a large thread or program count to serve their various performance goals effectively. This paper proposes Dynamic Cache Allocation with Partial Sharing (DCAPS), a framework that dynamically monitors and predicts a multi-programmed workload's cache demand, and reallocates LLC given a performance target. Further, DCAPS explores partial sharing of a cache partition among programs and thus practically achieves cache allocation at a finer granularity. DCAPS consists of three parts: (1) Online Practical Miss Rate Curve (OPMRC), a low-overhead software technique to predict online miss rate curves (MRCs) of individual programs of a workload; (2) a prediction model that estimates the LLC occupancy of each individual program under any CAT allocation scheme; (3) a simulated annealing algorithm that searches for a near-optimal CAT scheme given a specific performance goal. Our experimental results show that DCAPS is able to optimize for a wide range of performance targets and can scale to a large core count.","PeriodicalId":334267,"journal":{"name":"Proceedings of the Thirteenth EuroSys Conference","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"55","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Thirteenth EuroSys Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3190508.3190511","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 55

Abstract

In a multicore system, effective management of shared last level cache (LLC), such as hardware/software cache partitioning, has attracted significant research attention. Some eminent progress is that Intel introduced Cache Allocation Technology (CAT) to its commodity processors recently. CAT implements way partitioning and provides software interface to control cache allocation. Unfortunately, CAT can only allocate at way level, which does not scale well for a large thread or program count to serve their various performance goals effectively. This paper proposes Dynamic Cache Allocation with Partial Sharing (DCAPS), a framework that dynamically monitors and predicts a multi-programmed workload's cache demand, and reallocates LLC given a performance target. Further, DCAPS explores partial sharing of a cache partition among programs and thus practically achieves cache allocation at a finer granularity. DCAPS consists of three parts: (1) Online Practical Miss Rate Curve (OPMRC), a low-overhead software technique to predict online miss rate curves (MRCs) of individual programs of a workload; (2) a prediction model that estimates the LLC occupancy of each individual program under any CAT allocation scheme; (3) a simulated annealing algorithm that searches for a near-optimal CAT scheme given a specific performance goal. Our experimental results show that DCAPS is able to optimize for a wide range of performance targets and can scale to a large core count.

查看原文本刊更多论文

DCAPS:带有部分共享的动态缓存分配

在多核系统中，共享最后一级缓存(LLC)的有效管理，如硬件/软件缓存分区，已经引起了广泛的研究关注。一些显著的进步是英特尔最近在其商用处理器中引入了缓存分配技术(CAT)。CAT实现了路径分区，并提供了控制缓存分配的软件接口。不幸的是，CAT只能在路径级别进行分配，这对于大型线程或程序数量来说并不能很好地扩展，从而无法有效地满足它们的各种性能目标。本文提出了基于部分共享的动态缓存分配(DCAPS)框架，该框架能够动态监控和预测多程序工作负载的缓存需求，并在给定性能目标的情况下重新分配有限资源。此外，DCAPS探索程序之间缓存分区的部分共享，从而实际实现更细粒度的缓存分配。DCAPS由三个部分组成:(1)在线实际脱靶率曲线(OPMRC)，这是一种低开销的软件技术，用于预测工作负载中单个程序的在线脱靶率曲线(mrc);(2)在任何CAT分配方案下，估算每个单独项目的LLC占用率的预测模型;(3)在给定特定性能目标的情况下，搜索接近最优CAT方案的模拟退火算法。我们的实验结果表明，DCAPS能够针对广泛的性能目标进行优化，并且可以扩展到大的核心计数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Thirteenth EuroSys Conference

自引率

0.00%

发文量