TokenTLB: A Token-Based Page Classification Approach

Proceedings of the 2016 International Conference on Supercomputing Pub Date : 2016-06-01 DOI:10.1145/2925426.2926280

Albert Esteve, Alberto Ros, A. Robles, M. E. Gómez, J. Duato

{"title":"TokenTLB: A Token-Based Page Classification Approach","authors":"Albert Esteve, Alberto Ros, A. Robles, M. E. Gómez, J. Duato","doi":"10.1145/2925426.2926280","DOIUrl":null,"url":null,"abstract":"Classifying memory accesses into private or shared data has become a fundamental approach to achieving efficiency and scalability in multi- and many-core systems. Since most memory accesses in both sequential and parallel applications are either private (accessed only by one core) or read-only (not written) data, devoting the full cost of coherence to every memory access results in sub-optimal performance and limits the scalability and efficiency of the multiprocessor. This work proposes TokenTLB, a page classification approach based on exchange and count of tokens. The key observation behind our proposal is that, opposed to coherence management, data classification meets all the benefits of a token-based approach without the burden of complex arbitration mechanisms, which has discouraged the implementation of token-based coherence protocols in commodity systems. Token counting on TLBs is a natural and efficient way for classifying memory pages. It does not require the use of complex and undesirable persistent requests or arbitration, since when two or more TLBs race for accessing a page, tokens are appropriately distributed classifying the page as shared. TokenTLB also favors shareability of translation information among TLBs, which improves system performance and constrains much of the TLB traffic compared to other broadcast-based approaches. It is achieved by requiring only TLBs holding extra tokens provide them along with the page translation (about one response per TLB miss). TokenTLB effectively increases blocks classified as private up to 61.1% while allowing read-only detection (24.4% shared-read-only blocks). When TokenTLB is applied to optimize the directory, it reduces the dynamic energy consumed by the cache hierarchy by nearly 27.3% over the baseline.","PeriodicalId":422112,"journal":{"name":"Proceedings of the 2016 International Conference on Supercomputing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2016 International Conference on Supercomputing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2925426.2926280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

Classifying memory accesses into private or shared data has become a fundamental approach to achieving efficiency and scalability in multi- and many-core systems. Since most memory accesses in both sequential and parallel applications are either private (accessed only by one core) or read-only (not written) data, devoting the full cost of coherence to every memory access results in sub-optimal performance and limits the scalability and efficiency of the multiprocessor. This work proposes TokenTLB, a page classification approach based on exchange and count of tokens. The key observation behind our proposal is that, opposed to coherence management, data classification meets all the benefits of a token-based approach without the burden of complex arbitration mechanisms, which has discouraged the implementation of token-based coherence protocols in commodity systems. Token counting on TLBs is a natural and efficient way for classifying memory pages. It does not require the use of complex and undesirable persistent requests or arbitration, since when two or more TLBs race for accessing a page, tokens are appropriately distributed classifying the page as shared. TokenTLB also favors shareability of translation information among TLBs, which improves system performance and constrains much of the TLB traffic compared to other broadcast-based approaches. It is achieved by requiring only TLBs holding extra tokens provide them along with the page translation (about one response per TLB miss). TokenTLB effectively increases blocks classified as private up to 61.1% while allowing read-only detection (24.4% shared-read-only blocks). When TokenTLB is applied to optimize the directory, it reduces the dynamic energy consumed by the cache hierarchy by nearly 27.3% over the baseline.

查看原文本刊更多论文

TokenTLB:基于令牌的页面分类方法

将内存访问分类为私有或共享数据已经成为在多核和多核系统中实现效率和可伸缩性的基本方法。由于顺序和并行应用程序中的大多数内存访问要么是私有的(仅由一个核心访问)，要么是只读的(未写入的)数据，因此将一致性的全部成本用于每次内存访问会导致性能不理想，并限制了多处理器的可伸缩性和效率。这项工作提出了TokenTLB，一种基于令牌交换和计数的页面分类方法。我们的建议背后的关键观察是，与一致性管理相反，数据分类满足基于令牌方法的所有好处，而没有复杂仲裁机制的负担，这阻碍了在商品系统中实施基于令牌的一致性协议。tlb上的令牌计数是对内存页进行分类的一种自然而有效的方法。它不需要使用复杂且不受欢迎的持久请求或仲裁，因为当两个或多个tlb竞争访问页面时，令牌会被适当地分发，并将页面分类为共享。TokenTLB还支持TLB之间翻译信息的可共享性，与其他基于广播的方法相比，这提高了系统性能并限制了TLB流量。这是通过只要求持有额外令牌的TLB在页面翻译时提供它们来实现的(大约每次TLB错过一个响应)。TokenTLB有效地将分类为私有的块增加了61.1%，同时允许只读检测(24.4%的共享-只读块)。当应用TokenTLB来优化目录时，它将缓存层次结构消耗的动态能量比基线减少了近27.3%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2016 International Conference on Supercomputing

自引率

0.00%

发文量