Pollution control caching

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors Pub Date : 1995-10-02 DOI:10.1109/ICCD.1995.528825

S. J. Walsh, J. Board

{"title":"Pollution control caching","authors":"S. J. Walsh, J. Board","doi":"10.1109/ICCD.1995.528825","DOIUrl":null,"url":null,"abstract":"The bandwidth mismatch of today's high speed processors and standard DRAMS is between a factor of 10 to 50. From 1995 to the year 2000 this mismatch is expected to grow to three orders of magnitude, necessitating greater emphasis for on-chip caches. Today on-chip caches typically consume from 20% to 50% of the total chip area and their cost is mostly a function of the chip area they consume. Clearly, any technique which can maintain memory performance and reduce chip area requirements is extremely important. In this paper we present two novel cache architectures called pollution control caching (PCC) and pollution control caching plus victim buffering (PCC+VB). We have used trace driven simulation to obtain miss ratio statistics and we developed analytical models of the expected clock cycles per instruction (E[CPI]) for each architecture and cache size studied. Analytical models were parameterized with the results of our trace driven simulation. These models incorporate provisions to study the effect that on-chip cache size has on access time, and the effect that this and different main memory latencies have on the E[CPI]. Chip area models were also developed for each architecture and used as a basis for comparison. Finally, we used ANOVA techniques to better quantify the differences in the miss rate performance of the cache sizes and cache architectures studied. Our research has shown that, given the constraints of our design space, PCC+VB equipped caches can match the miss rate performance and E[CPI] of direct napped caches that require greater than five times the silicon area.","PeriodicalId":281907,"journal":{"name":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD.1995.528825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

Abstract

The bandwidth mismatch of today's high speed processors and standard DRAMS is between a factor of 10 to 50. From 1995 to the year 2000 this mismatch is expected to grow to three orders of magnitude, necessitating greater emphasis for on-chip caches. Today on-chip caches typically consume from 20% to 50% of the total chip area and their cost is mostly a function of the chip area they consume. Clearly, any technique which can maintain memory performance and reduce chip area requirements is extremely important. In this paper we present two novel cache architectures called pollution control caching (PCC) and pollution control caching plus victim buffering (PCC+VB). We have used trace driven simulation to obtain miss ratio statistics and we developed analytical models of the expected clock cycles per instruction (E[CPI]) for each architecture and cache size studied. Analytical models were parameterized with the results of our trace driven simulation. These models incorporate provisions to study the effect that on-chip cache size has on access time, and the effect that this and different main memory latencies have on the E[CPI]. Chip area models were also developed for each architecture and used as a basis for comparison. Finally, we used ANOVA techniques to better quantify the differences in the miss rate performance of the cache sizes and cache architectures studied. Our research has shown that, given the constraints of our design space, PCC+VB equipped caches can match the miss rate performance and E[CPI] of direct napped caches that require greater than five times the silicon area.

查看原文本刊更多论文

污染控制缓存

今天的高速处理器和标准dram的带宽不匹配在10到50之间。从1995年到2000年，这种不匹配预计将增长到三个数量级，因此需要更加重视片上缓存。如今，片上高速缓存通常消耗总芯片面积的20%到50%，它们的成本主要是它们消耗的芯片面积的函数。显然，任何能够保持内存性能和减少芯片面积要求的技术都是极其重要的。本文提出了两种新的缓存结构:污染控制缓存(PCC)和污染控制缓存加受害者缓存(PCC+VB)。我们使用跟踪驱动仿真来获得缺失率统计数据，并针对所研究的每种架构和缓存大小开发了每指令预期时钟周期(E[CPI])的分析模型。用轨迹驱动仿真的结果对解析模型进行了参数化。这些模型包含了一些条款来研究片上缓存大小对访问时间的影响，以及这个和不同的主存延迟对E[CPI]的影响。还为每种架构开发了芯片面积模型，并用作比较的基础。最后，我们使用方差分析技术来更好地量化所研究的缓存大小和缓存架构在缺失率性能上的差异。我们的研究表明，考虑到我们设计空间的限制，配备PCC+VB的缓存可以匹配需要大于5倍硅面积的直接捕获缓存的失分率性能和E[CPI]。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors

自引率

0.00%

发文量