{"title":"Addressing mode driven low power data caches for embedded processors","authors":"R. Peri, John Fernando, R. Kolagotla","doi":"10.1145/1054943.1054961","DOIUrl":"https://doi.org/10.1145/1054943.1054961","url":null,"abstract":"The size and speed of first-level caches and SRAMs of embedded processors continue to increase in response to demands for higher performance. In power-sensitive devices like PDAs and cellular handsets, decreasing power consumption while increasing performance is desirable. Contemporary caches typically exploit locality in memory access patterns but do not exploit locality information encoded in addressing modes used to access memory. We present two schemes that use locality information inherent in memory addressing modes to reduce power consumption of cache or SRAM nearest to the processor. The level-0 data buffer scheme introduces a set of data buffers controlled by the addressing mode to eliminate over a third of all reads to the next level of memory (cache or SRAM). These buffers can also reduce load-use penalty in processors with long load pipelines. The address register tag-buffer scheme exploits the addressing mode to reduce tag array look-up in set associative first-level caches.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124957609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A study of performance impact of memory controller features in multi-processor server environment","authors":"C. Natarajan, Bruce Christenson, F. Briggs","doi":"10.1145/1054943.1054954","DOIUrl":"https://doi.org/10.1145/1054943.1054954","url":null,"abstract":"With the growing imbalance between processor and memory performance it becomes more and more important to optimize the memory controller features to obtain the maximum possible performance out of the memory subsystem. This paper presents a study of the performance impact of several memory controller features in multi-processor (MP) server environments that use a DDR/DDR2 based memory subsystem. The results from our studies show that significant performance improvements can be obtained by carefully optimizing the memory controller features. For instance, one of our studies shows that in a system with an in-order shared bus connecting the CPUs and memory controller, an intelligent read-to-write switching memory controller feature can provide the same order of benefit as doubling the number of interleaved memory ranks. Another study shows that much lower average loaded read latency across a wider range of throughput can be obtained by a delayed write scheduling feature.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133344514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Opie compiler from row-major source to Morton-ordered matrices","authors":"Steven T. Gabriel, David S. Wise","doi":"10.1145/1054943.1054962","DOIUrl":"https://doi.org/10.1145/1054943.1054962","url":null,"abstract":"The Opie Project aims to develop a compiler to transform C codes written for row-major matrix representation into equivalent codes for Morton-order matrix representation, and to apply its techniques to other languages. Accepting a possible reduction in performance we seek to compile a library of usable code to support future development of new algorithms better suited to Morton-ordered matrices.This paper reports the formalism behind the OPIE compiler for C, its status: now compiling several standard Level-2 and Level-3 linear algebra operations, and a demonstration of a breakthrough reflected in a huge reduction of L1, L2, TLB misses. Overall performance improves on the Intel Xeon architecture.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"47 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130807587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cache organizations for clustered microarchitectures","authors":"José González, Fernando Latorre, Antonio González","doi":"10.1145/1054943.1054950","DOIUrl":"https://doi.org/10.1145/1054943.1054950","url":null,"abstract":"Clustered microarchitectures are an effective organization to deal with the problem of wire delays and complexity by partitioning some of the processor resources. The organization of the data cache is a key factor in these processors due to its effect on cache miss rate and inter-cluster communications. This paper investigates alternative designs of the data cache: centralized, distributed, replicated and physically distributed cache architectures are analyzed. Results show similar average performance but significant performance variations depending on the application features, specially cache miss ratio and communications. In addition, we also propose a novel instruction steering scheme in order to reduce communications. This scheme conditionally stalls the dispatch of instructions depending on the occupancy of the clusters, whenever the current instruction cannot be steered to the cluster holding most of the inputs. This new steering outperforms traditional schemes. Results show, an average speedup of 5% and up to 15% for some applications.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133599574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An analytical model for software-only main memory compression","authors":"I. Tuduce, T. Gross","doi":"10.1145/1054943.1054958","DOIUrl":"https://doi.org/10.1145/1054943.1054958","url":null,"abstract":"Many applications with large data spaces that cannot run on a typical workstation (due to page faults) call for techniques to expand the effective memory size. One such technique is memory compression.Understanding what applications under what conditions can benefit from main memory compression is complicated due to various tradeoffs and the dynamic characteristics of applications. For instance, a large area to store compressed data increases the effective memory size considerably but also decreases the amount of memory that can hold uncompressed data.This paper presents an analytical model that states the conditions for a compressed-memory system to yield performance improvements. Parameters of the model are the compression algorithm efficiency, the amount of data being compressed, and the application memory access pattern. Such a model can be used by an operating system to compute the size of the compressed-memory level that can improve an application's performance.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127418165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A low cost, multithreaded processing-in-memory system","authors":"J. Brockman, Shyamkumar Thoziyoor, Shannon K. Kuntz, P. Kogge","doi":"10.1145/1054943.1054946","DOIUrl":"https://doi.org/10.1145/1054943.1054946","url":null,"abstract":"This paper discusses die cost vs. performance tradeoffs for a PIM system that could serve as the memory system of a host processor. For an increase of less than twice the cost of a commodity DRAM part, it is possible to realize a performance speedup of nearly a factor of 4 on irregular applications. This cost efficiency derives from developing a custom multithreaded processor architecture and implementation style that is well-suited for embedding in a memory. Specifically, it takes advantage of the low latency and high row bandwidth to both simplify processor design --- reducing area --- as well as to improve processing throughput. To support our claims of cost and performance, we have used simulation, analysis of existing chips, and also designed and fully implemented a prototype chip, PIM Lite.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123655169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A compressed memory hierarchy using an indirect index cache","authors":"Erik G. Hallnor, S. Reinhardt","doi":"10.1145/1054943.1054945","DOIUrl":"https://doi.org/10.1145/1054943.1054945","url":null,"abstract":"The large and growing impact of memory hierarchies on overall system performance compels designers to investigate innovative techniques to improve memory-system efficiency. We propose and analyze a memory hierarchy that increases both the effective capacity of memory structures and the effective bandwidth of interconnects by storing and transmitting data in compressed form.Caches play a key role in hiding memory latencies. However, cache sizes are constrained by die area and cost. A cache's effective size can be increased by storing compressed data, if the storage unused by a compressed block can be allocated to other blocks. We use a modified Indirect Index Cache to allocate variable amounts of storage to different blocks, depending on their compressibility.By coupling our compressed cache design with a similarly compressed main memory, we can easily transfer data between these structures in a compressed state, increasing the effective memory bus bandwidth. This optimization further improves performance when bus bandwidth is critical.Our simulation results, using the SPEC CPU2000 benchmarks, show that our design increases performance by up to 225% on some benchmarks while degrading performance in general by no more than 2%, other than a 12% decrease on a single benchmark. Compressed bus transfers alone account for up to 80% of this improvement, with the remainder coming from increased effective cache capacity. As memory latencies increase, our design becomes even more beneficial.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"208 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123392756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SCIMA-SMP: on-chip memory processor architecture for SMP","authors":"C. Takahashi, Masaaki Kondo, T. Boku, D. Takahashi, Hiroshi Nakamura, M. Sato","doi":"10.1145/1054943.1054960","DOIUrl":"https://doi.org/10.1145/1054943.1054960","url":null,"abstract":"In this paper, we propose a processor architecture with programmable on-chip memory for a high-performance SMP (symmetric multi-processor) node named SCIMA-SMP (Software Controlled Integrated Memory Architecture for SMP) with the intent of solving the performance gap problem between a processor and off-chip memory. With special instructions which enable the explicit data transfer between on-chip memory and off-chip memory, this architecture is able to control the data transfer timing and its granularity by the application program, and the SMP bus is utilized efficiently compared with traditional cache-only architecture. Through the performance evaluation based on clock-level simulation for various HPC applications, we confirmed that this architecture largely reduces the bus access cycle by avoiding redundant data transfer and controlling the granularity of the data movement between on-chip and off-chip memory.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122293411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A localizing directory coherence protocol","authors":"Collin McCurdy, C. Fischer","doi":"10.1145/1054943.1054947","DOIUrl":"https://doi.org/10.1145/1054943.1054947","url":null,"abstract":"User-controllable coherence revives the idea of cooperation between software and hardware in an attempt to bridge the gap between efficient small-scale shared memory machines and massive distributed memory machines. It proposes a new multiprocessor architecture which has both a global address-space and multiple processor-local address-spaces with new memory instructions and a new coherence protocol to manage the dual address-spaces.The purpose of this paper is twofold. First, we solidify the semantics of instruction set extensions that enable \"localization\" -- the act of moving data from the global address-space to a processor's local address-space -- thus clearly defining the requirements for a localizing coherence protocol. Second, we demonstrate the feasibility of localizing coherence by describing the workings of a full-scale directory-based protocol that we have implemented and tested using an existing protocol specification tool.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133592251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable cache memory design for large-scale SMT architectures","authors":"M. Mudawar","doi":"10.1145/1054943.1054952","DOIUrl":"https://doi.org/10.1145/1054943.1054952","url":null,"abstract":"The cache hierarchy design in existing SMT and superscalar processors is optimized for latency, but not for band-width. The size of the L1 data cache did not scale over the past decade. Instead, larger unified L2 and L3 caches were introduced. This cache hierarchy has a high overhead due to the principle of containment. It also has a complex design to maintain cache coherence across all levels. Furthermore, this cache hierarchy is not suitable for future large-scale SMT processors, which will demand high bandwidth instruction and data caches with a large number of ports.This paper suggests the elimination of the cache hierarchy and replacing it with one-level caches for instruction and data. Multiple instruction caches can be used in parallel to scale the instruction fetch bandwidth and the overall cache capacity. A one-level data cache can be split into a number of block-interleaved cache banks to serve multiple memory requests in parallel. An interconnect is used to connect the data cache ports to the different cache banks, thus increasing the data cache access time. This paper shows that large-scale SMTs can tolerate long data cache hit times. It also shows that small line buffers can enhance the performance and reduce the required number of ports to the banked data cache memory.","PeriodicalId":249099,"journal":{"name":"Workshop on Memory Performance Issues","volume":"60 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129723096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}