Workshop on Memory System Performance and Correctness最新文献

A study of data structures with a deep heap shape 具有深堆形状的数据结构研究

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492413

Haggai Eran, E. Petrank

引用次数: 5

A new perspective on processing-in-memory architecture design 内存中处理架构设计的新视角

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492418

D. Zhang, N. Jayasena, Alexander Lyashevsky, J. Greathouse, Mitesh R. Meswani, Mark Nutter, Mike Ignatowski

引用次数: 47

Software-controlled transparent management of heterogeneous memory resources in virtualized systems 虚拟化系统中异构内存资源的软件控制透明管理

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492416

Min Lee, Vishal Gupta, K. Schwan

{"title":"Software-controlled transparent management of heterogeneous memory resources in virtualized systems","authors":"Min Lee, Vishal Gupta, K. Schwan","doi":"10.1145/2492408.2492416","DOIUrl":"https://doi.org/10.1145/2492408.2492416","url":null,"abstract":"This paper presents a software-controlled technique for managing the heterogeneous memory resources of next generation multicore platforms with fast 3D die-stacked memory and additional slow off-chip memory. Implemented for virtualized server systems, the technique detects the 'hot' pages critical to program performance in order to then maintain them in the scarce fast 3D memory resources. Challenges overcome for the technique's implementation include the need to minimize its runtime overheads, the lack of hypervisor-level direct visibility into the memory access behavior of guest virtual machines, and the need to make page migration transparent to guests. This paper presents hypervisor-level mechanisms that (i) build a page access history of virtual machines, by periodically scanning page-table access bits and (ii) intercept guest page table operations to create mirrored page-tables and enable guest-transparent page migration. The methods are implemented in the Xen hypervisor and evaluated on a larger scale multicore platform. The resulting ability to characterize the memory behavior of representative server workloads demonstrates the feasibility of software-managed heterogeneous memory resources.","PeriodicalId":130040,"journal":{"name":"Workshop on Memory System Performance and Correctness","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117034573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

APE: accelerator processor extensions to optimize data-compute co-location APE:加速器处理器扩展，以优化数据计算协同定位

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492412

Ganesh Venkatesh

{"title":"APE: accelerator processor extensions to optimize data-compute co-location","authors":"Ganesh Venkatesh","doi":"10.1145/2492408.2492412","DOIUrl":"https://doi.org/10.1145/2492408.2492412","url":null,"abstract":"Two technological trends we notice in the current day systems is the march towards many core systems and greater focus on power efficiency. The increase in core counts would result in smaller caches-per-compute node and greater reliance on exposing task-level parallelism in applications. However, this would potentially increase the amount of data that moves within and between the different tasks and hence, the related power costs. This will pose a new burden on the already power-constrained current day systems. The situation would only get worse as we go forward because the power consumed by the wires is not scaling down much with each technology generation, but the amount of data that these wires move is increasing per generation.\u0000 This paper addresses this concern by identifying the memory access patterns that accounts for much of the data movement and designing processor extensions, Apes to support them. These processor extensions are placed closer to the cache structures, rather than the core pipeline, to reduce the data movement and improve compute-data co-location. We show that by doing this we are able to reduce a task's memory accesses by ~2.5×, data movement by 4× and cache miss rate by 40% for a wide range of applications.","PeriodicalId":130040,"journal":{"name":"Workshop on Memory System Performance and Correctness","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130973857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Analyzing locality of memory references in GPU architectures 分析GPU架构中内存引用的局部性

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492423

Saurabh Gupta, Ping Xiang, Huiyang Zhou

引用次数: 9

Introducing kernel-level page reuse for high performance computing 为高性能计算引入内核级页面重用

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492414

S. Valat, Marc Pérache, W. Jalby

引用次数: 10

Software-level scheduling to exploit non-uniformly shared data cache on GPGPU 利用GPGPU非均匀共享数据缓存的软件级调度

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492421

Bo Wu, Weilin Wang, Xipeng Shen

引用次数: 0

A coldness metric for cache optimization 缓存优化的冷度度量

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492419

Raj Parihar, C. Ding, Michael C. Huang

引用次数: 1

Cache rationing for multicore 多核缓存配给

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492422

Jacob Brock, C. Ding

引用次数: 0

All-window data liveness 全窗口数据活动性

Workshop on Memory System Performance and Correctness Pub Date : 2013-06-16 DOI: 10.1145/2492408.2492420

Pengcheng Li, C. Ding

引用次数: 8