SAMO: store aware memory optimizations

Proceedings of the 11th ACM Conference on Computing Frontiers Pub Date : 2014-05-20 DOI:10.1145/2597917.2597940

K. Raghavendra, Tripti S. Warrier, M. Mutyam

引用次数: 0

Abstract

Cache optimizations and DRAM scheduling play an important role in determining the performance of a system given that the demand for memory is ever increasing. In this paper we track stores both at cache and main memory and apply three different optimizations, one, at the cache level, so that stores are serviced faster and hence load store queue block cycles are reduced, two, at the miss handling architecture wherein we remove entries containing only store requests thereby reducing the cache stall cycles and three, at the main memory where stores are serviced with lesser priority so that actual reads get serviced faster. These three different memory optimizations combined together (store aware memory optimization, SAMO framework) on an average increase the performance of the system and can be augmented with any previously proposed optimization techniques at the memory. SAMO speeds-up the workloads on 4- and 8-core systems by a geometric mean of 5.0% and 7.4%, respectively, with a maximum speed-up of 21.9% and 17.8% on 4- and 8-core systems, respectively.

查看原文本刊更多论文

SAMO:存储感知内存优化

在内存需求不断增加的情况下，缓存优化和DRAM调度在决定系统性能方面起着重要作用。在本文中，我们跟踪了缓存和主存中的存储，并应用了三种不同的优化，一是在缓存级别，以便更快地为存储提供服务，从而减少了加载存储队列块周期;二是在丢失处理架构中，我们删除了只包含存储请求的条目，从而减少了缓存停滞周期;三是在主存中，存储的优先级较低，以便更快地为实际读取提供服务。这三种不同的内存优化组合在一起(存储感知内存优化、SAMO框架)平均可以提高系统的性能，并且可以使用任何先前提出的内存优化技术进行增强。SAMO将4核和8核系统上的工作负载的速度分别提高了5.0%和7.4%，在4核和8核系统上的最大速度分别提高了21.9%和17.8%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 11th ACM Conference on Computing Frontiers

自引率

0.00%

发文量