WildFire: a scalable path for SMPs

Proceedings Fifth International Symposium on High-Performance Computer Architecture Pub Date : 1999-01-09 DOI:10.1109/HPCA.1999.744361

Erik Hagersten, M. Koster

{"title":"WildFire: a scalable path for SMPs","authors":"Erik Hagersten, M. Koster","doi":"10.1109/HPCA.1999.744361","DOIUrl":null,"url":null,"abstract":"Researchers have searched for scalable alternatives to the symmetric multiprocessor (SMP) architecture since it was first introduced in 1982. The paper introduces an alternative view of the relationship between scalable technologies and SMPs. Instead of replacing large SMPs with scalable technology, we propose new scalable techniques that allow large SMPs to be tied together efficiently, while maintaining the compatibility with, and performance characteristics of, an SMP. The trade-offs of such an architecture differ from those of traditional, scalable, Non-Uniform Memory Architecture (cc-NUMA) approaches. WildFire is a distributed shared memory (DSM) prototype implementation based on large SMPs. It relies on two techniques for creating application-transparent locality: Coherent Memory Replication (CMR), which is a variation of Simple COMA/Reactive NUMA, and Hierarchical Affinity Scheduling (HAS). These two optimizations create extra node locality, which blurs the node boundaries to an application such that SMP-like performance can be achieved with no NUMA-specific optimizations. We present a performance study of a large OLTP benchmark running on DSMs built from various sized nodes and with varying amounts of application-transparent locality. WildFire's measured performance is shown to be more than two times that of an unoptimized NUMA implementation built from small nodes and within 13% of the performance of the ideal implementation: a large SMP with the same access time to its entire shared memory as the local memory access time of WildFire.","PeriodicalId":287867,"journal":{"name":"Proceedings Fifth International Symposium on High-Performance Computer Architecture","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"152","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Fifth International Symposium on High-Performance Computer Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCA.1999.744361","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 152

Abstract

Researchers have searched for scalable alternatives to the symmetric multiprocessor (SMP) architecture since it was first introduced in 1982. The paper introduces an alternative view of the relationship between scalable technologies and SMPs. Instead of replacing large SMPs with scalable technology, we propose new scalable techniques that allow large SMPs to be tied together efficiently, while maintaining the compatibility with, and performance characteristics of, an SMP. The trade-offs of such an architecture differ from those of traditional, scalable, Non-Uniform Memory Architecture (cc-NUMA) approaches. WildFire is a distributed shared memory (DSM) prototype implementation based on large SMPs. It relies on two techniques for creating application-transparent locality: Coherent Memory Replication (CMR), which is a variation of Simple COMA/Reactive NUMA, and Hierarchical Affinity Scheduling (HAS). These two optimizations create extra node locality, which blurs the node boundaries to an application such that SMP-like performance can be achieved with no NUMA-specific optimizations. We present a performance study of a large OLTP benchmark running on DSMs built from various sized nodes and with varying amounts of application-transparent locality. WildFire's measured performance is shown to be more than two times that of an unoptimized NUMA implementation built from small nodes and within 13% of the performance of the ideal implementation: a large SMP with the same access time to its entire shared memory as the local memory access time of WildFire.

查看原文本刊更多论文

WildFire: smp的可扩展路径

自对称多处理器(SMP)架构于1982年首次推出以来，研究人员一直在寻找可扩展的替代方案。本文介绍了可扩展技术和smp之间关系的另一种观点。我们不是用可扩展技术取代大型SMP，而是提出新的可扩展技术，允许将大型SMP有效地绑定在一起，同时保持与SMP的兼容性和性能特征。这种体系结构的权衡不同于传统的、可伸缩的、非统一内存体系结构(cc-NUMA)方法。WildFire是一种基于大型smp的分布式共享内存(DSM)原型实现。它依赖于两种技术来创建应用程序透明的局域性:相干内存复制(CMR)，它是简单昏迷/反应性NUMA的一种变体，以及分层亲和调度(HAS)。这两个优化创建了额外的节点局部性，模糊了应用程序的节点边界，这样就可以在没有特定于numa的优化的情况下实现类似smp的性能。我们提供了一个大型OLTP基准测试的性能研究，该测试运行在从不同大小的节点构建的dsm上，具有不同数量的应用程序透明局域。野火的测量性能是由小节点构建的未优化NUMA实现的两倍多，并且在理想实现的性能的13%以内:一个大型SMP对其整个共享内存的访问时间与野火的本地内存访问时间相同。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings Fifth International Symposium on High-Performance Computer Architecture

自引率

0.00%

发文量