Vm-based Shared Memory On Low-latency, Remote-memory-access Networks

L. Kontothanassis, G. Hunt, R. Stets, N. Hardavellas, Michal Cierniak, Srinivasan Parthasarathy, Wagner Meira, Jr, S. Dwarkadas, M. Scott
{"title":"Vm-based Shared Memory On Low-latency, Remote-memory-access Networks","authors":"L. Kontothanassis, G. Hunt, R. Stets, N. Hardavellas, Michal Cierniak, Srinivasan Parthasarathy, Wagner Meira, Jr, S. Dwarkadas, M. Scott","doi":"10.1145/264107.264163","DOIUrl":null,"url":null,"abstract":"Recent technological advances have produced network interfaces that provide users with very low-latency access to the memory of remote machines. We examine the impact of such networks on the implementation and performance of software DSM. Specifically, we compare two DSM systems---Cashmere and TreadMarks---on a 32-processor DEC Alpha cluster connected by a Memory Channel network.Both Cashmere and TreadMarks use virtual memory to maintain coherence on pages, and both use lazy, multi-writer release consistency. The systems differ dramatically, however, in the mechanisms used to track sharing information and to collect and merge concurrent updates to a page, with the result that Cashmere communicates much more frequently, and at a much finer grain.Our principal conclusion is that low-latency networks make DSM based on fine-grain communication competitive with more coarse-grain approaches, but that further hardware improvements will be needed before such systems can provide consistently superior performance. In our experiments, Cashmere scales slightly better than TreadMarks for applications with false sharing. At the same time, it is severely constrained by limitations of the current Memory Channel hardware. In general, performance is better for TreadMarks.","PeriodicalId":405506,"journal":{"name":"Conference Proceedings. The 24th Annual International Symposium on Computer Architecture","volume":"193 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"78","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Proceedings. The 24th Annual International Symposium on Computer Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/264107.264163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 78

Abstract

Recent technological advances have produced network interfaces that provide users with very low-latency access to the memory of remote machines. We examine the impact of such networks on the implementation and performance of software DSM. Specifically, we compare two DSM systems---Cashmere and TreadMarks---on a 32-processor DEC Alpha cluster connected by a Memory Channel network.Both Cashmere and TreadMarks use virtual memory to maintain coherence on pages, and both use lazy, multi-writer release consistency. The systems differ dramatically, however, in the mechanisms used to track sharing information and to collect and merge concurrent updates to a page, with the result that Cashmere communicates much more frequently, and at a much finer grain.Our principal conclusion is that low-latency networks make DSM based on fine-grain communication competitive with more coarse-grain approaches, but that further hardware improvements will be needed before such systems can provide consistently superior performance. In our experiments, Cashmere scales slightly better than TreadMarks for applications with false sharing. At the same time, it is severely constrained by limitations of the current Memory Channel hardware. In general, performance is better for TreadMarks.
低延迟、远程内存访问网络中基于虚拟机的共享内存
最近的技术进步已经产生了网络接口,为用户提供对远程机器内存的极低延迟访问。我们研究了这些网络对软件DSM的实施和性能的影响。具体来说,我们比较了两个DSM系统——Cashmere和TreadMarks——在一个由内存通道网络连接的32处理器DEC Alpha集群上。Cashmere和TreadMarks都使用虚拟内存来保持页面的一致性,并且都使用延迟的多写入器发布一致性。然而,在用于跟踪共享信息和收集并合并并发更新到页面的机制上,这两个系统有很大的不同,结果是Cashmere的通信更频繁,而且粒度更细。我们的主要结论是,低延迟网络使基于细粒度通信的DSM与更粗粒度的方法相竞争,但是在这种系统能够提供一致的卓越性能之前,还需要进一步的硬件改进。在我们的实验中,对于有虚假共享的应用程序,Cashmere的表现略好于TreadMarks。同时,它也受到当前内存通道硬件的严重限制。总的来说,TreadMarks的性能更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信