Scalable In-Memory Computing

Alexandru Uta, Andreea Sandu, S. Costache, T. Kielmann
{"title":"Scalable In-Memory Computing","authors":"Alexandru Uta, Andreea Sandu, S. Costache, T. Kielmann","doi":"10.1109/CCGrid.2015.106","DOIUrl":null,"url":null,"abstract":"Data-intensive scientific workflows are composed of many tasks that exhibit data precedence constraints leading to communication schemes expressed by means of intermediate files. In such scenarios, the storage layer is often a bottleneck, limiting overall application scalability, due to large volumes of data being generated during runtime at high I/O rates. To alleviate the storage pressure, applications take advantage of in-memory runtime distributed file systems that act as a fast, distributed cache, which greatly enhances I/O performance.In this paper, we present scalability results for MemFS, a distributed in-memory runtime file system. MemFS takes an opposite approach to data locality, by scattering all data among the nodes, leading to well balanced storage and network traffic, and thus making the system both highly per formant and scalable. Our results show that MemFS is platform independent, performing equally well on both private clusters and commercial clouds. On such platforms, running on up to 1024 cores, MemFS shows excellent horizontal scalability (using more nodes), while the vertical scalability (using more cores per node) is only limited by the network b and with. Further more, for this challenge we show how MemFS is able to scale elastically, at runtime, based on the application storage demands. In our experiments, we have successfully used up to 1TB memory when running a large instance of the Montage workflow.","PeriodicalId":6664,"journal":{"name":"2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","volume":"10 1","pages":"805-810"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCGrid.2015.106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Data-intensive scientific workflows are composed of many tasks that exhibit data precedence constraints, leading to communication schemes expressed by means of intermediate files. In such scenarios, the storage layer is often a bottleneck that limits overall application scalability, due to large volumes of data being generated at high I/O rates during runtime. To alleviate the storage pressure, applications take advantage of in-memory runtime distributed file systems that act as a fast, distributed cache, which greatly enhances I/O performance. In this paper, we present scalability results for MemFS, a distributed in-memory runtime file system. MemFS takes the opposite approach to data locality: it scatters all data among the nodes, leading to well-balanced storage and network traffic, and thus making the system both highly performant and scalable. Our results show that MemFS is platform independent, performing equally well on both private clusters and commercial clouds. On such platforms, running on up to 1024 cores, MemFS shows excellent horizontal scalability (using more nodes), while its vertical scalability (using more cores per node) is limited only by the network bandwidth. Furthermore, for this challenge we show how MemFS is able to scale elastically, at runtime, based on the application's storage demands. In our experiments, we have successfully used up to 1 TB of memory when running a large instance of the Montage workflow.
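To make the scattering idea concrete, below is a minimal Python sketch of hash-based stripe placement. The 1 MiB stripe size, the `place_stripes` helper, and the SHA-1 hash are all illustrative assumptions of this sketch, not MemFS's actual implementation.

```python
# Hypothetical sketch of locality-agnostic data scattering: every stripe of
# every file is hashed to a node, so no node preferentially holds its own
# output. Stripe size and hash choice are assumptions, not MemFS internals.
import hashlib

STRIPE_SIZE = 1 << 20  # assume 1 MiB stripes; the real stripe size may differ

def place_stripes(path, size, nodes):
    """Map each stripe of a file to a node by hashing (path, stripe index).

    Hash-based scattering spreads every file uniformly over all nodes, so
    storage use and network traffic stay balanced regardless of which node
    produced the file -- the opposite of locality-based placement.
    """
    n_stripes = (size + STRIPE_SIZE - 1) // STRIPE_SIZE
    placements = []
    for idx in range(n_stripes):
        digest = hashlib.sha1(f"{path}:{idx}".encode()).digest()
        placements.append((idx, nodes[int.from_bytes(digest[:8], "big") % len(nodes)]))
    return placements

if __name__ == "__main__":
    nodes = [f"node{i}" for i in range(4)]
    # A 5 MiB intermediate file ends up scattered over the whole cluster.
    for idx, node in place_stripes("/scratch/montage/tile_17.fits", 5 * STRIPE_SIZE, nodes):
        print(f"stripe {idx} -> {node}")
```

Note that plain modulo placement, as in this sketch, forces most stripes to move whenever the node set changes; an elastic deployment of the kind the abstract describes would more plausibly use consistent hashing, so that adding or removing nodes at runtime relocates only a small fraction of the data.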