Productive Programming of Distributed Systems with the SHAD C++ Library

Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing Pub Date : 2020-06-21 DOI:10.1145/3431379.3462765

Vito Giovanni Castellana, Marco Minutoli

{"title":"Productive Programming of Distributed Systems with the SHAD C++ Library","authors":"Vito Giovanni Castellana, Marco Minutoli","doi":"10.1145/3431379.3462765","DOIUrl":null,"url":null,"abstract":"High-performance computing (HPC) is often perceived as a matter of making large-scale systems (e.g., clusters) run as fast as possible, regardless the required programming effort. However, the idea of \"bringing HPC to the masses\" has recently emerged. Inspired by this vision, we have designed SHAD, the Scalable High-performance Algorithms and Data-structures library [1][6]. SHAD is open source software, written in C++, for C++ developers. Unlike other HPC libraries for distributed systems, which rely on SPMD models, SHAD adopts a shared-memory programming abstraction, to make C++ programmers feel at home. Underneath, SHAD manages tasking and data-movements, moving the computation where data resides and taking advantage of asynchrony to tolerate network latency. At the bottom of his stack, SHAD can interface with multiple runtime systems: this not only improves developer's productivity, by hiding the complexity of such software and of the underlying hardware, but also greatly enhance code portability. Thanks to its abstraction layers, SHAD can indeed target different systems, ranging from laptops to HPC clusters, without any need for modifying the user-level code.We have prototyped and open-sourced the implementation of (a subset of) the C++ standard library (STL) targeting multi-node HPC clusters. Our work allows plain STL-based C++ code to scale on HPC systems, with no need for rewriting the code to exploit the complex hardware. SHAD is available under Apache v2 License at https://github.com/pnnl/SHAD. In this paper we overview the design of the SHAD library, depicting its main components: runtime systems abstractions for tasking; parallel and distributed data-structures; STL-compliant interfaces and algorithms.","PeriodicalId":343991,"journal":{"name":"Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3431379.3462765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

High-performance computing (HPC) is often perceived as a matter of making large-scale systems (e.g., clusters) run as fast as possible, regardless the required programming effort. However, the idea of "bringing HPC to the masses" has recently emerged. Inspired by this vision, we have designed SHAD, the Scalable High-performance Algorithms and Data-structures library [1][6]. SHAD is open source software, written in C++, for C++ developers. Unlike other HPC libraries for distributed systems, which rely on SPMD models, SHAD adopts a shared-memory programming abstraction, to make C++ programmers feel at home. Underneath, SHAD manages tasking and data-movements, moving the computation where data resides and taking advantage of asynchrony to tolerate network latency. At the bottom of his stack, SHAD can interface with multiple runtime systems: this not only improves developer's productivity, by hiding the complexity of such software and of the underlying hardware, but also greatly enhance code portability. Thanks to its abstraction layers, SHAD can indeed target different systems, ranging from laptops to HPC clusters, without any need for modifying the user-level code.We have prototyped and open-sourced the implementation of (a subset of) the C++ standard library (STL) targeting multi-node HPC clusters. Our work allows plain STL-based C++ code to scale on HPC systems, with no need for rewriting the code to exploit the complex hardware. SHAD is available under Apache v2 License at https://github.com/pnnl/SHAD. In this paper we overview the design of the SHAD library, depicting its main components: runtime systems abstractions for tasking; parallel and distributed data-structures; STL-compliant interfaces and algorithms.

查看原文本刊更多论文

用SHAD c++库实现分布式系统的生产性编程

高性能计算(HPC)通常被认为是使大型系统(例如集群)尽可能快地运行，而不考虑所需的编程工作。然而，最近出现了“将高性能计算推向大众”的想法。在这一愿景的启发下，我们设计了SHAD，即可扩展的高性能算法和数据结构库b[1][6]。SHAD是开源软件，用c++编写，面向c++开发人员。与其他依赖SPMD模型的分布式系统HPC库不同，SHAD采用了共享内存编程抽象，使c++程序员感到自在。在底层，SHAD管理任务和数据移动，将计算移到数据所在的位置，并利用异步来容忍网络延迟。在他的堆栈的底部，SHAD可以与多个运行时系统接口:这不仅通过隐藏此类软件和底层硬件的复杂性来提高开发人员的生产力，而且还大大增强了代码的可移植性。由于它的抽象层，SHAD确实可以针对不同的系统，从笔记本电脑到HPC集群，而不需要修改用户级代码。我们已经对针对多节点HPC集群的c++标准库(STL)的实现(一个子集)进行了原型化和开源。我们的工作允许基于stl的普通c++代码在HPC系统上扩展，而不需要重写代码来利用复杂的硬件。SHAD在Apache v2许可证下可在https://github.com/pnnl/SHAD上获得。在本文中，我们概述了SHAD库的设计，描述了它的主要组成部分:用于任务的运行时系统抽象;并行和分布式数据结构;兼容stl的接口和算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing

自引率

0.00%

发文量