PONCHO: Dynamic Package Synthesis for Distributed and Serverless Python Applications

Barry Sly-Delgado, Nick Locascio, David Simonetti, B. Wiseman, Benjamín Tovar, D. Thain
Published in: Proceedings of the 2nd Workshop on High Performance Serverless Computing, 2022-06-30
DOI: 10.1145/3526060.3535459 (https://doi.org/10.1145/3526060.3535459)

Abstract

An increasing number of distributed applications operate by dispatching function invocations across the nodes of a distributed system. To operate correctly, the code and data dependencies of each function must be distributed along with its invocations. When applications are translated to run on large-scale distributed systems, managing these dependencies becomes challenging: delivery must scale to thousands of nodes, the dependencies must be consistent across the system, and the method must be usable by an unprivileged developer. As a solution, we present PONCHO, a lightweight Python-based toolkit that allows users to discover, package, and deploy dependencies as an integral part of distributed applications. PONCHO encapsulates a set of commands to be executed within an environment. It offers a lightweight way to create and manage environments, improving the portability and reproducibility of scientific applications. We evaluate PONCHO with real-world applications in the fields of physics, computational chemistry, and hyperparameter optimization. We observe the challenges that arise when creating and distributing an environment and measure the resulting overheads.
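
The "discover" step mentioned in the abstract can be illustrated with a minimal sketch. This is not PONCHO's implementation or API; it only assumes that dependencies can be approximated by statically scanning a function's source for `import` statements. The names `discover_imports`, `to_spec`, and `EXAMPLE_SOURCE`, and the `{"packages": [...]}` layout, are hypothetical and chosen for illustration.

```python
import ast
import sys


def discover_imports(source):
    """Return the top-level module names imported by a piece of Python source."""
    modules = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # "import os.path" depends on the top-level module "os".
                modules.add(alias.name.split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            # Relative imports (level > 0) refer to the application's own
            # package, so they are skipped here.
            if node.level == 0 and node.module:
                modules.add(node.module.split(".")[0])
    return modules


def to_spec(modules):
    """Build a hypothetical environment description, dropping stdlib modules.

    sys.stdlib_module_names exists on Python 3.10+; on older versions this
    sketch falls back to filtering nothing.
    """
    stdlib = getattr(sys, "stdlib_module_names", frozenset())
    return {"packages": sorted(m for m in modules if m not in stdlib)}


# Hypothetical application source used only to exercise the sketch.
EXAMPLE_SOURCE = """
import numpy as np
from scipy.optimize import minimize
import os.path
from . import helpers
"""

if __name__ == "__main__":
    found = discover_imports(EXAMPLE_SOURCE)
    print(sorted(found))
    print(to_spec(found))
```

A static scan like this is only a starting point: an import name does not always match the name of the package that provides it (for example, `sklearn` is provided by `scikit-learn`), and imports performed dynamically at run time are not visible to it.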