Exascale的现场工作流程:系统软件的救援

Matthieu Dreher, Swann Perarnau, T. Peterka, K. Iskra, P. Beckman
{"title":"Exascale的现场工作流程:系统软件的救援","authors":"Matthieu Dreher, Swann Perarnau, T. Peterka, K. Iskra, P. Beckman","doi":"10.1145/3144769.3144774","DOIUrl":null,"url":null,"abstract":"Implementing an in situ workflow involves several challenges related to data placement, task scheduling, efficient communications, scalability, and reliability. Most of the current implementations provide reasonably performant solutions to these issues by focusing on high-performance communications and low-overhead execution models at the cost of reliability and flexibility. One of the key design choices in such infrastructures is between providing a single-program, integrated environment or a multiple-program, connected environment, both solutions having their own strengths and weaknesses. While these approaches might be appropriate for current production systems, the expected characteristics of exascale machines will shift current priorities. After a survey of the trade-offs and challenges of integrated and connected in situ workflow solutions available today, we discuss in this paper how exascale systems will impact those designs. In particular, we identify missing features of current system-level software required for the evolution of in situ workflows toward exascale and how system software innovations from the Argo Exascale Computing Project can help address those challenges.","PeriodicalId":107517,"journal":{"name":"Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"In Situ Workflows at Exascale: System Software to the Rescue\",\"authors\":\"Matthieu Dreher, Swann Perarnau, T. Peterka, K. Iskra, P. Beckman\",\"doi\":\"10.1145/3144769.3144774\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Implementing an in situ workflow involves several challenges related to data placement, task scheduling, efficient communications, scalability, and reliability. Most of the current implementations provide reasonably performant solutions to these issues by focusing on high-performance communications and low-overhead execution models at the cost of reliability and flexibility. One of the key design choices in such infrastructures is between providing a single-program, integrated environment or a multiple-program, connected environment, both solutions having their own strengths and weaknesses. While these approaches might be appropriate for current production systems, the expected characteristics of exascale machines will shift current priorities. After a survey of the trade-offs and challenges of integrated and connected in situ workflow solutions available today, we discuss in this paper how exascale systems will impact those designs. In particular, we identify missing features of current system-level software required for the evolution of in situ workflows toward exascale and how system software innovations from the Argo Exascale Computing Project can help address those challenges.\",\"PeriodicalId\":107517,\"journal\":{\"name\":\"Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization\",\"volume\":\"85 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3144769.3144774\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3144769.3144774","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

实现原位工作流涉及到与数据放置、任务调度、高效通信、可伸缩性和可靠性相关的几个挑战。当前的大多数实现都以牺牲可靠性和灵活性为代价,专注于高性能通信和低开销执行模型,从而为这些问题提供了性能合理的解决方案。在这种基础设施中,关键的设计选择之一是提供单程序集成环境还是提供多程序连接环境,这两种解决方案都有各自的优缺点。虽然这些方法可能适用于当前的生产系统,但百亿亿级机器的预期特性将改变当前的优先级。在调查了当今集成和连接的现场工作流解决方案的利弊和挑战之后,我们在本文中讨论了百亿亿级系统将如何影响这些设计。特别地,我们确定了当前系统级软件的缺失特性,这些特性是现场工作流向exascale发展所必需的,以及来自Argo exascale计算项目的系统软件创新如何帮助解决这些挑战。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
In Situ Workflows at Exascale: System Software to the Rescue
Implementing an in situ workflow involves several challenges related to data placement, task scheduling, efficient communications, scalability, and reliability. Most of the current implementations provide reasonably performant solutions to these issues by focusing on high-performance communications and low-overhead execution models at the cost of reliability and flexibility. One of the key design choices in such infrastructures is between providing a single-program, integrated environment or a multiple-program, connected environment, both solutions having their own strengths and weaknesses. While these approaches might be appropriate for current production systems, the expected characteristics of exascale machines will shift current priorities. After a survey of the trade-offs and challenges of integrated and connected in situ workflow solutions available today, we discuss in this paper how exascale systems will impact those designs. In particular, we identify missing features of current system-level software required for the evolution of in situ workflows toward exascale and how system software innovations from the Argo Exascale Computing Project can help address those challenges.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信