Active workflow system for near real-time extreme-scale science

PPAA '14 Pub Date : 2014-02-16 DOI:10.1145/2567634.2567637

Yanwei Zhang, Qing Liu, S. Klasky, M. Wolf, K. Schwan, G. Eisenhauer, J. Choi, N. Podhorszki

{"title":"Active workflow system for near real-time extreme-scale science","authors":"Yanwei Zhang, Qing Liu, S. Klasky, M. Wolf, K. Schwan, G. Eisenhauer, J. Choi, N. Podhorszki","doi":"10.1145/2567634.2567637","DOIUrl":null,"url":null,"abstract":"In recent years, streaming-based data processing has been gaining substantial traction for dealing with overwhelming data generated by real-time applications, from both enterprise sources and scientific computing. In this work, however, we look at an emerging class of scientific data with Near Real-Time (NRT) requirement, in which data is typically generated in a bursty fashion with the near real-time constraints being applied primarily between bursts, rather than within a stream. A key challenge for this types of data sources is that the processing time per data element is not uniform, and not always feasible to predict. Given the observations on the increasing unpredictability of compute load and system dynamics, this work looks to adapt streaming-based approach to the context of this new class of large experiments and simulations that have complex run-time control and analysis issues.\n In particular, we deploy a novel two-tier scheme for handling the increasing unpredictability of runtime behaviors: Instead of relying on determining what and where to run the scientific workflows beforehand or partial dynamically, the decision will also be adaptively enhanced online according to system runtime status. This is enabled by embedding workflow along with data streams. Specifically, we break data outputs generated from experiments or simulations into multiple self-describing \"chunks\", which we call active data objects. As such, if there is a transient hotspot observed, a data object with unfinished workflow pipeline can break its previous schedule and search for a least loaded location to continue the execution. Our preliminary experiment results based on synthetic workloads demonstrate the proposed active workflow system as a very promising solution by outperforming the state-of-the-art semi-dynamic workflow schedulers with an improved workflow completion time, as well as a good scalability.","PeriodicalId":379963,"journal":{"name":"PPAA '14","volume":"138 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PPAA '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2567634.2567637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

In recent years, streaming-based data processing has been gaining substantial traction for dealing with overwhelming data generated by real-time applications, from both enterprise sources and scientific computing. In this work, however, we look at an emerging class of scientific data with Near Real-Time (NRT) requirement, in which data is typically generated in a bursty fashion with the near real-time constraints being applied primarily between bursts, rather than within a stream. A key challenge for this types of data sources is that the processing time per data element is not uniform, and not always feasible to predict. Given the observations on the increasing unpredictability of compute load and system dynamics, this work looks to adapt streaming-based approach to the context of this new class of large experiments and simulations that have complex run-time control and analysis issues. In particular, we deploy a novel two-tier scheme for handling the increasing unpredictability of runtime behaviors: Instead of relying on determining what and where to run the scientific workflows beforehand or partial dynamically, the decision will also be adaptively enhanced online according to system runtime status. This is enabled by embedding workflow along with data streams. Specifically, we break data outputs generated from experiments or simulations into multiple self-describing "chunks", which we call active data objects. As such, if there is a transient hotspot observed, a data object with unfinished workflow pipeline can break its previous schedule and search for a least loaded location to continue the execution. Our preliminary experiment results based on synthetic workloads demonstrate the proposed active workflow system as a very promising solution by outperforming the state-of-the-art semi-dynamic workflow schedulers with an improved workflow completion time, as well as a good scalability.

查看原文本刊更多论文

近实时极端尺度科学的主动工作流系统

近年来，基于流的数据处理在处理实时应用程序生成的大量数据方面获得了巨大的吸引力，这些数据来自企业资源和科学计算。然而，在这项工作中，我们研究了一类具有近实时(NRT)要求的新兴科学数据，其中数据通常以突发方式生成，近实时约束主要应用于突发之间，而不是在流中。这类数据源面临的一个关键挑战是，每个数据元素的处理时间并不统一，而且并不总是可以预测。鉴于对计算负载和系统动力学日益增长的不可预测性的观察，这项工作希望将基于流的方法适应于这种具有复杂运行时控制和分析问题的新型大型实验和模拟的背景。特别地，我们部署了一种新的两层方案来处理运行时行为日益增加的不可预测性:代替依赖于事先或部分动态地决定运行科学工作流的内容和位置，决策也将根据系统运行时状态自适应地在线增强。这可以通过将工作流与数据流一起嵌入来实现。具体来说，我们将实验或模拟生成的数据输出分解为多个自我描述的“块”，我们称之为活动数据对象。因此，如果观察到一个暂态热点，具有未完成工作流管道的数据对象可以中断其先前的调度，并搜索加载最少的位置以继续执行。基于合成工作负载的初步实验结果表明，主动工作流系统是一种非常有前途的解决方案，其性能优于最先进的半动态工作流调度程序，具有改进的工作流完成时间和良好的可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PPAA '14

自引率

0.00%

发文量