CNDAS-WF: Cloud Native Data Analysis System Based On Workflow Engine

Xinshi Zhou, Yuxuan Wu
{"title":"CNDAS-WF: Cloud Native Data Analysis System Based On Workflow Engine","authors":"Xinshi Zhou, Yuxuan Wu","doi":"10.1145/3584871.3584891","DOIUrl":null,"url":null,"abstract":"With the development of modern big data technology, data size in daily life is expanding rapidly and data relationship is more complex. However, the requirements of data analysis for different resources continuous to surging. Therefore, how to handle a large number of data analysis tasks with complex dependencies efficiently become the challenge. In this paper, we design and implement a cloud native data analysis system based on workflow engine. The system arranges the data analysis tasks, which deployed by containers, with dependency through the workflow engine based on cloud native technology. Flexibility of container cloud makes data analysis procedure effective and efficient. In addition, we designed a workflow engine and an operation and maintenance subsystem for overall system platform anomaly detection. Finally, we verify the effectiveness and efficiency of the system through scientific workflow data. The cloud native data analysis system based on workflow engine has passed all tests and has been applied in small and medium-sized enterprises.","PeriodicalId":173315,"journal":{"name":"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3584871.3584891","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the development of modern big data technology, data size in daily life is expanding rapidly and data relationship is more complex. However, the requirements of data analysis for different resources continuous to surging. Therefore, how to handle a large number of data analysis tasks with complex dependencies efficiently become the challenge. In this paper, we design and implement a cloud native data analysis system based on workflow engine. The system arranges the data analysis tasks, which deployed by containers, with dependency through the workflow engine based on cloud native technology. Flexibility of container cloud makes data analysis procedure effective and efficient. In addition, we designed a workflow engine and an operation and maintenance subsystem for overall system platform anomaly detection. Finally, we verify the effectiveness and efficiency of the system through scientific workflow data. The cloud native data analysis system based on workflow engine has passed all tests and has been applied in small and medium-sized enterprises.
CNDAS-WF:基于工作流引擎的云原生数据分析系统
随着现代大数据技术的发展,日常生活中的数据量迅速扩大,数据关系更加复杂。然而,对不同资源的数据分析需求不断激增。因此,如何高效地处理大量具有复杂依赖关系的数据分析任务成为一个挑战。本文设计并实现了一个基于工作流引擎的云原生数据分析系统。系统通过基于云原生技术的工作流引擎,对容器部署的数据分析任务进行依赖安排。容器云的灵活性使数据分析过程有效和高效。此外,我们还设计了工作流引擎和运维子系统,用于整个系统平台的异常检测。最后,通过科学的工作流数据验证了系统的有效性和高效性。基于工作流引擎的云原生数据分析系统已通过各项测试,并已在中小企业中得到应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信