Special Issue on benchmarking, experimentation tools, and reproducible practices for data‐intensive systems from edge to cloud

Lauritz Thamsen, David Bermbach, Demetris Trihinas
{"title":"Special Issue on benchmarking, experimentation tools, and reproducible practices for <scp>data‐intensive</scp> systems from edge to cloud","authors":"Lauritz Thamsen, David Bermbach, Demetris Trihinas","doi":"10.1002/spe.3282","DOIUrl":null,"url":null,"abstract":"As data analytics and machine learning increasingly permeate our cities, factories, and homes, the computing infrastructure for data-intensive systems becomes more challenging. That is, the vision of pervasive, intelligent, and cyber-physical IoT systems will not be realized with centralized cloud resources alone. Such resources are simply too far away from sensor-equipped devices and users, resulting in high latency, bandwidth bottlenecks, and unnecessary energy consumption. In addition, there are often privacy and security requirements that mandate distributed architectures. As a result, new distributed computing paradigms are emerging that promise to bring computing and storage closer to data sources and users. The emerging distributed computing environments of edge and fog computing provide additional resources within mobile networks, ISP infrastructure, and even LEO satellites. These diverse and dynamic computing environments pose significant challenges to the performance, dependability, and efficiency of data-intensive systems running on such infrastructure. At the same time, it is far less clear how to properly benchmark, evaluate, and test the behavior of systems that span IoT devices, edge nodes, and cloud resources. For example, IoT sensor data stream processing systems can be leveraged to continuously optimize the operation of urban infrastructures (such as public transportation systems, water networks, or medical infrastructures). The behavior of such systems must be thoroughly assessed before they can be deployed to edge and fog infrastructure. In addition, these systems must be evaluated reproducibly under the expected computing environment conditions, including variations of those conditions, given the inherently unsteady nature of IoT environments. In addition, there is growing concern about the energy consumption and greenhouse gas emissions of ICT (and especially distributed ML-based applications), which further warrants close examination of the behavior of new data-intensive applications. Despite significant research and development efforts to improve benchmarking, experimentation tools, and reproducible practices for data-intensive systems spanning from the edge to the cloud, more research is urgently needed. We therefore invited high-quality research papers on this topic for this special issue of Software: Practice and Experience, and we were able to select two out of four submissions for this special issue with the help of our reviewers. The first accepted paper is titled “faas-sim: A Trace-Driven Simulation Framework for Serverless Edge Computing Platforms”.1 It is co-authored by Philipp Raith, Thomas Rausch, Alireza Furutanpey, and Schahram Dustdar. The paper presents the design and implementation of a new simulation framework, “faas-sim,” for modeling and evaluating serverless software architectures spanning the edge-cloud continuum based on a scenario description, a given network topology, and workload traces. The new simulator is demonstrated by using it for performance estimation, resource planning, co-simulation, and scientific evaluation. The authors also evaluate faas-sim's network simulation and resource utilization. Furthermore, they highlight traces that come with faas-sim and provide an overview of published research that has used faas-sim. The second accepted paper is titled “Software-in-the-Loop Simulation for Developing and Testing Carbon-Aware Applications”.2 It is co-authored by Philipp Wiesner, Marvin Steinke, Henrik Nickel, Yazan Kitana, and Odej Kao. As an alternative to relying on purely simulated or purely real testbeds, the paper proposes the use of software-in-the-loop simulation and hybrid testbeds for testing carbon-aware software applications in the context of energy simulations. The paper describes the design and implementation of a prototype, “Vessim,” as well as two experiments demonstrating the capabilities and features of the novel tool. In this way, the paper shows how a message broker can reliably and realistically connect currently running applications under test to real-time simulations, while the energy demand is continuously measured or modeled. We are grateful to the Editor-in-Chief of the Journal, Dr. Rajkumar Buyya, for inviting us to organize this special issue. We are also grateful for the valuable support from the administrative office of the journal. In addition, we are very grateful for the thorough and thoughtful reviews provided by our reviewers. Finally, we appreciate the hard work and trust of the authors who submitted papers to our special issue.","PeriodicalId":21899,"journal":{"name":"Software: Practice and Experience","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software: Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/spe.3282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As data analytics and machine learning increasingly permeate our cities, factories, and homes, the computing infrastructure for data-intensive systems becomes more challenging. That is, the vision of pervasive, intelligent, and cyber-physical IoT systems will not be realized with centralized cloud resources alone. Such resources are simply too far away from sensor-equipped devices and users, resulting in high latency, bandwidth bottlenecks, and unnecessary energy consumption. In addition, there are often privacy and security requirements that mandate distributed architectures. As a result, new distributed computing paradigms are emerging that promise to bring computing and storage closer to data sources and users. The emerging distributed computing environments of edge and fog computing provide additional resources within mobile networks, ISP infrastructure, and even LEO satellites. These diverse and dynamic computing environments pose significant challenges to the performance, dependability, and efficiency of data-intensive systems running on such infrastructure. At the same time, it is far less clear how to properly benchmark, evaluate, and test the behavior of systems that span IoT devices, edge nodes, and cloud resources. For example, IoT sensor data stream processing systems can be leveraged to continuously optimize the operation of urban infrastructures (such as public transportation systems, water networks, or medical infrastructures). The behavior of such systems must be thoroughly assessed before they can be deployed to edge and fog infrastructure. In addition, these systems must be evaluated reproducibly under the expected computing environment conditions, including variations of those conditions, given the inherently unsteady nature of IoT environments. In addition, there is growing concern about the energy consumption and greenhouse gas emissions of ICT (and especially distributed ML-based applications), which further warrants close examination of the behavior of new data-intensive applications. Despite significant research and development efforts to improve benchmarking, experimentation tools, and reproducible practices for data-intensive systems spanning from the edge to the cloud, more research is urgently needed. We therefore invited high-quality research papers on this topic for this special issue of Software: Practice and Experience, and we were able to select two out of four submissions for this special issue with the help of our reviewers. The first accepted paper is titled “faas-sim: A Trace-Driven Simulation Framework for Serverless Edge Computing Platforms”.1 It is co-authored by Philipp Raith, Thomas Rausch, Alireza Furutanpey, and Schahram Dustdar. The paper presents the design and implementation of a new simulation framework, “faas-sim,” for modeling and evaluating serverless software architectures spanning the edge-cloud continuum based on a scenario description, a given network topology, and workload traces. The new simulator is demonstrated by using it for performance estimation, resource planning, co-simulation, and scientific evaluation. The authors also evaluate faas-sim's network simulation and resource utilization. Furthermore, they highlight traces that come with faas-sim and provide an overview of published research that has used faas-sim. The second accepted paper is titled “Software-in-the-Loop Simulation for Developing and Testing Carbon-Aware Applications”.2 It is co-authored by Philipp Wiesner, Marvin Steinke, Henrik Nickel, Yazan Kitana, and Odej Kao. As an alternative to relying on purely simulated or purely real testbeds, the paper proposes the use of software-in-the-loop simulation and hybrid testbeds for testing carbon-aware software applications in the context of energy simulations. The paper describes the design and implementation of a prototype, “Vessim,” as well as two experiments demonstrating the capabilities and features of the novel tool. In this way, the paper shows how a message broker can reliably and realistically connect currently running applications under test to real-time simulations, while the energy demand is continuously measured or modeled. We are grateful to the Editor-in-Chief of the Journal, Dr. Rajkumar Buyya, for inviting us to organize this special issue. We are also grateful for the valuable support from the administrative office of the journal. In addition, we are very grateful for the thorough and thoughtful reviews provided by our reviewers. Finally, we appreciate the hard work and trust of the authors who submitted papers to our special issue.
关于从边缘到云的数据密集型系统的基准测试,实验工具和可重复实践的特刊
随着数据分析和机器学习越来越多地渗透到我们的城市、工厂和家庭,数据密集型系统的计算基础设施变得更具挑战性。也就是说,普及的、智能的、网络物理的物联网系统的愿景将不会仅仅通过集中的云资源来实现。这些资源离配备传感器的设备和用户太远,导致高延迟、带宽瓶颈和不必要的能源消耗。此外,通常还存在强制采用分布式体系结构的隐私和安全需求。因此,新的分布式计算范式正在出现,有望使计算和存储更接近数据源和用户。边缘和雾计算的新兴分布式计算环境在移动网络、ISP基础设施甚至LEO卫星中提供了额外的资源。这些多样化和动态的计算环境对运行在这些基础设施上的数据密集型系统的性能、可靠性和效率提出了重大挑战。与此同时,如何正确地对跨物联网设备、边缘节点和云资源的系统进行基准测试、评估和测试,目前还不太清楚。例如,可以利用物联网传感器数据流处理系统来持续优化城市基础设施(如公共交通系统、供水网络或医疗基础设施)的运行。在将此类系统部署到边缘和雾基础设施之前,必须对其行为进行彻底评估。此外,考虑到物联网环境固有的不稳定性,这些系统必须在预期的计算环境条件下进行可重复性评估,包括这些条件的变化。此外,人们越来越关注ICT(尤其是基于分布式机器学习的应用)的能源消耗和温室气体排放,这进一步需要对新的数据密集型应用的行为进行仔细检查。尽管为从边缘到云的数据密集型系统改进基准测试、实验工具和可重复实践做出了重大的研究和开发努力,但迫切需要更多的研究。因此,我们在本期《软件:实践与经验》特刊中邀请了有关该主题的高质量研究论文,并且在审稿人的帮助下,我们能够从四份提交中选择两份。第一篇被接受的论文题为“faas-sim:无服务器边缘计算平台的跟踪驱动仿真框架”本书由Philipp Raith、Thomas Rausch、Alireza Furutanpey和Schahram Dustdar共同撰写。本文介绍了一种新的仿真框架“faas-sim”的设计和实现,用于基于场景描述、给定网络拓扑和工作负载跟踪对跨边缘云连续体的无服务器软件架构进行建模和评估。在性能评估、资源规划、协同仿真和科学评估等方面对该仿真器进行了验证。作者还对faas-sim的网络仿真和资源利用率进行了评价。此外,他们强调了faas-sim带来的痕迹,并提供了使用faas-sim的已发表研究的概述。第二篇论文的题目是“开发和测试碳感知应用的软件在环模拟”本书由Philipp Wiesner、Marvin Steinke、Henrik Nickel、Yazan Kitana和Odej Kao共同撰写。作为依赖纯模拟或纯真实测试平台的替代方案,本文建议使用软件在环模拟和混合测试平台来测试能源模拟背景下的碳感知软件应用程序。本文描述了一个原型“Vessim”的设计和实现,以及两个演示新工具功能和特征的实验。通过这种方式,本文展示了消息代理如何可靠且真实地将当前正在运行的被测应用程序连接到实时模拟中,同时持续测量或建模能源需求。我们感谢《华尔街日报》总编辑Rajkumar Buyya博士邀请我们组织本期特刊。我们也非常感谢杂志社的宝贵支持。此外,我们非常感谢我们的审稿人提供的彻底和周到的审查。最后,我们感谢向我们特刊投稿的作者的辛勤工作和信任。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信