An Automated Pipeline for Advanced Fault Tolerance in Edge Computing Infrastructures

Proceedings of the 2nd Workshop on Flexible Resource and Application Management on the Edge Pub Date : 2022-07-01 DOI:10.1145/3526059.3533623

T. Theodoropoulos, Antonios Makris, John Violos, K. Tserpes

{"title":"An Automated Pipeline for Advanced Fault Tolerance in Edge Computing Infrastructures","authors":"T. Theodoropoulos, Antonios Makris, John Violos, K. Tserpes","doi":"10.1145/3526059.3533623","DOIUrl":null,"url":null,"abstract":"The very fabric of Edge Computing is intertwined with the necessity to be able to orchestrate and manage a huge number of heterogeneous computational resources. On top of that, the rather demanding Quality of Service (QoS) requirements of Internet of Things (IoT) applications that run on these resources, dictate that it is essential to establish robust Fault Tolerance mechanisms. These mechanisms should be able to guarantee that the requirements will be upheld regardless of any potential changes in task production rate. To that end, we suggest an Automated Pipeline for Advanced Fault Tolerance (APAFT) that consists of various components that are designed to operate as functional blocks of an automated closed-control loop. Furthermore, the suggested pipeline is able to carry out the various Horizontal Scaling operations in a proactive manner. These Proactive Scaling capabilities are achieved via the use of a dedicated Deep Learning (DL)-based component that is able to perform multi-step prediction. Our work aims to introduce a number of mechanisms that are able to leverage the benefits that are provided by the multi-step format in a more refined manner. Having access to information regarding multiple future instances allows us to design automated resource orchestration strategies that cater to the specific characteristics of each type of computational node that is part of the Edge Infrastructure.","PeriodicalId":351705,"journal":{"name":"Proceedings of the 2nd Workshop on Flexible Resource and Application Management on the Edge","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd Workshop on Flexible Resource and Application Management on the Edge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3526059.3533623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

The very fabric of Edge Computing is intertwined with the necessity to be able to orchestrate and manage a huge number of heterogeneous computational resources. On top of that, the rather demanding Quality of Service (QoS) requirements of Internet of Things (IoT) applications that run on these resources, dictate that it is essential to establish robust Fault Tolerance mechanisms. These mechanisms should be able to guarantee that the requirements will be upheld regardless of any potential changes in task production rate. To that end, we suggest an Automated Pipeline for Advanced Fault Tolerance (APAFT) that consists of various components that are designed to operate as functional blocks of an automated closed-control loop. Furthermore, the suggested pipeline is able to carry out the various Horizontal Scaling operations in a proactive manner. These Proactive Scaling capabilities are achieved via the use of a dedicated Deep Learning (DL)-based component that is able to perform multi-step prediction. Our work aims to introduce a number of mechanisms that are able to leverage the benefits that are provided by the multi-step format in a more refined manner. Having access to information regarding multiple future instances allows us to design automated resource orchestration strategies that cater to the specific characteristics of each type of computational node that is part of the Edge Infrastructure.

查看原文本刊更多论文

边缘计算基础设施中高级容错的自动化流水线

边缘计算的结构与协调和管理大量异构计算资源的必要性交织在一起。最重要的是，运行在这些资源上的物联网(IoT)应用程序对服务质量(QoS)的要求相当高，这决定了建立健壮的容错机制至关重要。这些机制应该能够保证无论任务产生率有任何潜在的变化，需求都能得到维护。为此，我们建议采用高级容错自动化管道(APAFT)，该管道由各种组件组成，这些组件被设计为自动闭环控制回路的功能块。此外，建议的管道能够以主动的方式执行各种水平缩放操作。这些主动扩展功能是通过使用专用的基于深度学习(DL)的组件来实现的，该组件能够执行多步预测。我们的工作旨在引入一些机制，这些机制能够以更精细的方式利用多步骤格式所提供的好处。通过访问有关多个未来实例的信息，我们可以设计自动化的资源编排策略，以满足边缘基础设施中每种类型的计算节点的特定特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2nd Workshop on Flexible Resource and Application Management on the Edge

自引率

0.00%

发文量