ACRE: Actor Critic Reinforcement Learning for Failure-Aware Edge Computing Migrations
Marie Siew, Shikhar Sharma, Carlee Joe-Wong
2023 57th Annual Conference on Information Sciences and Systems (CISS)
Published: 2023-03-22 · DOI: 10.1109/CISS56502.2023.10089694
Citations: 0
Abstract
In edge computing, users' service profiles are migrated in response to user mobility to minimize the user-experienced delay, balanced against the migration cost. Because information on transition probabilities and costs is imperfect, reinforcement learning (RL) is often used to optimize service migration. Nevertheless, current works do not optimize service migration in light of occasional server failures. While server failures are rare, they impact the smooth and safe functioning of latency-sensitive edge computing applications like autonomous driving and real-time obstacle detection, because users can no longer complete their computing jobs. As these failures occur with low probability, it is difficult for RL algorithms, which are data- and experience-driven, to learn a service migration policy that is optimal for both the usual and rare-event scenarios. Therefore, we propose an algorithm, ImACRE, which integrates importance sampling into actor-critic reinforcement learning to learn the optimal service profile and backup placement policy. Our algorithm uses importance sampling to sample rare events in a simulator at a rate proportional to their contribution to system costs, while balancing the trade-off between delay and migration costs against failure, backup placement, and backup migration costs. We use trace-driven experiments to show that our algorithm reduces costs in the event of failures.
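The core idea behind the abstract's importance-sampling step can be illustrated with a minimal sketch: failures are sampled in a simulator at an inflated rate, and each sampled transition is reweighted by the likelihood ratio between the true and the simulated failure probabilities, so cost estimates remain unbiased. The function name, the specific probabilities, and the two-outcome cost model below are illustrative assumptions, not the paper's actual simulator or ImACRE itself.

```python
import random

def expected_cost_importance_sampled(p_true, p_sim, failure_cost,
                                     normal_cost, n=100_000, seed=0):
    """Estimate the expected per-step cost under the true (rare) failure
    probability p_true, while drawing failures at the inflated rate p_sim.
    Each sample is reweighted by the importance weight (likelihood ratio),
    keeping the estimator unbiased. Toy two-outcome model, not the paper's."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        failed = rng.random() < p_sim          # sample at the inflated rate
        if failed:
            w = p_true / p_sim                 # importance weight for failures
            total += w * failure_cost
        else:
            w = (1 - p_true) / (1 - p_sim)     # weight for normal operation
            total += w * normal_cost
    return total / n

# Rare-event setting: true failure probability 0.001, simulated at 0.2,
# so roughly 20% of samples exercise the failure branch instead of ~0.1%.
est = expected_cost_importance_sampled(p_true=0.001, p_sim=0.2,
                                       failure_cost=100.0, normal_cost=1.0)
analytic = 0.001 * 100.0 + 0.999 * 1.0  # exact expected cost: 1.099
```

In an actor-critic setting, the same likelihood-ratio weight would multiply the TD error (or the sampled cost) of each failure transition, which is how oversampling rare failures can be corrected so the learned policy still optimizes the true expected cost.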