A Tutorial on Bandit Learning and Its Applications in 5G Mobile Edge Computing (Invited Paper)

IF 1.3 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Sige Liu, Peng Cheng, Zhuo Chen, B. Vucetic, Yonghui Li
{"title":"A Tutorial on Bandit Learning and Its Applications in 5G Mobile Edge Computing (Invited Paper)","authors":"Sige Liu, Peng Cheng, Zhuo Chen, B. Vucetic, Yonghui Li","doi":"10.3389/frsip.2022.864392","DOIUrl":null,"url":null,"abstract":"Due to the rapid development of 5G and Internet-of-Things (IoT), various emerging applications have been catalyzed, ranging from face recognition, virtual reality to autonomous driving, demanding ubiquitous computation services beyond the capacity of mobile users (MUs). Mobile cloud computing (MCC) enables MUs to offload their tasks to the remote central cloud with substantial computation and storage, at the expense of long propagation latency. To solve the latency issue, mobile edge computing (MEC) pushes its servers to the edge of the network much closer to the MUs. It jointly considers the communication and computation to optimize network performance by satisfying quality-of-service (QoS) and quality-of-experience (QoE) requirements. However, MEC usually faces a complex combinatorial optimization problem with the complexity of exponential scale. Moreover, many important parameters might be unknown a-priori due to the dynamic nature of the offloading environment and network topology. In this paper, to deal with the above issues, we introduce bandit learning (BL), which enables each agent (MU/server) to make a sequential selection from a set of arms (servers/MUs) and then receive some numerical rewards. BL brings extra benefits to the joint consideration of offloading decision and resource allocation in MEC, including the matched mechanism, situation awareness through learning, and adaptability. We present a brief tutorial on BL of different variations, covering the mathematical formulations and corresponding solutions. Furthermore, we provide several applications of BL in MEC, including system models, problem formulations, proposed algorithms and simulation results. At last, we introduce several challenges and directions in the future research of BL in 5G MEC.","PeriodicalId":93557,"journal":{"name":"Frontiers in signal processing","volume":"126 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2022-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in signal processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frsip.2022.864392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Due to the rapid development of 5G and Internet-of-Things (IoT), various emerging applications have been catalyzed, ranging from face recognition, virtual reality to autonomous driving, demanding ubiquitous computation services beyond the capacity of mobile users (MUs). Mobile cloud computing (MCC) enables MUs to offload their tasks to the remote central cloud with substantial computation and storage, at the expense of long propagation latency. To solve the latency issue, mobile edge computing (MEC) pushes its servers to the edge of the network much closer to the MUs. It jointly considers the communication and computation to optimize network performance by satisfying quality-of-service (QoS) and quality-of-experience (QoE) requirements. However, MEC usually faces a complex combinatorial optimization problem with the complexity of exponential scale. Moreover, many important parameters might be unknown a-priori due to the dynamic nature of the offloading environment and network topology. In this paper, to deal with the above issues, we introduce bandit learning (BL), which enables each agent (MU/server) to make a sequential selection from a set of arms (servers/MUs) and then receive some numerical rewards. BL brings extra benefits to the joint consideration of offloading decision and resource allocation in MEC, including the matched mechanism, situation awareness through learning, and adaptability. We present a brief tutorial on BL of different variations, covering the mathematical formulations and corresponding solutions. Furthermore, we provide several applications of BL in MEC, including system models, problem formulations, proposed algorithms and simulation results. At last, we introduce several challenges and directions in the future research of BL in 5G MEC.
强盗学习及其在5G移动边缘计算中的应用教程(特邀论文)
由于5G和物联网(IoT)的快速发展,从人脸识别、虚拟现实到自动驾驶等各种新兴应用得到了催化,需要超越移动用户(mu)能力的无处不在的计算服务。移动云计算(MCC)使mu能够将其任务卸载到具有大量计算和存储的远程中央云,但代价是较长的传播延迟。为了解决延迟问题,移动边缘计算(MEC)将其服务器推到离mu更近的网络边缘。它综合考虑通信和计算,通过满足服务质量(QoS)和体验质量(QoE)要求来优化网络性能。然而,MEC通常面临一个复杂的组合优化问题,其复杂性为指数尺度。此外,由于卸载环境和网络拓扑结构的动态性,许多重要参数可能是先验未知的。在本文中,为了解决上述问题,我们引入了强盗学习(BL),它使每个代理(MU/服务器)从一组武器(服务器/MU)中进行顺序选择,然后获得一些数值奖励。BL为MEC中卸载决策和资源分配的共同考虑带来了额外的好处,包括匹配机制、通过学习的态势感知和适应性。我们简要介绍了不同类型的BL,包括数学公式和相应的解决方案。此外,我们还提供了BL在MEC中的几个应用,包括系统模型、问题表述、提出的算法和仿真结果。最后,介绍了5G MEC中BL未来研究的几个挑战和方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信