Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet of Vehicles

Kai Jiang, Huan Zhou, Deze Zeng, Jie Wu
{"title":"Multi-Agent Reinforcement Learning for Cooperative Edge Caching in Internet of Vehicles","authors":"Kai Jiang, Huan Zhou, Deze Zeng, Jie Wu","doi":"10.1109/MASS50613.2020.00062","DOIUrl":null,"url":null,"abstract":"Edge caching has been emerged as a promising solution to alleviate the redundant traffic and the content access latency in the future Internet of Vehicles (IoVs). Several Reinforcement Learning (RL) based edge caching methods have been proposed to improve the cache utilization and reduce the backhaul traffic load. However, they can only obtain the local sub-optimal solution, as they neglect the influence of environment by other agents. In this paper, we investigate the edge caching strategy with consideration of the content delivery and cache replacement by exploiting the distributed Multi-Agent Reinforcement Learning (MARL). We first propose a hierarchical edge caching architecture for IoVs and formulate the corresponding problem with the objective to minimize the long-term cost of content delivery in the system. Then, we extend the Markov Decision Process (MDP) in the single agent RL to the multi-agent system, and propose a distributed MARL based edge caching algorithm to tackle the optimization problem. Finally, extensive simulations are conducted to evaluate the performance of the proposed distributed MARL based edge caching method. The simulation results show that the proposed MARL based edge caching method significantly outperforms other benchmark methods in terms of the total content access cost, edge hit rate and average delay. 
Especially, our proposed method greatly reduces an average of 32% total content access cost compared with the conventional RL based edge caching methods.","PeriodicalId":105795,"journal":{"name":"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 17th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASS50613.2020.00062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

Edge caching has emerged as a promising solution to alleviate redundant traffic and content access latency in the future Internet of Vehicles (IoV). Several Reinforcement Learning (RL) based edge caching methods have been proposed to improve cache utilization and reduce the backhaul traffic load. However, they can only obtain locally sub-optimal solutions, as they neglect the influence of other agents on the environment. In this paper, we investigate an edge caching strategy that jointly considers content delivery and cache replacement by exploiting distributed Multi-Agent Reinforcement Learning (MARL). We first propose a hierarchical edge caching architecture for IoV and formulate the corresponding problem with the objective of minimizing the long-term cost of content delivery in the system. Then, we extend the Markov Decision Process (MDP) of single-agent RL to the multi-agent setting, and propose a distributed MARL based edge caching algorithm to tackle the optimization problem. Finally, extensive simulations are conducted to evaluate the performance of the proposed distributed MARL based edge caching method. The simulation results show that it significantly outperforms other benchmark methods in terms of total content access cost, edge hit rate, and average delay. In particular, our proposed method reduces the total content access cost by an average of 32% compared with conventional RL based edge caching methods.
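The paper's exact MARL formulation is not given in this abstract. As a rough illustration of the general idea it describes — each edge node running its own RL agent that learns a cache replacement policy from rewards tied to content access cost — the following sketch uses independent tabular Q-learners. All names, the state/action simplification (action = which cached item to evict on a miss), and the reward values are assumptions for illustration, not the authors' algorithm:

```python
import random
from collections import defaultdict

class EdgeAgent:
    """Independent Q-learner for one edge node's cache replacement.

    Illustrative only: in a distributed MARL deployment, one such agent
    would run per edge node, each observing its own local requests.
    """
    def __init__(self, cache_size, alpha=0.1, gamma=0.9, eps=0.1):
        self.cache = set(range(cache_size))     # start with contents 0..C-1 cached
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        self.q = defaultdict(float)             # Q[(requested content, evict candidate)]

    def act(self, request):
        """On a cache miss, choose which cached item to evict (epsilon-greedy).
        Returns None on a hit (no replacement decision needed)."""
        if request in self.cache:
            return None
        candidates = list(self.cache)
        if random.random() < self.eps:
            return random.choice(candidates)
        return max(candidates, key=lambda c: self.q[(request, c)])

    def update(self, request, evicted, reward):
        """One-step Q-learning update after the replacement was applied."""
        best_next = max(self.q[(request, c)] for c in self.cache)
        key = (request, evicted)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])

def simulate(agent, requests, miss_cost=-1.0):
    """Drive one agent through a request trace; return its edge hit rate."""
    hits = 0
    for req in requests:
        evicted = agent.act(req)
        if evicted is None:
            hits += 1                            # served locally, no cost incurred
        else:
            agent.cache.discard(evicted)         # replace evicted item with the
            agent.cache.add(req)                 # newly fetched content
            agent.update(req, evicted, miss_cost)
    return hits / len(requests)
```

The negative reward on a miss stands in for the content access cost the paper minimizes; the paper's distributed MARL method would additionally coordinate agents across the hierarchical architecture rather than learning fully independently.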