基于深度强化学习的无线网络优化:比较研究

IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) Pub Date : 2020-07-01 DOI:10.1109/infocomwkshps50562.2020.9162925

Kun Yang, Cong Shen, Tie Liu

{"title":"基于深度强化学习的无线网络优化:比较研究","authors":"Kun Yang, Cong Shen, Tie Liu","doi":"10.1109/infocomwkshps50562.2020.9162925","DOIUrl":null,"url":null,"abstract":"There is a growing interest in applying deep reinforcement learning (DRL) methods to optimizing the operation of wireless networks. In this paper, we compare three state of the art DRL methods, Deep Deterministic Policy Gradient (DDPG), Neural Episodic Control (NEC), and Variance Based Control (VBC), for the application of wireless network optimization. We describe how the general network optimization problem is formulated as RL and give details of the three methods in the context of wireless networking. Extensive experiments using a real-world network operation dataset are carried out, and the performance in terms of improving rate and convergence speed for these popular DRL methods is compared. We note that while DDPG and VBC demonstrate good potential in automating wireless network optimization, NEC has a much improved convergence rate but suffers from the limited action space and does not perform competitively in its current form.","PeriodicalId":104136,"journal":{"name":"IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Deep Reinforcement Learning based Wireless Network Optimization: A Comparative Study\",\"authors\":\"Kun Yang, Cong Shen, Tie Liu\",\"doi\":\"10.1109/infocomwkshps50562.2020.9162925\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There is a growing interest in applying deep reinforcement learning (DRL) methods to optimizing the operation of wireless networks. In this paper, we compare three state of the art DRL methods, Deep Deterministic Policy Gradient (DDPG), Neural Episodic Control (NEC), and Variance Based Control (VBC), for the application of wireless network optimization. We describe how the general network optimization problem is formulated as RL and give details of the three methods in the context of wireless networking. Extensive experiments using a real-world network operation dataset are carried out, and the performance in terms of improving rate and convergence speed for these popular DRL methods is compared. We note that while DDPG and VBC demonstrate good potential in automating wireless network optimization, NEC has a much improved convergence rate but suffers from the limited action space and does not perform competitively in its current form.\",\"PeriodicalId\":104136,\"journal\":{\"name\":\"IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/infocomwkshps50562.2020.9162925\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/infocomwkshps50562.2020.9162925","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

摘要

人们对应用深度强化学习(DRL)方法来优化无线网络的运行越来越感兴趣。在本文中，我们比较了三种最先进的DRL方法，深度确定性策略梯度(DDPG)，神经情景控制(NEC)和基于方差的控制(VBC)，用于无线网络优化的应用。我们描述了如何将一般网络优化问题表述为RL，并详细介绍了无线网络环境下的三种方法。利用实际网络运行数据集进行了大量的实验，比较了这些流行的DRL方法在提高速率和收敛速度方面的性能。我们注意到，虽然DDPG和VBC在自动化无线网络优化方面表现出良好的潜力，但NEC的收敛速度大大提高，但受到有限的操作空间的影响，并且在目前的形式下表现不具有竞争力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep Reinforcement Learning based Wireless Network Optimization: A Comparative Study

There is a growing interest in applying deep reinforcement learning (DRL) methods to optimizing the operation of wireless networks. In this paper, we compare three state of the art DRL methods, Deep Deterministic Policy Gradient (DDPG), Neural Episodic Control (NEC), and Variance Based Control (VBC), for the application of wireless network optimization. We describe how the general network optimization problem is formulated as RL and give details of the three methods in the context of wireless networking. Extensive experiments using a real-world network operation dataset are carried out, and the performance in terms of improving rate and convergence speed for these popular DRL methods is compared. We note that while DDPG and VBC demonstrate good potential in automating wireless network optimization, NEC has a much improved convergence rate but suffers from the limited action space and does not perform competitively in its current form.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)

自引率

0.00%

发文量