针对网络拥塞的数据包路由:一种深度多智能体强化学习方法

2020 International Conference on Computing, Networking and Communications (ICNC) Pub Date : 2020-02-01 DOI:10.1109/ICNC47757.2020.9049759

Ruijin Ding, Yuwen Yang, Jun Liu, Hongyan Li, F. Gao

{"title":"针对网络拥塞的数据包路由:一种深度多智能体强化学习方法","authors":"Ruijin Ding, Yuwen Yang, Jun Liu, Hongyan Li, F. Gao","doi":"10.1109/ICNC47757.2020.9049759","DOIUrl":null,"url":null,"abstract":"The continuous growth of the network data would lead to the increased network congestion and the throughput decline. In this paper, we investigate the packet routing problem based on deep multi-agent reinforcement learning, where each router chooses the next hop router by itself intelligently. We design the modified deep Q-network in each router to evaluate the neighbor routers. The routers, each acting as an agent, choose the next hop router based on their local observation. Then they transfer the packets to the chosen routers and receive the reward and the observation of the next hop routers. Using their experience, the routers learn to improve the packet routing strategy by updating their Q-networks. We demonstrate that with proper reward set and training mechanism, the routers in the network can work in a distributed way to reduce the computational complexity compared with the single-agent reinforcement learning based algorithm. And the proposed algorithm can further reduce the congestion probability and improve the network performance.","PeriodicalId":437689,"journal":{"name":"2020 International Conference on Computing, Networking and Communications (ICNC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Packet Routing Against Network Congestion: A Deep Multi-agent Reinforcement Learning Approach\",\"authors\":\"Ruijin Ding, Yuwen Yang, Jun Liu, Hongyan Li, F. Gao\",\"doi\":\"10.1109/ICNC47757.2020.9049759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The continuous growth of the network data would lead to the increased network congestion and the throughput decline. In this paper, we investigate the packet routing problem based on deep multi-agent reinforcement learning, where each router chooses the next hop router by itself intelligently. We design the modified deep Q-network in each router to evaluate the neighbor routers. The routers, each acting as an agent, choose the next hop router based on their local observation. Then they transfer the packets to the chosen routers and receive the reward and the observation of the next hop routers. Using their experience, the routers learn to improve the packet routing strategy by updating their Q-networks. We demonstrate that with proper reward set and training mechanism, the routers in the network can work in a distributed way to reduce the computational complexity compared with the single-agent reinforcement learning based algorithm. And the proposed algorithm can further reduce the congestion probability and improve the network performance.\",\"PeriodicalId\":437689,\"journal\":{\"name\":\"2020 International Conference on Computing, Networking and Communications (ICNC)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Computing, Networking and Communications (ICNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNC47757.2020.9049759\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computing, Networking and Communications (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNC47757.2020.9049759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

网络数据的不断增长将导致网络拥塞加剧，吞吐量下降。本文研究了基于深度多智能体强化学习的分组路由问题，其中每个路由器自行智能地选择下一跳路由器。我们在每台路由器上设计了改进的深度q网络来评估相邻路由器。每个路由器作为一个代理，根据它们的本地观察选择下一跳路由器。然后，它们将数据包传输到选定的路由器，并接受奖励和下一跳路由器的观察。路由器利用它们的经验，通过更新它们的q网络来学习改进分组路由策略。研究表明，与基于单智能体的强化学习算法相比，通过适当的奖励集和训练机制，网络中的路由器可以以分布式的方式工作，从而降低了计算复杂度。该算法可以进一步降低拥塞概率，提高网络性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Packet Routing Against Network Congestion: A Deep Multi-agent Reinforcement Learning Approach

The continuous growth of the network data would lead to the increased network congestion and the throughput decline. In this paper, we investigate the packet routing problem based on deep multi-agent reinforcement learning, where each router chooses the next hop router by itself intelligently. We design the modified deep Q-network in each router to evaluate the neighbor routers. The routers, each acting as an agent, choose the next hop router based on their local observation. Then they transfer the packets to the chosen routers and receive the reward and the observation of the next hop routers. Using their experience, the routers learn to improve the packet routing strategy by updating their Q-networks. We demonstrate that with proper reward set and training mechanism, the routers in the network can work in a distributed way to reduce the computational complexity compared with the single-agent reinforcement learning based algorithm. And the proposed algorithm can further reduce the congestion probability and improve the network performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Conference on Computing, Networking and Communications (ICNC)

自引率

0.00%

发文量