DRLMS: a multipath scheduler based on deep reinforcement learning

Mengyang Zhang, Kaiguo Yuan, Xiaoyong Li
{"title":"DRLMS: a multipath scheduler based on deep reinforcement learning","authors":"Mengyang Zhang, Kaiguo Yuan, Xiaoyong Li","doi":"10.1117/12.3031955","DOIUrl":null,"url":null,"abstract":"Most current network devices have multiple network interfaces, and multipath transport protocols can utilize multiple network paths (e.g., WiFi and cellular) to improve the performance and reliability of network transmission. The scheduler of the multipath transmission protocol determines the path to which each data packet should be transmitted, and is a key module that affects multipath transmission. However, current multipath schedulers cannot adapt well to various user usage scenarios. In this paper, we propose DRLMS, a deep reinforcement learning based multipath scheduler. DRLMS uses deep reinforcement learning to train neural networks to generate packet scheduling policies. It optimizes the scheduling strategy through feedback to the neural network through the reward function based on the current user usage scenario and QoS. We implement DRLMS in the MPQUIC protocol and compared it with current multipath schedulers. The results show that DRLMS's adaptability to user usage scenarios is significantly outperforms other schedulers.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Algorithms, Microchips and Network Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3031955","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Most current network devices have multiple network interfaces, and multipath transport protocols can utilize multiple network paths (e.g., WiFi and cellular) to improve the performance and reliability of network transmission. The scheduler of the multipath transmission protocol determines the path to which each data packet should be transmitted, and is a key module that affects multipath transmission. However, current multipath schedulers cannot adapt well to various user usage scenarios. In this paper, we propose DRLMS, a deep reinforcement learning based multipath scheduler. DRLMS uses deep reinforcement learning to train neural networks to generate packet scheduling policies. It optimizes the scheduling strategy through feedback to the neural network through the reward function based on the current user usage scenario and QoS. We implement DRLMS in the MPQUIC protocol and compared it with current multipath schedulers. The results show that DRLMS's adaptability to user usage scenarios is significantly outperforms other schedulers.
DRLMS:基于深度强化学习的多路径调度程序
目前大多数网络设备都有多个网络接口,多路径传输协议可以利用多条网络路径(如 WiFi 和蜂窝网络)来提高网络传输的性能和可靠性。多路径传输协议的调度器决定每个数据包应传输到哪条路径,是影响多路径传输的关键模块。然而,目前的多路径调度器不能很好地适应各种用户使用场景。本文提出了基于深度强化学习的多路径调度器 DRLMS。DRLMS 利用深度强化学习训练神经网络来生成数据包调度策略。它根据当前的用户使用场景和服务质量,通过奖励函数向神经网络提供反馈,从而优化调度策略。我们在 MPQUIC 协议中实施了 DRLMS,并将其与当前的多路径调度器进行了比较。结果表明,DRLMS 对用户使用场景的适应性明显优于其他调度器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信