DRLMS: a multipath scheduler based on deep reinforcement learning

International Conference on Algorithms, Microchips and Network Applications Pub Date : 2024-06-08 DOI:10.1117/12.3031955

Mengyang Zhang, Kaiguo Yuan, Xiaoyong Li

引用次数: 0

Abstract

Most current network devices have multiple network interfaces, and multipath transport protocols can utilize multiple network paths (e.g., WiFi and cellular) to improve the performance and reliability of network transmission. The scheduler of the multipath transmission protocol determines the path to which each data packet should be transmitted, and is a key module that affects multipath transmission. However, current multipath schedulers cannot adapt well to various user usage scenarios. In this paper, we propose DRLMS, a deep reinforcement learning based multipath scheduler. DRLMS uses deep reinforcement learning to train neural networks to generate packet scheduling policies. It optimizes the scheduling strategy through feedback to the neural network through the reward function based on the current user usage scenario and QoS. We implement DRLMS in the MPQUIC protocol and compared it with current multipath schedulers. The results show that DRLMS's adaptability to user usage scenarios is significantly outperforms other schedulers.

查看原文本刊更多论文

DRLMS：基于深度强化学习的多路径调度程序

目前大多数网络设备都有多个网络接口，多路径传输协议可以利用多条网络路径（如 WiFi 和蜂窝网络）来提高网络传输的性能和可靠性。多路径传输协议的调度器决定每个数据包应传输到哪条路径，是影响多路径传输的关键模块。然而，目前的多路径调度器不能很好地适应各种用户使用场景。本文提出了基于深度强化学习的多路径调度器 DRLMS。DRLMS 利用深度强化学习训练神经网络来生成数据包调度策略。它根据当前的用户使用场景和服务质量，通过奖励函数向神经网络提供反馈，从而优化调度策略。我们在 MPQUIC 协议中实施了 DRLMS，并将其与当前的多路径调度器进行了比较。结果表明，DRLMS 对用户使用场景的适应性明显优于其他调度器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Algorithms, Microchips and Network Applications

自引率

0.00%

发文量