Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning

Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu
{"title":"Uncertainty-aware Weighted Fair Queueing for Routers Based on Deep Reinforcement Learning","authors":"Pengyue Wang, Zhaoyu Jiang, Meiyu Qi, Longfei Dai, Huiying Xu","doi":"10.1109/ICECE54449.2021.9674580","DOIUrl":null,"url":null,"abstract":"In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.","PeriodicalId":166178,"journal":{"name":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Electronics and Communication Engineering (ICECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECE54449.2021.9674580","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In current computer communication networks, the increasing packet loss and delay caused by the increasing traffic becomes the bottleneck for the desired Quality of Service (QoS). Weighted Fair Queueing can be used to provide differentiated services according to the Service Level Agreement (SLA) associated with each packet. However, due to inaccurate measurements of queue usage, drop rate and delay in real routers, and the intrinsic property of a real network system that there will always be some unpredictable traffic patterns, current methods for WFQ updating can be improved and extended further. In this work, an uncertainty-aware soft actor-critic agent is introduced. First, the learned weights updating strategy is a maximum entropy policy, which is robust under estimation and model error. Second, the technique of model uncertainty estimation is adopted into the agent so that it is capable of detecting novel states that are unseen during the training period, which facilitates a strategy switching framework. The proposed algorithm shows the potential of using reinforcement learning for WFQ weights updating and is compatible with existing techniques by monitoring the model uncertainty, which makes a more robust and stable system. The benefits of applying the proposed algorithm is validated through the simulation studies, showing a promising direction for further exploration.
基于深度强化学习的不确定性感知加权公平路由器排队
在当前的计算机通信网络中,由于业务量的增加而导致的丢包和时延的增加成为实现理想的服务质量(QoS)的瓶颈。加权公平排队可以根据每个报文所关联的SLA (Service Level Agreement)提供差异化的服务。然而,由于对真实路由器的队列使用率、丢包率和时延的测量并不准确,而且真实网络系统的固有特性是总会存在一些不可预测的流量模式,因此现有的WFQ更新方法还可以进一步改进和扩展。本文介绍了一种具有不确定性感知的软行为-评论代理。首先,学习到的权重更新策略是一种最大熵策略,在估计和模型误差下具有鲁棒性。其次,将模型不确定性估计技术引入到智能体中,使其能够检测到在训练期间未见过的新状态,从而便于策略切换框架;该算法显示了将强化学习用于WFQ权值更新的潜力,并通过监测模型的不确定性与现有技术相兼容,使系统更加鲁棒和稳定。通过仿真研究验证了该算法的优越性,为进一步探索提供了良好的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信