Reinforcing State-Dependent N Best Quality of Service Routes in Communication Networks

2007 Workshop on High Performance Switching and Routing Pub Date : 2007-07-30 DOI:10.1109/HPSR.2007.4281269

A. Mellouk, S. Hoceini

{"title":"Reinforcing State-Dependent N Best Quality of Service Routes in Communication Networks","authors":"A. Mellouk, S. Hoceini","doi":"10.1109/HPSR.2007.4281269","DOIUrl":null,"url":null,"abstract":"In the context of modern high-speed Internet network, routing is often complicated by the notion of guaranteed quality of service (QoS), which can either be related to time, packet loss or bandwidth requirements: constraints related to various types of QoS make some routing inacceptable. Due to emerging real-time and multimedia applications, efficient routing of information packets in dynamically changing communication network requires that as the load levels, traffic patterns and topology of the network change, the routing policy also adapts. We focused in this paper on QoS based routing by developing a neuro-dynamic programming to construct dynamic state-dependent routing policies. In this paper, we propose an approach based on adaptive algorithm for packet routing using reinforcement learning called N best optimal path Q routing algorithm (NOQRA) which optimizes two criteria: cumulative cost path (or hop count if each link cost =1) and end-to-end delay. A load balancing policy depending on a dynamical traffic path probability distribution function is also defined and embodied in NOQRA to characterize the distribution of the traffic over the N Best Paths. Numerical results obtained with OPNET simulator for different levels of traffic's load show that NOQRA gives better results compared to standard optimal path routing and Q-routing algorithm based on Q-learning paradigm.","PeriodicalId":258491,"journal":{"name":"2007 Workshop on High Performance Switching and Routing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 Workshop on High Performance Switching and Routing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPSR.2007.4281269","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

In the context of modern high-speed Internet network, routing is often complicated by the notion of guaranteed quality of service (QoS), which can either be related to time, packet loss or bandwidth requirements: constraints related to various types of QoS make some routing inacceptable. Due to emerging real-time and multimedia applications, efficient routing of information packets in dynamically changing communication network requires that as the load levels, traffic patterns and topology of the network change, the routing policy also adapts. We focused in this paper on QoS based routing by developing a neuro-dynamic programming to construct dynamic state-dependent routing policies. In this paper, we propose an approach based on adaptive algorithm for packet routing using reinforcement learning called N best optimal path Q routing algorithm (NOQRA) which optimizes two criteria: cumulative cost path (or hop count if each link cost =1) and end-to-end delay. A load balancing policy depending on a dynamical traffic path probability distribution function is also defined and embodied in NOQRA to characterize the distribution of the traffic over the N Best Paths. Numerical results obtained with OPNET simulator for different levels of traffic's load show that NOQRA gives better results compared to standard optimal path routing and Q-routing algorithm based on Q-learning paradigm.

查看原文本刊更多论文

通信网络中状态依赖N最佳服务质量路由的增强

在现代高速Internet网络的背景下，路由通常因保证服务质量(QoS)的概念而变得复杂，这可能与时间、数据包丢失或带宽要求有关:与各种类型的QoS相关的约束使某些路由不可接受。随着实时和多媒体应用的不断涌现，动态变化的通信网络中信息包的高效路由要求路由策略随着网络的负载水平、流量模式和拓扑结构的变化而变化。本文主要研究基于QoS的路由，通过神经动态规划来构建动态状态相关的路由策略。在本文中，我们提出了一种基于自适应算法的方法，用于使用强化学习的数据包路由，称为N最优路径Q路由算法(NOQRA)，它优化了两个标准:累积成本路径(或跳数，如果每个链路成本=1)和端到端延迟。在NOQRA中定义了基于动态流量路径概率分布函数的负载均衡策略，以表征N条最佳路径上的流量分布。在OPNET仿真器上对不同流量负载水平的仿真结果表明，与标准最优路径路由和基于q -学习范式的q -路由算法相比，NOQRA算法具有更好的优化效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 Workshop on High Performance Switching and Routing

自引率

0.00%

发文量