Nour El Din El Sheikh, Esteban Paz, Juan Pinto, A. Beghelli
{"title":"Multi-band provisioning in dynamic elastic optical networks: a comparative study of a heuristic and a deep reinforcement learning approach","authors":"Nour El Din El Sheikh, Esteban Paz, Juan Pinto, A. Beghelli","doi":"10.23919/ONDM51796.2021.9492334","DOIUrl":null,"url":null,"abstract":"The blocking performance of a heuristic and a deep reinforcement learning approach for resource provisioning in a dynamic multi-band elastic optical network is evaluated. The heuristic is based on a previous proposal that prioritises the use of band C, then L, S, and E, in that order. The deep reinforcement learning approach uses a deep Q-network (DQN) agent trained on different multi-band scenarios. Results show, as expected, a significant decrease in blocking probability when moving from the C-band only scenario to the multi-band scenarios (C+L, C+L+S, C+L+S+E). However, the DQN agent did not outperform the heuristic. The lower performance of the agent, also observed in some previous works in optical networks, highlights the need for further research on how to better configure agents and improve the network representation used by the optical network environments.","PeriodicalId":163553,"journal":{"name":"2021 International Conference on Optical Network Design and Modeling (ONDM)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Optical Network Design and Modeling (ONDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ONDM51796.2021.9492334","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
The blocking performance of a heuristic and a deep reinforcement learning approach for resource provisioning in a dynamic multi-band elastic optical network is evaluated. The heuristic is based on a previous proposal that prioritises the use of band C, then L, S, and E, in that order. The deep reinforcement learning approach uses a deep Q-network (DQN) agent trained on different multi-band scenarios. Results show, as expected, a significant decrease in blocking probability when moving from the C-band only scenario to the multi-band scenarios (C+L, C+L+S, C+L+S+E). However, the DQN agent did not outperform the heuristic. The lower performance of the agent, also observed in some previous works in optical networks, highlights the need for further research on how to better configure agents and improve the network representation used by the optical network environments.