{"title":"Implications of Centralized and Distributed Multi-Agent Deep Reinforcement Learning in Dynamic Spectrum Access","authors":"Abdikarim Mohamed Ibrahim, K. Yau, Mee Hong Ling","doi":"10.1109/ISTT56288.2022.9966551","DOIUrl":null,"url":null,"abstract":"Multi-agent Deep Reinforcement Learning (MADRL) has been applied to a plethora of state-of-the-art applications such as resource allocations and network routing in both centralized and distributed manners. This paper investigates the performance of centralized and distributed MADRL in Dynamic Spectrum Access (DSA). We consider a multichannel wireless network with a shared bandwidth divided into k channels. The objective of the MADRL is to develop a spectrum access strategy by learning in both a centralized and distributed manner. To evaluate the performance of centralized and distributed MADRL, we tackle the spectrum access problem by applying centralized MADRL and distributed MADRL. Experimental results show that distributed MADRL outperforms the centralized MADRL by 15% in collision avoidance and accumulated rewards in DSA.","PeriodicalId":389716,"journal":{"name":"2022 IEEE 6th International Symposium on Telecommunication Technologies (ISTT)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 6th International Symposium on Telecommunication Technologies (ISTT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTT56288.2022.9966551","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Multi-agent Deep Reinforcement Learning (MADRL) has been applied to a plethora of state-of-the-art applications such as resource allocations and network routing in both centralized and distributed manners. This paper investigates the performance of centralized and distributed MADRL in Dynamic Spectrum Access (DSA). We consider a multichannel wireless network with a shared bandwidth divided into k channels. The objective of the MADRL is to develop a spectrum access strategy by learning in both a centralized and distributed manner. To evaluate the performance of centralized and distributed MADRL, we tackle the spectrum access problem by applying centralized MADRL and distributed MADRL. Experimental results show that distributed MADRL outperforms the centralized MADRL by 15% in collision avoidance and accumulated rewards in DSA.