基于强化学习的三层异构网络动态节能算法

2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC) Pub Date : 2019-09-01 DOI:10.1109/PIMRC.2019.8904290

Hao Sun, Tiejun Lv, Xuewei Zhang, Ziyu Liu

{"title":"基于强化学习的三层异构网络动态节能算法","authors":"Hao Sun, Tiejun Lv, Xuewei Zhang, Ziyu Liu","doi":"10.1109/PIMRC.2019.8904290","DOIUrl":null,"url":null,"abstract":"To cope with the rapidly growing demand for data traffic, heterogeneous network (HetNet), including different types of base stations (BSs), is advocated as a promising network architecture. Considering the different quality of service (QoS) requirements of the Internet of things (IoT) users and ordinary users, we propose a three-tier HetNet model with non-equal bandwidth. To reduce the power consumption caused by the dense deployment of BSs, we propose a novel reinforcement-learning (RL) based dynamic pico-cell base station (PBS) operation scheme. The proposed RL scheme is based on the asynchronous advantage actor-critic (A3C) algorithm, and can dynamically determine the on/off state of each PBS, aiming to achieve the minimal total power of the macro-cell without any prior information. Simulation results show that the proposed algorithm can achieve 92.1% performance gain of the optimal level in terms of the power consumption saving while needs less training time and lower running resource requirements compared to the benchmarks.","PeriodicalId":412182,"journal":{"name":"2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Reinforcement Learning Based Dynamic Energy-Saving Algorithm for Three-tier Heterogeneous Networks\",\"authors\":\"Hao Sun, Tiejun Lv, Xuewei Zhang, Ziyu Liu\",\"doi\":\"10.1109/PIMRC.2019.8904290\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To cope with the rapidly growing demand for data traffic, heterogeneous network (HetNet), including different types of base stations (BSs), is advocated as a promising network architecture. Considering the different quality of service (QoS) requirements of the Internet of things (IoT) users and ordinary users, we propose a three-tier HetNet model with non-equal bandwidth. To reduce the power consumption caused by the dense deployment of BSs, we propose a novel reinforcement-learning (RL) based dynamic pico-cell base station (PBS) operation scheme. The proposed RL scheme is based on the asynchronous advantage actor-critic (A3C) algorithm, and can dynamically determine the on/off state of each PBS, aiming to achieve the minimal total power of the macro-cell without any prior information. Simulation results show that the proposed algorithm can achieve 92.1% performance gain of the optimal level in terms of the power consumption saving while needs less training time and lower running resource requirements compared to the benchmarks.\",\"PeriodicalId\":412182,\"journal\":{\"name\":\"2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)\",\"volume\":\"79 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PIMRC.2019.8904290\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIMRC.2019.8904290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

为了应对快速增长的数据流量需求，异构网络(HetNet)，包括不同类型的基站(BSs)，被认为是一种很有前途的网络架构。考虑到物联网用户和普通用户对服务质量(QoS)的不同要求，提出了一种非等带宽的三层HetNet模型。为了降低无线基站密集部署带来的功耗，提出了一种基于强化学习(RL)的动态微蜂窝基站(PBS)运行方案。提出的RL方案基于异步优势actor-critic (A3C)算法，能够动态确定每个PBS的开/关状态，在不需要任何先验信息的情况下实现宏单元的总功率最小。仿真结果表明，与基准测试相比，该算法在节省功耗方面的性能提升达到了最优水平的92.1%，同时所需的训练时间和运行资源也更少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement Learning Based Dynamic Energy-Saving Algorithm for Three-tier Heterogeneous Networks

To cope with the rapidly growing demand for data traffic, heterogeneous network (HetNet), including different types of base stations (BSs), is advocated as a promising network architecture. Considering the different quality of service (QoS) requirements of the Internet of things (IoT) users and ordinary users, we propose a three-tier HetNet model with non-equal bandwidth. To reduce the power consumption caused by the dense deployment of BSs, we propose a novel reinforcement-learning (RL) based dynamic pico-cell base station (PBS) operation scheme. The proposed RL scheme is based on the asynchronous advantage actor-critic (A3C) algorithm, and can dynamically determine the on/off state of each PBS, aiming to achieve the minimal total power of the macro-cell without any prior information. Simulation results show that the proposed algorithm can achieve 92.1% performance gain of the optimal level in terms of the power consumption saving while needs less training time and lower running resource requirements compared to the benchmarks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)

自引率

0.00%

发文量