基于actor - critical学习的无人机接入和回程网络能量优化。

IF 1.9 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

EURASIP Journal on Wireless Communications and Networking Pub Date : 2021-01-01 Epub Date: 2021-04-07 DOI:10.1186/s13638-021-01960-0

Yaxiong Yuan, Lei Lei, Thang X Vu, Symeon Chatzinotas, Sumei Sun, Björn Ottersten

{"title":"基于actor - critical学习的无人机接入和回程网络能量优化。","authors":"Yaxiong Yuan, Lei Lei, Thang X Vu, Symeon Chatzinotas, Sumei Sun, Björn Ottersten","doi":"10.1186/s13638-021-01960-0","DOIUrl":null,"url":null,"abstract":"In unmanned aerial vehicle (UAV)-assisted networks, UAV acts as an aerial base station which acquires the requested data via backhaul link and then serves ground users (GUs) through an access network. In this paper, we investigate an energy minimization problem with a limited power supply for both backhaul and access links. The difficulties for solving such a non-convex and combinatorial problem lie at the high computational complexity/time. In solution development, we consider the approaches from both actor-critic deep reinforcement learning (AC-DRL) and optimization perspectives. First, two offline non-learning algorithms, i.e., an optimal and a heuristic algorithms, based on piecewise linear approximation and relaxation are developed as benchmarks. Second, toward real-time decision-making, we improve the conventional AC-DRL and propose two learning schemes: AC-based user group scheduling and backhaul power allocation (ACGP), and joint AC-based user group scheduling and optimization-based backhaul power allocation (ACGOP). Numerical results show that the computation time of both ACGP and ACGOP is reduced tenfold to hundredfold compared to the offline approaches, and ACGOP is better than ACGP in energy savings. The results also verify the superiority of proposed learning solutions in terms of guaranteeing the feasibility and minimizing the system energy compared to the conventional AC-DRL.","PeriodicalId":12040,"journal":{"name":"EURASIP Journal on Wireless Communications and Networking","volume":"2021 1","pages":"78"},"PeriodicalIF":1.9000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13638-021-01960-0","citationCount":"3","resultStr":"{\"title\":\"Actor-critic learning-based energy optimization for UAV access and backhaul networks.\",\"authors\":\"Yaxiong Yuan, Lei Lei, Thang X Vu, Symeon Chatzinotas, Sumei Sun, Björn Ottersten\",\"doi\":\"10.1186/s13638-021-01960-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In unmanned aerial vehicle (UAV)-assisted networks, UAV acts as an aerial base station which acquires the requested data via backhaul link and then serves ground users (GUs) through an access network. In this paper, we investigate an energy minimization problem with a limited power supply for both backhaul and access links. The difficulties for solving such a non-convex and combinatorial problem lie at the high computational complexity/time. In solution development, we consider the approaches from both actor-critic deep reinforcement learning (AC-DRL) and optimization perspectives. First, two offline non-learning algorithms, i.e., an optimal and a heuristic algorithms, based on piecewise linear approximation and relaxation are developed as benchmarks. Second, toward real-time decision-making, we improve the conventional AC-DRL and propose two learning schemes: AC-based user group scheduling and backhaul power allocation (ACGP), and joint AC-based user group scheduling and optimization-based backhaul power allocation (ACGOP). Numerical results show that the computation time of both ACGP and ACGOP is reduced tenfold to hundredfold compared to the offline approaches, and ACGOP is better than ACGP in energy savings. The results also verify the superiority of proposed learning solutions in terms of guaranteeing the feasibility and minimizing the system energy compared to the conventional AC-DRL.\",\"PeriodicalId\":12040,\"journal\":{\"name\":\"EURASIP Journal on Wireless Communications and Networking\",\"volume\":\"2021 1\",\"pages\":\"78\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s13638-021-01960-0\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EURASIP Journal on Wireless Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1186/s13638-021-01960-0\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2021/4/7 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EURASIP Journal on Wireless Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1186/s13638-021-01960-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/4/7 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 3

摘要

在无人机辅助网络中，无人机作为空中基站，通过回程链路获取请求数据，然后通过接入网为地面用户提供服务。在这篇论文中，我们研究了在有限的电力供应下，回传和接入链路的能量最小化问题。求解此类非凸组合问题的难点在于计算复杂度/时间较高。在解决方案开发中，我们考虑了actor-critic深度强化学习(AC-DRL)和优化视角的方法。首先，提出了基于分段线性逼近和松弛的两种离线非学习算法，即最优算法和启发式算法作为基准。其次，在实时决策方面，对传统的AC-DRL进行了改进，提出了基于交流的用户组调度和回程功率分配(ACGP)和基于交流的用户组调度和优化回程功率分配(ACGOP)两种学习方案。数值结果表明，ACGP和ACGOP的计算时间都比离线方法缩短了10倍到100倍，ACGOP在节能方面优于ACGP。与传统的AC-DRL相比，所提出的学习方案在保证可行性和最小化系统能量方面具有优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Actor-critic learning-based energy optimization for UAV access and backhaul networks.

查看原文本刊更多论文

Actor-critic learning-based energy optimization for UAV access and backhaul networks.

In unmanned aerial vehicle (UAV)-assisted networks, UAV acts as an aerial base station which acquires the requested data via backhaul link and then serves ground users (GUs) through an access network. In this paper, we investigate an energy minimization problem with a limited power supply for both backhaul and access links. The difficulties for solving such a non-convex and combinatorial problem lie at the high computational complexity/time. In solution development, we consider the approaches from both actor-critic deep reinforcement learning (AC-DRL) and optimization perspectives. First, two offline non-learning algorithms, i.e., an optimal and a heuristic algorithms, based on piecewise linear approximation and relaxation are developed as benchmarks. Second, toward real-time decision-making, we improve the conventional AC-DRL and propose two learning schemes: AC-based user group scheduling and backhaul power allocation (ACGP), and joint AC-based user group scheduling and optimization-based backhaul power allocation (ACGOP). Numerical results show that the computation time of both ACGP and ACGOP is reduced tenfold to hundredfold compared to the offline approaches, and ACGOP is better than ACGP in energy savings. The results also verify the superiority of proposed learning solutions in terms of guaranteeing the feasibility and minimizing the system energy compared to the conventional AC-DRL.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

EURASIP Journal on Wireless Communications and Networking 工程技术-电信学

CiteScore

7.70

自引率

3.80%

发文量

109

审稿时长

8.0 months

期刊介绍： The overall aim of the EURASIP Journal on Wireless Communications and Networking (EURASIP JWCN) is to bring together science and applications of wireless communications and networking technologies with emphasis on signal processing techniques and tools. It is directed at both practicing engineers and academic researchers. EURASIP Journal on Wireless Communications and Networking will highlight the continued growth and new challenges in wireless technology, for both application development and basic research. Articles should emphasize original results relating to the theory and/or applications of wireless communications and networking. Review articles, especially those emphasizing multidisciplinary views of communications and networking, are also welcome. EURASIP Journal on Wireless Communications and Networking employs a paperless, electronic submission and evaluation system to promote a rapid turnaround in the peer-review process. The journal is an Open Access journal since 2004.