Spectrum Allocation for Covert Communications in Cellular-Enabled UAV Networks: A Deep Reinforcement Learning Approach

Journal of Networking and Network Applications Pub Date : 1900-01-01 DOI:10.33969/j-nana.2022.020302

Xinzhe Pi, Bin Yang

引用次数: 0

Abstract

This paper investigates the covert communications via spectrum allocations in a cellular-enabled unmanned aerial vehicle (UAV) network consisting of a base station (BS), UAVs, ground users (GUs), and a warden, where warden attempts to detect the transmission from a target GU to a UAV receiver. We formulate the spectrum allocation as an optimization problem with the constraints of covertness performance requirement and the qualities of service (QoS) of cellular communications. This is a nonlinear and nonconvex problem, which is generally challenging to be solved. Thus, we propose a deep reinforcement learning (DRL) approach to solve it. Under such an approach, we first model the multi-agent DRL environment in such networks. Then we define the state, action, reward and interaction mechanism of the DRL environment. Finally, a DRL algorithm is presented for learning the optimal policy of spectrum allocation.

查看原文本刊更多论文

蜂窝式无人机网络隐蔽通信的频谱分配:一种深度强化学习方法

本文研究了由基站(BS)、无人机、地面用户(GUs)和监狱长组成的蜂窝无人机(UAV)网络中通过频谱分配的秘密通信，其中监狱长试图检测从目标GU到无人机接收器的传输。我们将频谱分配描述为蜂窝通信的覆盖性、性能要求和服务质量(QoS)约束下的优化问题。这是一个非线性非凸问题，通常具有挑战性。因此，我们提出了一种深度强化学习(DRL)方法来解决它。在这种方法下，我们首先对这种网络中的多智能体DRL环境进行建模。然后定义了DRL环境的状态、行为、奖励和交互机制。最后，提出了一种学习最优频谱分配策略的DRL算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Networking and Network Applications

自引率

0.00%

发文量