基于深度强化学习的天线阵幅相离散引起的辐射方向图旁瓣升高补偿

2023 Photonics & Electromagnetics Research Symposium (PIERS) Pub Date : 2023-07-03 DOI:10.1109/PIERS59004.2023.10221258

Shiyuan Zhang, Chuan Shi, Ou Pan, M. Bai

{"title":"基于深度强化学习的天线阵幅相离散引起的辐射方向图旁瓣升高补偿","authors":"Shiyuan Zhang, Chuan Shi, Ou Pan, M. Bai","doi":"10.1109/PIERS59004.2023.10221258","DOIUrl":null,"url":null,"abstract":"In our previous research, we proposed a multi-layer perceptron method for synthesizing radiation patterns with complex requirements. However, this method has a limitation in that it requires continuous amplitude and phase excitations. The continuous excitations may not be practical in most applications, and when the amplitude and phase of the antenna element are discontinuous due to the resolution of the electronic device during the optimization process, this method may fail. In order to solve this problem, a combined method is proposed in this paper, utilizing the multi-layer perceptron network to optimize the radiation patterns of array antenna with continuous amplitude and phase, and introducing a deep reinforcement learning method to compensate for the elevated sidelobes in the radiation patterns caused by amplitude and phase discretization in post-processing. Specifically, the continuous amplitude and phase values obtained through the multi-layer perceptron network are rounded to approximate discrete results based on the resolution, serving as prior training experience for the deep reinforcement learning model. The compensation problem of elevated sidelobes caused by amplitude and phase discretization is then formulated as an optimization model, and a deep reinforcement learning model is constructed accordingly, with discrete excitations acting as the agent for exploration and search. The agent is trained using the Deep Q-learning network as the basic framework, combined with Double DQN technology and Dueling DQN technology to efficiently search for the best compensatory effect. The reward is carefully designed to incentivize the agent to search for discrete excitations with the optimal compensatory effect. A simulation experiment is conducted on a 50-element hemispheric conformal antenna array, demonstrating the effectiveness of the combined method.","PeriodicalId":354610,"journal":{"name":"2023 Photonics & Electromagnetics Research Symposium (PIERS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Compensation for Elevated Sidelobe of Radiation Pattern of Antenna Array Caused by Amplitude and Phase Discretization Based on Deep Reinforcement Learning\",\"authors\":\"Shiyuan Zhang, Chuan Shi, Ou Pan, M. Bai\",\"doi\":\"10.1109/PIERS59004.2023.10221258\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In our previous research, we proposed a multi-layer perceptron method for synthesizing radiation patterns with complex requirements. However, this method has a limitation in that it requires continuous amplitude and phase excitations. The continuous excitations may not be practical in most applications, and when the amplitude and phase of the antenna element are discontinuous due to the resolution of the electronic device during the optimization process, this method may fail. In order to solve this problem, a combined method is proposed in this paper, utilizing the multi-layer perceptron network to optimize the radiation patterns of array antenna with continuous amplitude and phase, and introducing a deep reinforcement learning method to compensate for the elevated sidelobes in the radiation patterns caused by amplitude and phase discretization in post-processing. Specifically, the continuous amplitude and phase values obtained through the multi-layer perceptron network are rounded to approximate discrete results based on the resolution, serving as prior training experience for the deep reinforcement learning model. The compensation problem of elevated sidelobes caused by amplitude and phase discretization is then formulated as an optimization model, and a deep reinforcement learning model is constructed accordingly, with discrete excitations acting as the agent for exploration and search. The agent is trained using the Deep Q-learning network as the basic framework, combined with Double DQN technology and Dueling DQN technology to efficiently search for the best compensatory effect. The reward is carefully designed to incentivize the agent to search for discrete excitations with the optimal compensatory effect. A simulation experiment is conducted on a 50-element hemispheric conformal antenna array, demonstrating the effectiveness of the combined method.\",\"PeriodicalId\":354610,\"journal\":{\"name\":\"2023 Photonics & Electromagnetics Research Symposium (PIERS)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Photonics & Electromagnetics Research Symposium (PIERS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PIERS59004.2023.10221258\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Photonics & Electromagnetics Research Symposium (PIERS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIERS59004.2023.10221258","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在我们之前的研究中，我们提出了一种多层感知器方法来合成具有复杂要求的辐射模式。然而，这种方法的局限性在于它需要连续的幅度和相位激励。在大多数应用中，连续激励可能不实际，并且在优化过程中，由于电子设备的分辨率，天线元件的幅度和相位不连续时，该方法可能会失败。为了解决这一问题，本文提出了一种组合方法，利用多层感知器网络对幅相连续的阵列天线的辐射方向图进行优化，并引入深度强化学习方法来补偿后处理中幅相离散导致的辐射方向图副瓣升高。具体而言，通过多层感知器网络获得的连续振幅和相位值根据分辨率四舍五入到近似离散结果，作为深度强化学习模型的先验训练经验。然后将振幅和相位离散化引起的副瓣升高补偿问题作为优化模型，并以此为基础构建深度强化学习模型，以离散激励作为探索和搜索的代理。该智能体以Deep Q-learning网络为基本框架进行训练，结合Double DQN技术和Dueling DQN技术高效搜索最佳补偿效果。奖励是精心设计的，以激励智能体寻找具有最佳补偿效果的离散激励。在50元半球共形天线阵上进行了仿真实验，验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Compensation for Elevated Sidelobe of Radiation Pattern of Antenna Array Caused by Amplitude and Phase Discretization Based on Deep Reinforcement Learning

In our previous research, we proposed a multi-layer perceptron method for synthesizing radiation patterns with complex requirements. However, this method has a limitation in that it requires continuous amplitude and phase excitations. The continuous excitations may not be practical in most applications, and when the amplitude and phase of the antenna element are discontinuous due to the resolution of the electronic device during the optimization process, this method may fail. In order to solve this problem, a combined method is proposed in this paper, utilizing the multi-layer perceptron network to optimize the radiation patterns of array antenna with continuous amplitude and phase, and introducing a deep reinforcement learning method to compensate for the elevated sidelobes in the radiation patterns caused by amplitude and phase discretization in post-processing. Specifically, the continuous amplitude and phase values obtained through the multi-layer perceptron network are rounded to approximate discrete results based on the resolution, serving as prior training experience for the deep reinforcement learning model. The compensation problem of elevated sidelobes caused by amplitude and phase discretization is then formulated as an optimization model, and a deep reinforcement learning model is constructed accordingly, with discrete excitations acting as the agent for exploration and search. The agent is trained using the Deep Q-learning network as the basic framework, combined with Double DQN technology and Dueling DQN technology to efficiently search for the best compensatory effect. The reward is carefully designed to incentivize the agent to search for discrete excitations with the optimal compensatory effect. A simulation experiment is conducted on a 50-element hemispheric conformal antenna array, demonstrating the effectiveness of the combined method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 Photonics & Electromagnetics Research Symposium (PIERS)

自引率

0.00%

发文量