{"title":"Deep Reinforcement Learning-Based Adaptive Nulling in Phased Array Under Dynamic Environments","authors":"Ying-Dar Lin;Jen-Hao Chang;Yuan-Cheng Lai","doi":"10.1109/ACCESS.2025.3591643","DOIUrl":null,"url":null,"abstract":"A phased array consists of multiple antenna elements that can control the direction of the radiated signal by adjusting each antenna element’s phase and amplitude, which are encapsulated in the phased array weights. To obtain better communication quality, nulling, which can weaken the interference signal, is helpful by adjusting the phased array weights. In dynamic environments, rapid changes in the directions of both interference and desired signals demand equally rapid, continual updates of phased-array weights. Traditional heuristic optimizers—such as Particle Swarm Optimization (PSO) and Genetic Algorithms (GA)—struggle to keep up because their iterative searches depend on pre-computed configurations that are unrealistic to obtain on the fly. To date, no heuristic, supervised-learning, or reinforcement-learning method simultaneously achieves all three requirements: fast adaptation, dataset-free operation, and continuous complex-weight control under highly dynamic environments. In this paper, an innovative deep reinforcement learning-based adaptive nulling, called DRLNuller, is proposed. DRLnuller adopts the Proximal Policy Optimization (PPO) algorithm, a typical reinforcement learning algorithm, to dynamically optimize phased array weights through continuous interaction with the environment without relying on pre-computed or labeled data. In experiments, DRLNuller after the training process outperforms PSO and GA in computation speed by <inline-formula> <tex-math>$2.83\\times 10^{5}$ </tex-math></inline-formula> times faster and maintains effective communication quality, an average Signal-to-Interference Ratio (SIR) of 25.06 dB, under different conditions.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"13 ","pages":"130988-131002"},"PeriodicalIF":3.6000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11088112","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11088112/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
A phased array consists of multiple antenna elements that can control the direction of the radiated signal by adjusting each antenna element’s phase and amplitude, which are encapsulated in the phased array weights. To obtain better communication quality, nulling, which can weaken the interference signal, is helpful by adjusting the phased array weights. In dynamic environments, rapid changes in the directions of both interference and desired signals demand equally rapid, continual updates of phased-array weights. Traditional heuristic optimizers—such as Particle Swarm Optimization (PSO) and Genetic Algorithms (GA)—struggle to keep up because their iterative searches depend on pre-computed configurations that are unrealistic to obtain on the fly. To date, no heuristic, supervised-learning, or reinforcement-learning method simultaneously achieves all three requirements: fast adaptation, dataset-free operation, and continuous complex-weight control under highly dynamic environments. In this paper, an innovative deep reinforcement learning-based adaptive nulling, called DRLNuller, is proposed. DRLnuller adopts the Proximal Policy Optimization (PPO) algorithm, a typical reinforcement learning algorithm, to dynamically optimize phased array weights through continuous interaction with the environment without relying on pre-computed or labeled data. In experiments, DRLNuller after the training process outperforms PSO and GA in computation speed by $2.83\times 10^{5}$ times faster and maintains effective communication quality, an average Signal-to-Interference Ratio (SIR) of 25.06 dB, under different conditions.
IEEE AccessCOMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
9.80
自引率
7.70%
发文量
6673
审稿时长
6 weeks
期刊介绍:
IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest.
IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on:
Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals.
Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering.
Development of new or improved fabrication or manufacturing techniques.
Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.