Bayesian Inverse Learning and Online Changepoint Detection of Cognitive Radar Strategies

IEEE Transactions on Radar Systems Pub Date : 2025-03-13 DOI:10.1109/TRS.2025.3551066

C. V. Anoop;Anup Aprem

{"title":"Bayesian Inverse Learning and Online Changepoint Detection of Cognitive Radar Strategies","authors":"C. V. Anoop;Anup Aprem","doi":"10.1109/TRS.2025.3551066","DOIUrl":null,"url":null,"abstract":"In this article, we introduce an online electronic countermeasure framework for learning the strategies and detecting changes in the strategy of an adversarial cognitive radar (CR) in an inverse learning framework. We model the CR as a rational, constrained utility-maximizing agent and formulate the problem in the revealed preference setting. The utility of CR is modeled as a random direction vector, that follows the von Mises-Fisher distribution with unknown parameters, and we use Bayesian machine learning with revealed preference characterization to learn the radar’s utility, and extend it to the detection of changes in strategy in an online setting. The main contributions of the article are: 1) the development of Bayesian machine learning algorithms—HBOIL and HBOCPD, for inverse learning and detection of changes in strategies of an adversarial CR, respectively; 2) HBOIL and HBOCPD use a Hamiltonian Monte Carlo (HMC) sampling algorithm that exploits the Afriat’s theorem in revealed preference as well as a subsetting structure that arises in the posterior, and hence is devoid of the computational burden of solving optimization problems in existing techniques; 3) numerical results demonstrate the ability to characterize and determine the changes in the beam allocation strategy of a CR in noise-free and noisy adversarial settings. HBOIL and HBOCPD are robust to observation noise compared to the existing approaches; 4) HBOIL outperforms classical machine learning approaches in predicting optimal radar responses; and 5) HBOCPD performs, on an average, five times faster compared to the classical offline generalized likelihood ratio (GLR) approach, while using less restrictive assumptions.","PeriodicalId":100645,"journal":{"name":"IEEE Transactions on Radar Systems","volume":"3 ","pages":"562-575"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Radar Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10925519/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this article, we introduce an online electronic countermeasure framework for learning the strategies and detecting changes in the strategy of an adversarial cognitive radar (CR) in an inverse learning framework. We model the CR as a rational, constrained utility-maximizing agent and formulate the problem in the revealed preference setting. The utility of CR is modeled as a random direction vector, that follows the von Mises-Fisher distribution with unknown parameters, and we use Bayesian machine learning with revealed preference characterization to learn the radar’s utility, and extend it to the detection of changes in strategy in an online setting. The main contributions of the article are: 1) the development of Bayesian machine learning algorithms—HBOIL and HBOCPD, for inverse learning and detection of changes in strategies of an adversarial CR, respectively; 2) HBOIL and HBOCPD use a Hamiltonian Monte Carlo (HMC) sampling algorithm that exploits the Afriat’s theorem in revealed preference as well as a subsetting structure that arises in the posterior, and hence is devoid of the computational burden of solving optimization problems in existing techniques; 3) numerical results demonstrate the ability to characterize and determine the changes in the beam allocation strategy of a CR in noise-free and noisy adversarial settings. HBOIL and HBOCPD are robust to observation noise compared to the existing approaches; 4) HBOIL outperforms classical machine learning approaches in predicting optimal radar responses; and 5) HBOCPD performs, on an average, five times faster compared to the classical offline generalized likelihood ratio (GLR) approach, while using less restrictive assumptions.

查看原文本刊更多论文

认知雷达策略的贝叶斯逆学习与在线变点检测

在本文中，我们介绍了一个在线电子对抗框架，用于在逆学习框架中学习对抗性认知雷达（CR）的策略并检测策略变化。我们将CR建模为一个理性的、受约束的效用最大化代理，并在揭示的偏好设置中制定问题。CR的效用被建模为一个随机方向向量，它遵循未知参数的von Mises-Fisher分布，我们使用具有揭示偏好特征的贝叶斯机器学习来学习雷达的效用，并将其扩展到在线设置中策略变化的检测。本文的主要贡献有：1)开发了贝叶斯机器学习算法——hboil和HBOCPD，分别用于逆向学习和检测对抗性CR的策略变化；2) HBOIL和HBOCPD使用哈密顿蒙特卡罗（HMC）采样算法，该算法利用了显示偏好中的Afriat定理以及后验中产生的子集结构，因此没有现有技术中求解优化问题的计算负担；3)数值结果表明，该方法能够表征和确定无噪声和对抗噪声环境下CR的波束分配策略的变化。与现有方法相比，HBOIL和HBOCPD对观测噪声具有较强的鲁棒性；4) HBOIL在预测最优雷达响应方面优于经典机器学习方法；5)与经典的离线广义似然比（GLR）方法相比，HBOCPD方法的执行速度平均快5倍，同时使用的限制较少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Radar Systems

自引率

0.00%

发文量