{"title":"Neural predictor aided policy optimization for adversarial controlled sensing","authors":"Nicholas Kalouptsidis, George Stamatelis","doi":"10.1016/j.sigpro.2025.110115","DOIUrl":null,"url":null,"abstract":"<div><div>This paper is concerned with the fundamental problem of controlled sensing, namely how to optimize signal processing resources in a sensor network in order to detect the true hidden state of the environment, when the sensors are subject to adversarial attacks. The sensing task is performed by a legitimate agent who actively selects observations generated by a set of sensors and makes inference about the true state by minimizing the error probability. The adversary may have access to all or a subset of the sensors and can influence the quality of observations. Agents may have only partial access to the complete data set, leading to different beliefs about the true state, different perceptions of the error probability. To address the complexities of this problem, we define well motivated approximate structures that fill the gap of partial information. We provide three different objective functions for training a neural predictor, and we demonstrate how prediction quality is a precondition for detection performance. Based on the above concepts, we propose a novel deep reinforcement learning (DRL) algorithm, termed Predictive Proximal Policy Optimization for Adversarial Controlled Sensing (3POACS) algorithm. This algorithm combines building blocks from single agent DRL, problem specific reward reshaping procedures, and a neural predictor. Finally, we use an anomaly detection example to demonstrate the superiority of the proposed method over previous non-adversarial approaches. Experiments show that the new algorithm favorably competes with DRL algorithms with access to oracle predictors.</div></div>","PeriodicalId":49523,"journal":{"name":"Signal Processing","volume":"238 ","pages":"Article 110115"},"PeriodicalIF":3.4000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0165168425002294","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
This paper is concerned with the fundamental problem of controlled sensing, namely how to optimize signal processing resources in a sensor network in order to detect the true hidden state of the environment, when the sensors are subject to adversarial attacks. The sensing task is performed by a legitimate agent who actively selects observations generated by a set of sensors and makes inference about the true state by minimizing the error probability. The adversary may have access to all or a subset of the sensors and can influence the quality of observations. Agents may have only partial access to the complete data set, leading to different beliefs about the true state, different perceptions of the error probability. To address the complexities of this problem, we define well motivated approximate structures that fill the gap of partial information. We provide three different objective functions for training a neural predictor, and we demonstrate how prediction quality is a precondition for detection performance. Based on the above concepts, we propose a novel deep reinforcement learning (DRL) algorithm, termed Predictive Proximal Policy Optimization for Adversarial Controlled Sensing (3POACS) algorithm. This algorithm combines building blocks from single agent DRL, problem specific reward reshaping procedures, and a neural predictor. Finally, we use an anomaly detection example to demonstrate the superiority of the proposed method over previous non-adversarial approaches. Experiments show that the new algorithm favorably competes with DRL algorithms with access to oracle predictors.
期刊介绍:
Signal Processing incorporates all aspects of the theory and practice of signal processing. It features original research work, tutorial and review articles, and accounts of practical developments. It is intended for a rapid dissemination of knowledge and experience to engineers and scientists working in the research, development or practical application of signal processing.
Subject areas covered by the journal include: Signal Theory; Stochastic Processes; Detection and Estimation; Spectral Analysis; Filtering; Signal Processing Systems; Software Developments; Image Processing; Pattern Recognition; Optical Signal Processing; Digital Signal Processing; Multi-dimensional Signal Processing; Communication Signal Processing; Biomedical Signal Processing; Geophysical and Astrophysical Signal Processing; Earth Resources Signal Processing; Acoustic and Vibration Signal Processing; Data Processing; Remote Sensing; Signal Processing Technology; Radar Signal Processing; Sonar Signal Processing; Industrial Applications; New Applications.