{"title":"基于强化学习的不透明执行传感器激活策略优化","authors":"Jiahan He;Deguang Wang;Ming Yang;Chengbin Liang","doi":"10.1109/JSEN.2024.3471931","DOIUrl":null,"url":null,"abstract":"As a confidentiality property, opacity characterizes the ability of an external intruder to infer the secret information of a system. Ensuring opacity can be realized by dynamic sensor activation to manage event observability. By controlling which sensors are active and what events are observable, the system can effectively prevent the exposure of sensitive information, ensuring that the confidential parts of its behavior remain opaque. In practice, event hiding and sensor switching involved in dynamic sensor activation are recognized as costly operations. This study addresses the numerical optimization problem of sensor activation policy (SAP) to enforce opacity using reinforcement learning (RL). A most permissive observer (MPO) is used to incorporate all valid SAPs that ensure opacity. The quantitative objective of the optimization problem is to minimize the maximum discounted total cost. A systematic procedure is provided to convert MPO into a Markov game, facilitating the use of RL techniques. Minimax Q-learning is presented as the methodology to derive an optimal policy for sensor activation/deactivation decisions from the convergent Q-table. Finally, the effectiveness and applicability of the proposed method are demonstrated on a location-tracking problem in a smart factory setting.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 22","pages":"38429-38439"},"PeriodicalIF":4.3000,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sensor Activation Policy Optimization for Opacity Enforcement Based on Reinforcement Learning\",\"authors\":\"Jiahan He;Deguang Wang;Ming Yang;Chengbin Liang\",\"doi\":\"10.1109/JSEN.2024.3471931\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As a confidentiality property, opacity characterizes the ability of an external intruder to infer the secret information of a system. Ensuring opacity can be realized by dynamic sensor activation to manage event observability. By controlling which sensors are active and what events are observable, the system can effectively prevent the exposure of sensitive information, ensuring that the confidential parts of its behavior remain opaque. In practice, event hiding and sensor switching involved in dynamic sensor activation are recognized as costly operations. This study addresses the numerical optimization problem of sensor activation policy (SAP) to enforce opacity using reinforcement learning (RL). A most permissive observer (MPO) is used to incorporate all valid SAPs that ensure opacity. The quantitative objective of the optimization problem is to minimize the maximum discounted total cost. A systematic procedure is provided to convert MPO into a Markov game, facilitating the use of RL techniques. Minimax Q-learning is presented as the methodology to derive an optimal policy for sensor activation/deactivation decisions from the convergent Q-table. Finally, the effectiveness and applicability of the proposed method are demonstrated on a location-tracking problem in a smart factory setting.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"24 22\",\"pages\":\"38429-38439\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10706800/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10706800/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Sensor Activation Policy Optimization for Opacity Enforcement Based on Reinforcement Learning
As a confidentiality property, opacity characterizes the ability of an external intruder to infer the secret information of a system. Ensuring opacity can be realized by dynamic sensor activation to manage event observability. By controlling which sensors are active and what events are observable, the system can effectively prevent the exposure of sensitive information, ensuring that the confidential parts of its behavior remain opaque. In practice, event hiding and sensor switching involved in dynamic sensor activation are recognized as costly operations. This study addresses the numerical optimization problem of sensor activation policy (SAP) to enforce opacity using reinforcement learning (RL). A most permissive observer (MPO) is used to incorporate all valid SAPs that ensure opacity. The quantitative objective of the optimization problem is to minimize the maximum discounted total cost. A systematic procedure is provided to convert MPO into a Markov game, facilitating the use of RL techniques. Minimax Q-learning is presented as the methodology to derive an optimal policy for sensor activation/deactivation decisions from the convergent Q-table. Finally, the effectiveness and applicability of the proposed method are demonstrated on a location-tracking problem in a smart factory setting.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice