Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning

IF 2.9 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

ACM Transactions on Cyber-Physical Systems Pub Date : 2020-06-15 DOI:10.1145/3372153

A. Sapio, S. Bhattacharyya, M. Wolf

引用次数: 2

Abstract

Markov Decision Processes (MDPs) provide important capabilities for facilitating the dynamic adaptation and self-optimization of cyber physical systems at runtime. In recent years, this has primarily taken the form of Reinforcement Learning (RL) techniques that eliminate some MDP components for the purpose of reducing computational requirements. In this work, we show that recent advancements in Compact MDP Models (CMMs) provide sufficient cause to question this trend when designing wireless sensor network nodes. In this work, a novel CMM-based approach to designing self-aware wireless sensor nodes is presented and compared to Q-Learning, a popular RL technique. We show that a certain class of CPS nodes is not well served by RL methods and contrast RL versus CMM methods in this context. Through both simulation and a prototype implementation, we demonstrate that CMM methods can provide significantly better runtime adaptation performance relative to Q-Learning, with comparable resource requirements.

查看原文本刊更多论文

基于结构化学习的无线传感器节点运行时自适应

马尔可夫决策过程（MDP）为促进网络物理系统在运行时的动态适应和自优化提供了重要的能力。近年来，这主要采用强化学习（RL）技术的形式，该技术消除了一些MDP组件，以减少计算需求。在这项工作中，我们表明，在设计无线传感器网络节点时，紧凑型MDP模型（CMM）的最新进展为质疑这一趋势提供了充分的理由。在这项工作中，提出了一种新的基于CMM的自感知无线传感器节点设计方法，并将其与流行的RL技术Q-Learning进行了比较。我们证明了RL方法不能很好地服务于某类CPS节点，并在这种情况下对比了RL与CMM方法。通过仿真和原型实现，我们证明了CMM方法可以提供比Q-Learning更好的运行时自适应性能，并且具有相当的资源需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Cyber-Physical Systems COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-

CiteScore

5.70

自引率

4.30%

发文量