Using Taint Analysis and Reinforcement Learning (TARL) to Repair Autonomous Robot Software

2020 IEEE Security and Privacy Workshops (SPW) Pub Date : 2020-05-01 DOI:10.1109/SPW50608.2020.00045

D. Lyons, Saba B. Zahra

{"title":"Using Taint Analysis and Reinforcement Learning (TARL) to Repair Autonomous Robot Software","authors":"D. Lyons, Saba B. Zahra","doi":"10.1109/SPW50608.2020.00045","DOIUrl":null,"url":null,"abstract":"It is important to be able to establish formal performance bounds for autonomous systems. However, formal verification techniques require a model of the environment in which the system operates; a challenge for autonomous systems, especially those expected to operate over longer timescales. This paper describes work in progress to automate the monitor and repair of ROS-based autonomous robot software written for an apriori partially known and possibly incorrect environment model. A taint analysis method is used to automatically extract the dataflow sequence from input topic to publish topic, and instrument that code. A unique reinforcement learning approximation of MDP utility is calculated, an empirical and non-invasive characterization of the inherent objectives of the software designers. By comparing design (a-priori) utility with deploy (deployed system) utility, we show, using a small but real ROS example, that it's possible to monitor a performance criterion and relate violations of the criterion to parts of the software. The software is then patched using automated software repair techniques and evaluated against the original off-line utility.","PeriodicalId":413600,"journal":{"name":"2020 IEEE Security and Privacy Workshops (SPW)","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Security and Privacy Workshops (SPW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPW50608.2020.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

It is important to be able to establish formal performance bounds for autonomous systems. However, formal verification techniques require a model of the environment in which the system operates; a challenge for autonomous systems, especially those expected to operate over longer timescales. This paper describes work in progress to automate the monitor and repair of ROS-based autonomous robot software written for an apriori partially known and possibly incorrect environment model. A taint analysis method is used to automatically extract the dataflow sequence from input topic to publish topic, and instrument that code. A unique reinforcement learning approximation of MDP utility is calculated, an empirical and non-invasive characterization of the inherent objectives of the software designers. By comparing design (a-priori) utility with deploy (deployed system) utility, we show, using a small but real ROS example, that it's possible to monitor a performance criterion and relate violations of the criterion to parts of the software. The software is then patched using automated software repair techniques and evaluated against the original off-line utility.

查看原文本刊更多论文

利用污点分析和强化学习(TARL)修复自主机器人软件

能够为自治系统建立正式的性能界限是很重要的。然而，正式的验证技术需要系统运行环境的模型;这对自动驾驶系统来说是一个挑战，尤其是那些需要长时间运行的系统。本文描述了自动化监控和修复基于ros的自主机器人软件的工作，该软件是为先验的部分已知和可能不正确的环境模型编写的。采用污点分析方法自动提取从输入主题到发布主题的数据流序列，并对该代码进行检测。计算了MDP效用的独特强化学习近似，这是对软件设计者固有目标的经验和非侵入性表征。通过比较设计(先验)实用程序和部署(已部署系统)实用程序，我们使用一个小但真实的ROS示例显示，可以监视性能标准并将违反标准的情况与软件的某些部分联系起来。然后使用自动软件修复技术对软件进行修补，并根据原始的离线实用程序进行评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE Security and Privacy Workshops (SPW)

自引率

0.00%

发文量