Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal Abstractions

IF 4.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Journal of Artificial Intelligence Research Pub Date : 2023-01-04 DOI:10.1613/jair.1.14253

Thom S. Badings, Licio Romao, A. Abate, D. Parker, Hasan A. Poonawala, M. Stoelinga, N. Jansen

{"title":"Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal Abstractions","authors":"Thom S. Badings, Licio Romao, A. Abate, D. Parker, Hasan A. Poonawala, M. Stoelinga, N. Jansen","doi":"10.1613/jair.1.14253","DOIUrl":null,"url":null,"abstract":"Controllers for dynamical systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modeled as process noise in a dynamical system, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a novel controller synthesis method that does not rely on any explicit representation of the noise distributions. In particular, we address the problem of computing a controller that provides probabilistic guarantees on safely reaching a target, while also avoiding unsafe regions of the state space. First, we abstract the continuous control system into a finite-state model that captures noise by probabilistic transitions between discrete states. As a key contribution, we adapt tools from the scenario approach to compute probably approximately correct (PAC) bounds on these transition probabilities, based on a finite number of samples of the noise. We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP). This iMDP is, with a user-specified confidence probability, robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples. We use state-of-the-art verification techniques to provide guarantees on the iMDP and compute a controller for which these guarantees carry over to the original control system. In addition, we develop a tailored computational scheme that reduces the complexity of the synthesis of these guarantees on the iMDP. Benchmarks on realistic control systems show the practical applicability of our method, even when the iMDP has hundreds of millions of transitions.","PeriodicalId":54877,"journal":{"name":"Journal of Artificial Intelligence Research","volume":"36 1","pages":"341-391"},"PeriodicalIF":4.5000,"publicationDate":"2023-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Artificial Intelligence Research","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1613/jair.1.14253","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 11

Abstract

Controllers for dynamical systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modeled as process noise in a dynamical system, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a novel controller synthesis method that does not rely on any explicit representation of the noise distributions. In particular, we address the problem of computing a controller that provides probabilistic guarantees on safely reaching a target, while also avoiding unsafe regions of the state space. First, we abstract the continuous control system into a finite-state model that captures noise by probabilistic transitions between discrete states. As a key contribution, we adapt tools from the scenario approach to compute probably approximately correct (PAC) bounds on these transition probabilities, based on a finite number of samples of the noise. We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP). This iMDP is, with a user-specified confidence probability, robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples. We use state-of-the-art verification techniques to provide guarantees on the iMDP and compute a controller for which these guarantees carry over to the original control system. In addition, we develop a tailored computational scheme that reduces the complexity of the synthesis of these guarantees on the iMDP. Benchmarks on realistic control systems show the practical applicability of our method, even when the iMDP has hundreds of millions of transitions.

查看原文本刊更多论文

基于形式抽象的非高斯噪声动力系统鲁棒控制

在安全关键环境下运行的动力系统的控制器必须考虑随机干扰。这种扰动通常被建模为动力系统中的过程噪声，通常假设底层分布是已知的和/或高斯分布。然而，在实践中，这些假设可能是不现实的，并且可能导致对真实噪声分布的较差近似。我们提出了一种新的控制器合成方法，它不依赖于噪声分布的任何显式表示。特别是，我们解决了计算控制器的问题，该控制器提供了安全到达目标的概率保证，同时也避免了状态空间的不安全区域。首先，我们将连续控制系统抽象为一个有限状态模型，该模型通过离散状态之间的概率转移来捕获噪声。作为一项关键贡献，我们采用场景方法中的工具，基于有限数量的噪声样本，计算这些转移概率的可能近似正确(PAC)界限。我们在所谓的区间马尔可夫决策过程(iMDP)的转移概率区间中捕获这些边界。该iMDP具有用户指定的置信概率，对过渡概率的不确定性具有鲁棒性，并且可以通过样本数量来控制概率区间的紧密性。我们使用最先进的验证技术在iMDP上提供保证，并计算一个控制器，这些保证将延续到原始控制系统。此外，我们还开发了一种定制的计算方案，降低了在iMDP上综合这些保证的复杂性。实际控制系统的基准测试表明，即使iMDP有数亿次过渡，我们的方法也具有实际的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Artificial Intelligence Research 工程技术-计算机：人工智能

CiteScore

9.60

自引率

4.00%

发文量

审稿时长

4 months

期刊介绍： JAIR(ISSN 1076 - 9757) covers all areas of artificial intelligence (AI), publishing refereed research articles, survey articles, and technical notes. Established in 1993 as one of the first electronic scientific journals, JAIR is indexed by INSPEC, Science Citation Index, and MathSciNet. JAIR reviews papers within approximately three months of submission and publishes accepted articles on the internet immediately upon receiving the final versions. JAIR articles are published for free distribution on the internet by the AI Access Foundation, and for purchase in bound volumes by AAAI Press.