{"title":"Sampling-based stochastic optimal control with metric interval temporal logic specifications","authors":"Felipe J. Montana, Jun Liu, T. Dodd","doi":"10.1109/CCA.2016.7587911","DOIUrl":null,"url":null,"abstract":"This paper describes a method to find optimal policies for stochastic dynamic systems that maximise the probability of satisfying real-time properties. The method consists of two phases. In the first phase, a coarse abstraction of the original system is created. In each region of the abstraction, a sampling-based algorithm is utilised to compute local policies that allow the system to move between regions. Then, in the second phase, the selection of a policy in each region is obtained by solving a reachability problem on the Cartesian product between the abstraction and a timed automaton representing a real-time specification given as a metric interval temporal logic formula. In contrast to current methods that require a fine abstraction, the proposed method achieves computational tractability by modelling the coarse abstraction of the system as a bounded-parameter Markov decision process (BMDP). Moreover, once the BMDP is created, this can be reused for new specifications assuming the same stochastic system and workspace. The method is demonstrated with an autonomous driving example.","PeriodicalId":301744,"journal":{"name":"2016 IEEE Conference on Control Applications (CCA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Conference on Control Applications (CCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCA.2016.7587911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 6
Abstract
This paper describes a method to find optimal policies for stochastic dynamic systems that maximise the probability of satisfying real-time properties. The method consists of two phases. In the first phase, a coarse abstraction of the original system is created. In each region of the abstraction, a sampling-based algorithm is utilised to compute local policies that allow the system to move between regions. Then, in the second phase, the policy for each region is selected by solving a reachability problem on the Cartesian product of the abstraction and a timed automaton representing a real-time specification given as a metric interval temporal logic formula. In contrast to existing methods that require a fine abstraction, the proposed method achieves computational tractability by modelling the coarse abstraction of the system as a bounded-parameter Markov decision process (BMDP). Moreover, once the BMDP is created, it can be reused for new specifications, provided the stochastic system and workspace remain the same. The method is demonstrated with an autonomous driving example.
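To make the BMDP reachability step concrete, below is a minimal, hypothetical sketch (not the paper's implementation) of pessimistic interval value iteration on a toy BMDP: each action carries transition-probability intervals, and the iteration maximises a lower bound on the probability of reaching a goal set, in the spirit of the reachability problem solved in the second phase. All names and the toy model are illustrative assumptions.

```python
# Illustrative sketch, assuming a BMDP given as transition-probability
# intervals; not the authors' algorithm or code.

def worst_case_expectation(intervals, V):
    """Adversarial assignment over probability intervals: start every
    successor at its lower bound p_lo, then let the adversary push the
    remaining mass (up to p_hi) onto low-value successors first."""
    items = sorted(intervals, key=lambda t: V[t[0]])  # ascending by value
    slack = 1.0 - sum(p_lo for _, p_lo, _ in items)   # mass left to place
    exp = 0.0
    for s_next, p_lo, p_hi in items:
        extra = min(slack, p_hi - p_lo)
        exp += (p_lo + extra) * V[s_next]
        slack -= extra
    return exp

def pessimistic_value_iteration(states, actions, trans, goal, eps=1e-6):
    """trans[s][a] is a list of (next_state, p_lo, p_hi) tuples.
    Returns, per state, a lower bound on the maximal probability of
    reaching the goal set under any admissible BMDP realisation."""
    V = {s: (1.0 if s in goal else 0.0) for s in states}
    while True:
        delta = 0.0
        for s in states:
            if s in goal:
                continue
            best = max(worst_case_expectation(trans[s][a], V)
                       for a in actions(s))
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < eps:
            return V

# Toy usage: two regions and a goal region, one local policy each.
S = ["s0", "s1", "goal"]
T = {"s0": {"go": [("s1", 0.6, 0.8), ("s0", 0.2, 0.4)]},
     "s1": {"go": [("goal", 0.7, 0.9), ("s0", 0.1, 0.3)]}}
V = pessimistic_value_iteration(S, lambda s: T[s].keys(), T, goal={"goal"})
print(V)  # lower-bound reachability probabilities per state
```

The sorted, order-based mass assignment gives the exact worst-case expectation over the interval ambiguity set in O(n log n) per state-action pair, which is what keeps interval value iteration on the coarse abstraction tractable compared with a fine single-valued abstraction.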