Towards online reachability analysis with temporal-differencing

Proceedings of the 18th International Conference on Hybrid Systems: Computation and Control Pub Date : 2015-04-14 DOI:10.1145/2728606.2728642

Anayo K. Akametalu, C. Tomlin

引用次数: 0

Abstract

Hamilton-Jacobi-Isaacs (HJI) reachability analysis has been employed to guarantee constraint satisfaction (safety) in a number of applications including robotics, air traffic control, and control of HVAC systems. However, the current standard for these methods can result in overly-conservative controllers that can degrade system performance with respect to lower priority objectives. There has been interest in incorporating online machine learning techniques to reduce the conservativeness of this approach. However, recent efforts have resulted in methods that are computationally inefficient and scale poorly with the dimension of the state space. We explore a novel online reachability update algorithm based on temporal-difference learning that is computationally more efficient than current methods. Our algorithm is demonstrated on a simulation of a quadrotor learning to track a trajectory in a confined space and a reach-avoid/pursuit-evader game.

查看原文本刊更多论文

基于时间差的在线可达性分析

Hamilton-Jacobi-Isaacs (HJI)可达性分析已在机器人、空中交通管制和HVAC系统控制等许多应用中用于保证约束满足(安全性)。然而，这些方法的当前标准可能导致过于保守的控制器，这可能会降低相对于较低优先级目标的系统性能。人们对结合在线机器学习技术来减少这种方法的保守性很感兴趣。然而，最近的努力导致了计算效率低下的方法，并且随着状态空间的维度的扩展很差。我们探索了一种新的基于时间差学习的在线可达性更新算法，该算法在计算上比现有方法更高效。我们的算法在一个模拟的四旋翼飞行器学习跟踪轨迹在一个有限的空间和到达-避免/追击-逃避游戏中进行了演示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 18th International Conference on Hybrid Systems: Computation and Control

自引率

0.00%

发文量