Towards online reachability analysis with temporal-differencing

Anayo K. Akametalu, C. Tomlin
{"title":"Towards online reachability analysis with temporal-differencing","authors":"Anayo K. Akametalu, C. Tomlin","doi":"10.1145/2728606.2728642","DOIUrl":null,"url":null,"abstract":"Hamilton-Jacobi-Isaacs (HJI) reachability analysis has been employed to guarantee constraint satisfaction (safety) in a number of applications including robotics, air traffic control, and control of HVAC systems. However, the current standard for these methods can result in overly-conservative controllers that can degrade system performance with respect to lower priority objectives. There has been interest in incorporating online machine learning techniques to reduce the conservativeness of this approach. However, recent efforts have resulted in methods that are computationally inefficient and scale poorly with the dimension of the state space. We explore a novel online reachability update algorithm based on temporal-difference learning that is computationally more efficient than current methods. Our algorithm is demonstrated on a simulation of a quadrotor learning to track a trajectory in a confined space and a reach-avoid/pursuit-evader game.","PeriodicalId":377654,"journal":{"name":"Proceedings of the 18th International Conference on Hybrid Systems: Computation and Control","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 18th International Conference on Hybrid Systems: Computation and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2728606.2728642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Hamilton-Jacobi-Isaacs (HJI) reachability analysis has been employed to guarantee constraint satisfaction (safety) in a number of applications including robotics, air traffic control, and control of HVAC systems. However, the current standard for these methods can result in overly-conservative controllers that can degrade system performance with respect to lower priority objectives. There has been interest in incorporating online machine learning techniques to reduce the conservativeness of this approach. However, recent efforts have resulted in methods that are computationally inefficient and scale poorly with the dimension of the state space. We explore a novel online reachability update algorithm based on temporal-difference learning that is computationally more efficient than current methods. Our algorithm is demonstrated on a simulation of a quadrotor learning to track a trajectory in a confined space and a reach-avoid/pursuit-evader game.
基于时间差的在线可达性分析
Hamilton-Jacobi-Isaacs (HJI)可达性分析已在机器人、空中交通管制和HVAC系统控制等许多应用中用于保证约束满足(安全性)。然而,这些方法的当前标准可能导致过于保守的控制器,这可能会降低相对于较低优先级目标的系统性能。人们对结合在线机器学习技术来减少这种方法的保守性很感兴趣。然而,最近的努力导致了计算效率低下的方法,并且随着状态空间的维度的扩展很差。我们探索了一种新的基于时间差学习的在线可达性更新算法,该算法在计算上比现有方法更高效。我们的算法在一个模拟的四旋翼飞行器学习跟踪轨迹在一个有限的空间和到达-避免/追击-逃避游戏中进行了演示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信