Safe Explainable Agents for Autonomous Navigation using Evolving Behavior Trees

Nicholas Potteiger, X. Koutsoukos
{"title":"Safe Explainable Agents for Autonomous Navigation using Evolving Behavior Trees","authors":"Nicholas Potteiger, X. Koutsoukos","doi":"10.1109/ICAA58325.2023.00014","DOIUrl":null,"url":null,"abstract":"Machine learning and reinforcement learning are increasingly used to solve complex tasks in autonomous systems. However, autonomous agents represented by large neural networks are not transparent leading to their assurability and trustworthiness becoming critical challenges. Large models also result in a lack of interpretability which causes severe obstacles related to trust in autonomous agents and human-machine teaming. In this paper, we leverage the hierarchical structure of behavior trees and hierarchical reinforcement learning to develop a neurosymbolic model architecture for autonomous agents. The proposed model, referred to as Evolving Behavior Trees (EBTs), integrates the required components to represent the learning tasks as well as the switching between tasks to achieve complex long-term goals. We design an agent for autonomous navigation and we evaluate the approach against a state-of-the-art hierarchical reinforcement learning method using a Maze Simulation Environment. The results show autonomous agents represented by EBTs can be trained efficiently. The approach incorporates explicit safety constraints into the model and incurs significantly fewer safety violations during training and execution. Further, the model provides explanations for the behavior of the autonomous agent by associating the state of the executing EBT with agent actions.","PeriodicalId":190198,"journal":{"name":"2023 IEEE International Conference on Assured Autonomy (ICAA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Assured Autonomy (ICAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAA58325.2023.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Machine learning and reinforcement learning are increasingly used to solve complex tasks in autonomous systems. However, autonomous agents represented by large neural networks are not transparent, which makes their assurability and trustworthiness critical challenges. Large models also lack interpretability, creating severe obstacles to trust in autonomous agents and to human-machine teaming. In this paper, we leverage the hierarchical structure of behavior trees and hierarchical reinforcement learning to develop a neurosymbolic model architecture for autonomous agents. The proposed model, referred to as Evolving Behavior Trees (EBTs), integrates the components required to represent the learning tasks as well as the switching between tasks needed to achieve complex long-term goals. We design an agent for autonomous navigation and evaluate the approach against a state-of-the-art hierarchical reinforcement learning method in a Maze Simulation Environment. The results show that autonomous agents represented by EBTs can be trained efficiently. The approach incorporates explicit safety constraints into the model and incurs significantly fewer safety violations during training and execution. Further, the model provides explanations for the behavior of the autonomous agent by associating the state of the executing EBT with agent actions.
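The abstract describes EBTs only at a high level. The sketch below is a minimal, hypothetical illustration of the kind of structure involved, not the authors' implementation: a behavior tree in which leaf tasks stand in for learned sub-policies, an explicit safety condition gates the navigation task, and a trace of the executing node is recorded so each action can be tied back to a tree state. All class names, node names, and state keys here are assumptions made for illustration.

```python
# A minimal, hypothetical sketch of a behavior tree with learned-policy leaves,
# an explicit safety condition, and an execution trace for explanations.
# Not the authors' EBT implementation; names and state keys are illustrative.
from dataclasses import dataclass, field
from typing import Callable, List

SUCCESS, FAILURE, RUNNING = "SUCCESS", "FAILURE", "RUNNING"


@dataclass
class Node:
    """Base behavior-tree node; tick() returns SUCCESS, FAILURE, or RUNNING."""
    name: str

    def tick(self, state: dict, trace: List[str]) -> str:
        raise NotImplementedError


@dataclass
class Condition(Node):
    """Symbolic check, e.g. an explicit safety constraint."""
    check: Callable[[dict], bool]

    def tick(self, state, trace):
        status = SUCCESS if self.check(state) else FAILURE
        trace.append(f"{self.name}: {status}")
        return status


@dataclass
class PolicyTask(Node):
    """Leaf task; in an EBT-style design this would wrap a learned sub-policy."""
    policy: Callable[[dict], str]  # stand-in for a trained RL policy: state -> action
    done: Callable[[dict], bool]

    def tick(self, state, trace):
        if self.done(state):
            trace.append(f"{self.name}: SUCCESS")
            return SUCCESS
        state["action"] = self.policy(state)
        trace.append(f"{self.name}: RUNNING -> action={state['action']}")
        return RUNNING


@dataclass
class Sequence(Node):
    """Runs children in order; stops as soon as one fails or is still running."""
    children: List[Node] = field(default_factory=list)

    def tick(self, state, trace):
        for child in self.children:
            status = child.tick(state, trace)
            if status != SUCCESS:
                return status
        return SUCCESS


@dataclass
class Selector(Node):
    """Tries children in order until one does not fail (fallback behavior)."""
    children: List[Node] = field(default_factory=list)

    def tick(self, state, trace):
        for child in self.children:
            status = child.tick(state, trace)
            if status != FAILURE:
                return status
        return FAILURE


# Hypothetical navigation tree: a safety condition gates a learned "navigate"
# task, with a hand-written stop behavior as the fallback.
tree = Selector("root", children=[
    Sequence("safe_navigation", children=[
        Condition("obstacle_clear", check=lambda s: s["distance_to_obstacle"] > 1.0),
        PolicyTask("navigate_to_goal",
                   policy=lambda s: "move_forward",  # placeholder for a learned sub-policy
                   done=lambda s: s["at_goal"]),
    ]),
    PolicyTask("emergency_stop",
               policy=lambda s: "stop",
               done=lambda s: s["distance_to_obstacle"] > 1.0),
])

state = {"distance_to_obstacle": 2.5, "at_goal": False}
trace: List[str] = []
tree.tick(state, trace)
print("\n".join(trace))  # the trace links the executing node to the chosen action
```

In a design of this flavor, the symbolic tree handles task switching and safety gating while learned policies handle low-level control, and the recorded trace is what associates the state of the executing tree with the agent's actions, which is the style of explanation the abstract refers to.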