不确定复杂非线性系统的在线后退地平线安全临界控制：一种生物脑启发的双重学习方法

IF 6.4 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Automation Science and Engineering Pub Date : 2025-03-07 DOI:10.1109/TASE.2025.3567610

Shawon Dey;Hao Xu

{"title":"不确定复杂非线性系统的在线后退地平线安全临界控制：一种生物脑启发的双重学习方法","authors":"Shawon Dey;Hao Xu","doi":"10.1109/TASE.2025.3567610","DOIUrl":null,"url":null,"abstract":"This research introduces a real-time, computationally efficient receding horizon control (RHC) framework with adaptive safety mechanisms under uncertainty. Achieving real-time optimal performance and safety with uncertain nonlinear systems and environments is a challenging task due to the heavy computational complexity involved in learning optimal solutions and managing system uncertainties. To address this, a novel RHC-based safe critical mechanism is designed, enhancing the classical RHC by integrating performance with a real-time safety framework that adapts to environmental uncertainties. Particularly, a human brain-inspired dual-learning algorithm is developed with a slow learning phase that gradually learns the system and environmental uncertainties and further achieves the optimal RHC solution by employing a situation-aware physics-informed neural network (SA-PINN). A fast-learning mechanism is then developed to maintain system safety with quick decision-making capabilities, utilizing a fast-learned adaptive control barrier (FLA-CBF) function that incorporates an adaptive bound relaxation. This approach enables the slow-learning phase to gradually learn the optimal solutions and adapt to uncertainties, thus enhancing the fast-learning phase, which supports real-time control execution with guaranteed safety. In addition, Lyapunov stability analysis and a series of experimental results have been provided to showcase the efficacy of the developed algorithm. Note to Practitioners—This paper investigates the critical challenge of ensuring safety and performance in real-time control of complex nonlinear systems with uncertainties, emphasizing the need to balance control effectiveness and safety in dynamic real-world environments. The approach detailed in the paper has potential applications across various fields that involve dynamic systems needing robust safe control and performance under uncertainty, like robotics, autonomous vehicles, and aerospace engineering. A key potential of this research lies in its novel dual-learning algorithm, which mimics human cognitive processes to balance safety and performance dynamically. This is particularly advantageous in environments where fast safety decisions are needed, and system responsiveness is critical. Here, the optimal control and model learning from the slow learning part are transferred to the fast learning phase, which quickly adapts to environmental changes using a modified CBF, known as FLA-CBF, with an initial strict safety threshold that is gradually relaxed. However, the selection of this bound threshold depends on the environment and requires further exploration. Also, the computation intensity of the algorithm needs to be improved for real-time application, especially in scenarios involving multiple agents.","PeriodicalId":51060,"journal":{"name":"IEEE Transactions on Automation Science and Engineering","volume":"22 ","pages":"16185-16200"},"PeriodicalIF":6.4000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online Receding Horizon Safe-Critical Control for Complex Nonlinear System Under Uncertainty: A Biological Brain-Inspired Dual Learning Approach\",\"authors\":\"Shawon Dey;Hao Xu\",\"doi\":\"10.1109/TASE.2025.3567610\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research introduces a real-time, computationally efficient receding horizon control (RHC) framework with adaptive safety mechanisms under uncertainty. Achieving real-time optimal performance and safety with uncertain nonlinear systems and environments is a challenging task due to the heavy computational complexity involved in learning optimal solutions and managing system uncertainties. To address this, a novel RHC-based safe critical mechanism is designed, enhancing the classical RHC by integrating performance with a real-time safety framework that adapts to environmental uncertainties. Particularly, a human brain-inspired dual-learning algorithm is developed with a slow learning phase that gradually learns the system and environmental uncertainties and further achieves the optimal RHC solution by employing a situation-aware physics-informed neural network (SA-PINN). A fast-learning mechanism is then developed to maintain system safety with quick decision-making capabilities, utilizing a fast-learned adaptive control barrier (FLA-CBF) function that incorporates an adaptive bound relaxation. This approach enables the slow-learning phase to gradually learn the optimal solutions and adapt to uncertainties, thus enhancing the fast-learning phase, which supports real-time control execution with guaranteed safety. In addition, Lyapunov stability analysis and a series of experimental results have been provided to showcase the efficacy of the developed algorithm. Note to Practitioners—This paper investigates the critical challenge of ensuring safety and performance in real-time control of complex nonlinear systems with uncertainties, emphasizing the need to balance control effectiveness and safety in dynamic real-world environments. The approach detailed in the paper has potential applications across various fields that involve dynamic systems needing robust safe control and performance under uncertainty, like robotics, autonomous vehicles, and aerospace engineering. A key potential of this research lies in its novel dual-learning algorithm, which mimics human cognitive processes to balance safety and performance dynamically. This is particularly advantageous in environments where fast safety decisions are needed, and system responsiveness is critical. Here, the optimal control and model learning from the slow learning part are transferred to the fast learning phase, which quickly adapts to environmental changes using a modified CBF, known as FLA-CBF, with an initial strict safety threshold that is gradually relaxed. However, the selection of this bound threshold depends on the environment and requires further exploration. Also, the computation intensity of the algorithm needs to be improved for real-time application, especially in scenarios involving multiple agents.\",\"PeriodicalId\":51060,\"journal\":{\"name\":\"IEEE Transactions on Automation Science and Engineering\",\"volume\":\"22 \",\"pages\":\"16185-16200\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Automation Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10990160/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Automation Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10990160/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

本文介绍了一种具有不确定性下自适应安全机制的实时、计算效率高的后退地平线控制（RHC）框架。在不确定的非线性系统和环境中实现实时最优性能和安全性是一项具有挑战性的任务，因为学习最优解和管理系统不确定性涉及大量的计算复杂度。为了解决这个问题，设计了一种新的基于RHC的安全关键机制，通过将性能与适应环境不确定性的实时安全框架相结合，增强了传统的RHC。特别地，开发了一种人脑启发的双学习算法，该算法具有缓慢学习阶段，逐步学习系统和环境的不确定性，并通过采用态势感知物理信息神经网络（SA-PINN）进一步实现最优RHC解决方案。然后开发了一种快速学习机制，利用包含自适应约束松弛的快速学习自适应控制屏障（FLA-CBF）函数，通过快速决策能力来维持系统安全性。该方法使慢学习阶段逐渐学习最优解并适应不确定性，从而增强了快速学习阶段，支持实时控制执行并保证安全。此外，还提供了Lyapunov稳定性分析和一系列实验结果来展示所开发算法的有效性。本文研究了在具有不确定性的复杂非线性系统的实时控制中确保安全性和性能的关键挑战，强调了在动态现实环境中平衡控制有效性和安全性的必要性。论文中详细介绍的方法在各个领域都有潜在的应用，这些领域涉及需要鲁棒安全控制和不确定性下性能的动态系统，如机器人、自动驾驶汽车和航空航天工程。这项研究的一个关键潜力在于其新颖的双重学习算法，该算法模拟人类的认知过程，以动态地平衡安全性和性能。这在需要快速安全决策和系统响应能力至关重要的环境中尤其有利。在这里，最优控制和模型学习从慢学习部分转移到快速学习阶段，该阶段使用改进的CBF（称为FLA-CBF）快速适应环境变化，初始具有严格的安全阈值，逐渐放宽。然而，这个边界阈值的选择取决于环境，需要进一步探索。同时，为了实现实时应用，特别是在涉及多个智能体的场景下，算法的计算强度需要提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Online Receding Horizon Safe-Critical Control for Complex Nonlinear System Under Uncertainty: A Biological Brain-Inspired Dual Learning Approach

This research introduces a real-time, computationally efficient receding horizon control (RHC) framework with adaptive safety mechanisms under uncertainty. Achieving real-time optimal performance and safety with uncertain nonlinear systems and environments is a challenging task due to the heavy computational complexity involved in learning optimal solutions and managing system uncertainties. To address this, a novel RHC-based safe critical mechanism is designed, enhancing the classical RHC by integrating performance with a real-time safety framework that adapts to environmental uncertainties. Particularly, a human brain-inspired dual-learning algorithm is developed with a slow learning phase that gradually learns the system and environmental uncertainties and further achieves the optimal RHC solution by employing a situation-aware physics-informed neural network (SA-PINN). A fast-learning mechanism is then developed to maintain system safety with quick decision-making capabilities, utilizing a fast-learned adaptive control barrier (FLA-CBF) function that incorporates an adaptive bound relaxation. This approach enables the slow-learning phase to gradually learn the optimal solutions and adapt to uncertainties, thus enhancing the fast-learning phase, which supports real-time control execution with guaranteed safety. In addition, Lyapunov stability analysis and a series of experimental results have been provided to showcase the efficacy of the developed algorithm. Note to Practitioners—This paper investigates the critical challenge of ensuring safety and performance in real-time control of complex nonlinear systems with uncertainties, emphasizing the need to balance control effectiveness and safety in dynamic real-world environments. The approach detailed in the paper has potential applications across various fields that involve dynamic systems needing robust safe control and performance under uncertainty, like robotics, autonomous vehicles, and aerospace engineering. A key potential of this research lies in its novel dual-learning algorithm, which mimics human cognitive processes to balance safety and performance dynamically. This is particularly advantageous in environments where fast safety decisions are needed, and system responsiveness is critical. Here, the optimal control and model learning from the slow learning part are transferred to the fast learning phase, which quickly adapts to environmental changes using a modified CBF, known as FLA-CBF, with an initial strict safety threshold that is gradually relaxed. However, the selection of this bound threshold depends on the environment and requires further exploration. Also, the computation intensity of the algorithm needs to be improved for real-time application, especially in scenarios involving multiple agents.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Automation Science and Engineering 工程技术-自动化与控制系统

CiteScore

12.50

自引率

14.30%

发文量

404

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Automation Science and Engineering (T-ASE) publishes fundamental papers on Automation, emphasizing scientific results that advance efficiency, quality, productivity, and reliability. T-ASE encourages interdisciplinary approaches from computer science, control systems, electrical engineering, mathematics, mechanical engineering, operations research, and other fields. T-ASE welcomes results relevant to industries such as agriculture, biotechnology, healthcare, home automation, maintenance, manufacturing, pharmaceuticals, retail, security, service, supply chains, and transportation. T-ASE addresses a research community willing to integrate knowledge across disciplines and industries. For this purpose, each paper includes a Note to Practitioners that summarizes how its results can be applied or how they might be extended to apply in practice.