Safe tracking in games: Achieving optimal control with unknown dynamics and constraints

IF 2.7 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Asian Journal of Control Pub Date : 2024-05-01 DOI:10.1002/asjc.3397

Xiaohong Cui, Wenjie Chen, Binrui Wang, Kun Zhou

引用次数: 0

Abstract

This paper introduces mix-zero-sum differential (MZSD) game theory to address multi-player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off-policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.

查看原文本刊更多论文

游戏中的安全追踪：在未知动态和约束条件下实现最优控制

本文将混合零和差分博弈论（MZSD）引入多玩家跟踪系统，从而更好地理解玩家之间合作与竞争并存的关系。在此框架内，我们提出了一种最优安全跟踪控制（OSTC）方法，该方法在值函数中加入了控制障碍函数（CBF），以确保跟踪误差保持在指定范围内，从而在实现优化的同时保证安全性。同时，为了消除对系统动力学的需求，我们提出了一种利用非政策积分强化学习（IRL）技术的新方法，以获得 MZSD 博弈的纳什均衡解。我们建立了一种独特的批评者-行动者神经网络（NN）结构，可同时更新。此外，我们还利用 Lyapunov 方法分析了稳定性和收敛性。我们进行了两次模拟，以证明所提算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Asian Journal of Control 工程技术-自动化与控制系统

CiteScore

4.80

自引率

25.00%

发文量

253

审稿时长

7.2 months

期刊介绍： The Asian Journal of Control, an Asian Control Association (ACA) and Chinese Automatic Control Society (CACS) affiliated journal, is the first international journal originating from the Asia Pacific region. The Asian Journal of Control publishes papers on original theoretical and practical research and developments in the areas of control, involving all facets of control theory and its application. Published six times a year, the Journal aims to be a key platform for control communities throughout the world. The Journal provides a forum where control researchers and practitioners can exchange knowledge and experiences on the latest advances in the control areas, and plays an educational role for students and experienced researchers in other disciplines interested in this continually growing field. The scope of the journal is extensive. Topics include: The theory and design of control systems and components, encompassing: Robust and distributed control using geometric, optimal, stochastic and nonlinear methods Game theory and state estimation Adaptive control, including neural networks, learning, parameter estimation and system fault detection Artificial intelligence, fuzzy and expert systems Hierarchical and man-machine systems All parts of systems engineering which consider the reliability of components and systems Emerging application areas, such as: Robotics Mechatronics Computers for computer-aided design, manufacturing, and control of various industrial processes Space vehicles and aircraft, ships, and traffic Biomedical systems National economies Power systems Agriculture Natural resources.