Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators

IF 6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neural Networks Pub Date : 2024-11-06 DOI:10.1016/j.neunet.2024.106880

Bo Dong, Xinye Zhu, Tianjiao An, Hucheng Jiang, Bing Ma

{"title":"Barrier-critic-disturbance approximate optimal control of nonzero-sum differential games for modular robot manipulators","authors":"Bo Dong, Xinye Zhu, Tianjiao An, Hucheng Jiang, Bing Ma","doi":"10.1016/j.neunet.2024.106880","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, for addressing the safe control problem of modular robot manipulators (MRMs) system with uncertain disturbances, an approximate optimal control scheme of nonzero-sum (NZS) differential games is proposed based on the control barrier function (CBF). The dynamic model of the manipulator system integrates joint subsystems through the utilization of joint torque feedback (JTF) technique, incorporating interconnected dynamic coupling (IDC) effects. By integrating the cost functions relevant to each player with the CBF, the evolution of system states is ensured to remain within the safe region. Subsequently, the optimal tracking control problem for the MRM system is reformulated as an NZS game involving multiple joint subsystems. Based on the adaptive dynamic programming (ADP) algorithm, a cost function approximator for solving Hamilton–Jacobi (HJ) equation using only critic neural networks (NN) is established, which promotes the feasible derivation of the approximate optimal control strategy. The Lyapunov theory is utilized to demonstrate that the tracking error is uniformly ultimately bounded (UUB). Utilizing the CBF’s state constraint mechanism prevents the robot from deviating from the safe region, and the application of the NZS game approach ensures that the subsystems of the MRM reach a Nash equilibrium. The proposed control method effectively addresses the problem of safe and approximate optimal control of MRM system under uncertain disturbances. Finally, the effectiveness and superiority of the proposed method are verified through simulations and experiments.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"181 ","pages":"Article 106880"},"PeriodicalIF":6.0000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608024008098","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, for addressing the safe control problem of modular robot manipulators (MRMs) system with uncertain disturbances, an approximate optimal control scheme of nonzero-sum (NZS) differential games is proposed based on the control barrier function (CBF). The dynamic model of the manipulator system integrates joint subsystems through the utilization of joint torque feedback (JTF) technique, incorporating interconnected dynamic coupling (IDC) effects. By integrating the cost functions relevant to each player with the CBF, the evolution of system states is ensured to remain within the safe region. Subsequently, the optimal tracking control problem for the MRM system is reformulated as an NZS game involving multiple joint subsystems. Based on the adaptive dynamic programming (ADP) algorithm, a cost function approximator for solving Hamilton–Jacobi (HJ) equation using only critic neural networks (NN) is established, which promotes the feasible derivation of the approximate optimal control strategy. The Lyapunov theory is utilized to demonstrate that the tracking error is uniformly ultimately bounded (UUB). Utilizing the CBF’s state constraint mechanism prevents the robot from deviating from the safe region, and the application of the NZS game approach ensures that the subsystems of the MRM reach a Nash equilibrium. The proposed control method effectively addresses the problem of safe and approximate optimal control of MRM system under uncertain disturbances. Finally, the effectiveness and superiority of the proposed method are verified through simulations and experiments.

查看原文本刊更多论文

模块化机器人操纵器的非零和微分博弈的障碍批判-扰动近似最优控制。

本文针对具有不确定干扰的模块化机器人机械手（MRMs）系统的安全控制问题，提出了一种基于控制障碍函数（CBF）的非零和（NZS）微分博弈近似最优控制方案。操纵器系统的动态模型通过利用关节扭矩反馈（JTF）技术，结合互连动态耦合（IDC）效应，集成了关节子系统。通过将与每个参与者相关的成本函数与 CBF 相结合，可确保系统状态的演变保持在安全区域内。随后，MRM 系统的最优跟踪控制问题被重新表述为涉及多个联合子系统的 NZS 博弈。基于自适应动态编程（ADP）算法，建立了仅使用批判神经网络（NN）求解汉密尔顿-雅可比（HJ）方程的代价函数近似器，从而促进了近似最优控制策略的可行推导。利用 Lyapunov 理论证明了跟踪误差是均匀最终有界的（UUB）。利用 CBF 的状态约束机制可防止机器人偏离安全区域，而 NZS 博弈方法的应用可确保 MRM 子系统达到纳什均衡。所提出的控制方法有效地解决了不确定干扰下 MRM 系统的安全近似最优控制问题。最后，通过仿真和实验验证了所提方法的有效性和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Networks 工程技术-计算机：人工智能

CiteScore

13.90

自引率

7.70%

发文量

425

审稿时长

67 days

期刊介绍： Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.