基于强化学习的机械臂故障有限时间容错控制

IF 9.4 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Cybernetics Pub Date : 2025-04-24 DOI:10.1109/TCYB.2025.3557681

Pengxin Yang;Shuang Zhang;Xinbo Yu;Wei He

{"title":"基于强化学习的机械臂故障有限时间容错控制","authors":"Pengxin Yang;Shuang Zhang;Xinbo Yu;Wei He","doi":"10.1109/TCYB.2025.3557681","DOIUrl":null,"url":null,"abstract":"This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2621-2632"},"PeriodicalIF":9.4000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults\",\"authors\":\"Pengxin Yang;Shuang Zhang;Xinbo Yu;Wei He\",\"doi\":\"10.1109/TCYB.2025.3557681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.\",\"PeriodicalId\":13112,\"journal\":{\"name\":\"IEEE Transactions on Cybernetics\",\"volume\":\"55 6\",\"pages\":\"2621-2632\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cybernetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10976349/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976349/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

针对具有执行器故障的机械臂系统，提出了一种集成非奇异终端滑模（NTSM）和强化学习（RL）策略的有限时间容错控制器。利用参与者-评论家网络架构，RL算法简化了成本函数的计算和未知非线性动力学的近似。NTSM的固有特性减轻了参数不确定性的影响，从而增强了系统的鲁棒性。此外，设计了自适应律来抵消执行器故障的有害影响。通过直接Lyapunov函数方法，证明了闭环系统实现了半全局实用有限时间稳定性。该控制策略降低了对精确模型精度的依赖，提高了系统的容错能力。仿真结果验证了算法的可行性，并在6自由度Kinova Jaco 2平台上进行了实验，进一步验证了算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults

This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Cybernetics COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS

CiteScore

25.40

自引率

11.00%

发文量

1869

期刊介绍： The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.