{"title":"基于强化学习的机械臂故障有限时间容错控制","authors":"Pengxin Yang;Shuang Zhang;Xinbo Yu;Wei He","doi":"10.1109/TCYB.2025.3557681","DOIUrl":null,"url":null,"abstract":"This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2621-2632"},"PeriodicalIF":9.4000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults\",\"authors\":\"Pengxin Yang;Shuang Zhang;Xinbo Yu;Wei He\",\"doi\":\"10.1109/TCYB.2025.3557681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.\",\"PeriodicalId\":13112,\"journal\":{\"name\":\"IEEE Transactions on Cybernetics\",\"volume\":\"55 6\",\"pages\":\"2621-2632\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cybernetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10976349/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976349/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults
This study introduces a novel finite time fault tolerant controller integrating nonsingular terminal sliding mode (NTSM) and reinforcement learning (RL) strategies for manipulator systems with actuator faults. Leveraging an actor-critic network architecture, the RL algorithm facilitates the computation of the cost function and the approximation of unknown nonlinear dynamics. The inherent properties of NTSM mitigate the effects of parameter uncertainties, thereby enhancing system robustness. Furthermore, an adaptive law is crafted to counteract the deleterious effects of actuator faults. Through the direct Lyapunov function approach, it is demonstrated that the closed-loop system achieves semi-global practical finite-time stability. This control strategy diminishes the dependence on precise model accuracy and augments the system’s fault tolerance. The viability of the proposed algorithm is corroborated by simulation results, and its efficacy is further validated through experiments conducted on the 6-DOF Kinova Jaco 2 platform.
期刊介绍:
The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.