Observer-Based Adaptive Robust Actor–Critic Learning Saturated PID Controller for a Class of Euler–Lagrange Robotic Systems With Guaranteed Performance: Theory and Practice
{"title":"Observer-Based Adaptive Robust Actor–Critic Learning Saturated PID Controller for a Class of Euler–Lagrange Robotic Systems With Guaranteed Performance: Theory and Practice","authors":"Omid Elhaki;Khoshnam Shojaei;Abbas Chatraei;Allahyar Montazeri","doi":"10.1109/TSMC.2024.3506695","DOIUrl":null,"url":null,"abstract":"This article addresses the output-feedback reinforcement learning (RL)-based saturated proportional-integral-derivative (PID) control design for fully actuated Euler-Lagrange (EL) systems which are uncertain subject to actuator saturation with prescribed performance. It is assumed that the actuator input nonlinearity, uncertain nonlinearities and unmeasurable external disturbances have a significant impact on the system. The presence of actuator saturation and complex uncertainties may inevitably give rise to the breakdown of the EL control system. The lack of prior knowledge of the system dynamics renders the presented technique to achieve a robust prescribed tracking performance without using velocity sensors. To conquer mentioned obstacles, a novel RL saturated PID controller, which is not dependent on the system’s dynamics and only requires measurable output signals is designed via actor-critic structure to deeply estimate and compensate complex unknowns. An adaptive robust controller is used to reduce external disturbances effects adaptively. The prescribed performance funnel control way is considered to guarantee predetermined output constraints. The high-gain observer (HGO) is used to estimate velocities and derivatives free of system dynamics, and generalized saturation functions are utilized to efficiently decrease actuator saturation danger. It is proved that suggested technique ensures a robust prescribed performance with input constraints in the absence of velocity sensors and the existence of considerable complicated model uncertainties. A semi-global uniform ultimate boundedness (SGUUB) stability for tracking deviation errors and state estimation deviation is ensured through a Lyapunov stability study. Finally, experimental results on a real robotic arm is carried out to further demonstrate the effectiveness of all theoretical findings.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 2","pages":"1400-1412"},"PeriodicalIF":8.6000,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10783087/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This article addresses the output-feedback reinforcement learning (RL)-based saturated proportional-integral-derivative (PID) control design for fully actuated Euler-Lagrange (EL) systems which are uncertain subject to actuator saturation with prescribed performance. It is assumed that the actuator input nonlinearity, uncertain nonlinearities and unmeasurable external disturbances have a significant impact on the system. The presence of actuator saturation and complex uncertainties may inevitably give rise to the breakdown of the EL control system. The lack of prior knowledge of the system dynamics renders the presented technique to achieve a robust prescribed tracking performance without using velocity sensors. To conquer mentioned obstacles, a novel RL saturated PID controller, which is not dependent on the system’s dynamics and only requires measurable output signals is designed via actor-critic structure to deeply estimate and compensate complex unknowns. An adaptive robust controller is used to reduce external disturbances effects adaptively. The prescribed performance funnel control way is considered to guarantee predetermined output constraints. The high-gain observer (HGO) is used to estimate velocities and derivatives free of system dynamics, and generalized saturation functions are utilized to efficiently decrease actuator saturation danger. It is proved that suggested technique ensures a robust prescribed performance with input constraints in the absence of velocity sensors and the existence of considerable complicated model uncertainties. A semi-global uniform ultimate boundedness (SGUUB) stability for tracking deviation errors and state estimation deviation is ensured through a Lyapunov stability study. Finally, experimental results on a real robotic arm is carried out to further demonstrate the effectiveness of all theoretical findings.
期刊介绍:
The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.