Weitian He;Fukai Zhang;Zhijia Zhao;Chenguang Yang;Cong Wang
{"title":"未知二自由度直升机系统基于强化动态学习的跟踪控制策略","authors":"Weitian He;Fukai Zhang;Zhijia Zhao;Chenguang Yang;Cong Wang","doi":"10.1109/TSMC.2025.3582830","DOIUrl":null,"url":null,"abstract":"This study investigates a multitrajectory tracking control strategy for an unknown 2-DOF helicopter system, integrating deterministic learning (DL) and reinforcement learning (RL). Initially, DL theory is applied to identify the local unknown dynamics of a 2-DOF helicopter system using radial basis function neural networks (RBFNNs). Subsequently, the identified dynamic knowledge is expressed and stored using constant RBFNNs. To mitigate the issue of partial knowledge failure due to deviations between the actual and learned trajectories, we introduce a RL framework for dynamic compensation. Finally, a composite control strategy incorporating both nominal and auxiliary components is designed to achieve multitrajectory tracking control. The stability of the closed-loop system is analyzed and demonstrated using the Lyapunov direct method. The simulation and experimental results demonstrate the effectiveness of the proposed control strategy.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 10","pages":"6777-6788"},"PeriodicalIF":8.7000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Dynamic Learning-Based Tracking Control Strategy for an Unknown 2-DOF Helicopter System\",\"authors\":\"Weitian He;Fukai Zhang;Zhijia Zhao;Chenguang Yang;Cong Wang\",\"doi\":\"10.1109/TSMC.2025.3582830\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study investigates a multitrajectory tracking control strategy for an unknown 2-DOF helicopter system, integrating deterministic learning (DL) and reinforcement learning (RL). Initially, DL theory is applied to identify the local unknown dynamics of a 2-DOF helicopter system using radial basis function neural networks (RBFNNs). Subsequently, the identified dynamic knowledge is expressed and stored using constant RBFNNs. To mitigate the issue of partial knowledge failure due to deviations between the actual and learned trajectories, we introduce a RL framework for dynamic compensation. Finally, a composite control strategy incorporating both nominal and auxiliary components is designed to achieve multitrajectory tracking control. The stability of the closed-loop system is analyzed and demonstrated using the Lyapunov direct method. The simulation and experimental results demonstrate the effectiveness of the proposed control strategy.\",\"PeriodicalId\":48915,\"journal\":{\"name\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"volume\":\"55 10\",\"pages\":\"6777-6788\"},\"PeriodicalIF\":8.7000,\"publicationDate\":\"2025-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11084854/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11084854/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Reinforcement Dynamic Learning-Based Tracking Control Strategy for an Unknown 2-DOF Helicopter System
This study investigates a multitrajectory tracking control strategy for an unknown 2-DOF helicopter system, integrating deterministic learning (DL) and reinforcement learning (RL). Initially, DL theory is applied to identify the local unknown dynamics of a 2-DOF helicopter system using radial basis function neural networks (RBFNNs). Subsequently, the identified dynamic knowledge is expressed and stored using constant RBFNNs. To mitigate the issue of partial knowledge failure due to deviations between the actual and learned trajectories, we introduce a RL framework for dynamic compensation. Finally, a composite control strategy incorporating both nominal and auxiliary components is designed to achieve multitrajectory tracking control. The stability of the closed-loop system is analyzed and demonstrated using the Lyapunov direct method. The simulation and experimental results demonstrate the effectiveness of the proposed control strategy.
期刊介绍:
The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.