{"title":"Optimized Leader-Follower Consensus Control of Multi-QUAV Attitude System Using Reinforcement Learning and Backstepping","authors":"Guoxing Wen;Yanfen Song;Zijun Li;Bin Li","doi":"10.1109/TETCI.2025.3537943","DOIUrl":null,"url":null,"abstract":"This work is to explore the optimized leader-follower attitude consensus scheme for the multi-quadrotor unmanned aerial vehicle (QUAV) system. Since the QUAV attitude dynamic is modeled by a second-order nonlinear differential equation, the optimized backstepping (OB) technique can be competent for this control design. To derive the optimized leader-follower attitude consensus control, the critic-actor reinforcement learning (RL) is performed in the final backstepping step. Different with the attitude control of single QUAV, the case of multi-QUAV is composed of multiple intercommunicated QUAV attitude individuals, so its control design is more complex and thorny. Moreover, the traditional RL optimizing controls deduce the critic or actor updating law from the negative gradient of approximated Hamilton–Jacobi–Bellman (HJB) equation' square, thus it leads to these algorithms very complexity. Hence the traditional optimizing control methods are implemented to multi-QUAV attitude system difficultly. However, since this optimized scheme deduces the RL training laws from a simple positive function of equivalent with HJB equation, it can obviously simplify algorithm for the smooth application in the multi-QUAV attitude system. Finally, theory and simulation certify the feasibility of this optimized consensus control.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1469-1479"},"PeriodicalIF":5.3000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10897906/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
This work is to explore the optimized leader-follower attitude consensus scheme for the multi-quadrotor unmanned aerial vehicle (QUAV) system. Since the QUAV attitude dynamic is modeled by a second-order nonlinear differential equation, the optimized backstepping (OB) technique can be competent for this control design. To derive the optimized leader-follower attitude consensus control, the critic-actor reinforcement learning (RL) is performed in the final backstepping step. Different with the attitude control of single QUAV, the case of multi-QUAV is composed of multiple intercommunicated QUAV attitude individuals, so its control design is more complex and thorny. Moreover, the traditional RL optimizing controls deduce the critic or actor updating law from the negative gradient of approximated Hamilton–Jacobi–Bellman (HJB) equation' square, thus it leads to these algorithms very complexity. Hence the traditional optimizing control methods are implemented to multi-QUAV attitude system difficultly. However, since this optimized scheme deduces the RL training laws from a simple positive function of equivalent with HJB equation, it can obviously simplify algorithm for the smooth application in the multi-QUAV attitude system. Finally, theory and simulation certify the feasibility of this optimized consensus control.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.