使用强化学习的四轴飞行器姿态控制

2022 International Symposium on Electromobility (ISEM) Pub Date : 2022-10-17 DOI:10.1109/ISEM55847.2022.9976737

Shun Nakasone, R. Galluzzi, Rogelio Bustamante-Bello

{"title":"使用强化学习的四轴飞行器姿态控制","authors":"Shun Nakasone, R. Galluzzi, Rogelio Bustamante-Bello","doi":"10.1109/ISEM55847.2022.9976737","DOIUrl":null,"url":null,"abstract":"In this paper, a novel control strategy based on Reinforcement Learning is presented to achieve better performance of attitude control for quadcopters. By using Proximal Policy Optimization, the agent is trained via a reward function and interaction with the environment. The control algorithm obtained from this training process is simulated and tested against proportional-integral-derivative control, being the most common attitude control algorithm used in drone races. The resulting control policies were comparable to the baseline counterpart and, in some cases, outperformed it in terms of noise rejection and robustness to external disturbances.","PeriodicalId":310452,"journal":{"name":"2022 International Symposium on Electromobility (ISEM)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attitude Control for Quadcopters using Reinforcement Learning\",\"authors\":\"Shun Nakasone, R. Galluzzi, Rogelio Bustamante-Bello\",\"doi\":\"10.1109/ISEM55847.2022.9976737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a novel control strategy based on Reinforcement Learning is presented to achieve better performance of attitude control for quadcopters. By using Proximal Policy Optimization, the agent is trained via a reward function and interaction with the environment. The control algorithm obtained from this training process is simulated and tested against proportional-integral-derivative control, being the most common attitude control algorithm used in drone races. The resulting control policies were comparable to the baseline counterpart and, in some cases, outperformed it in terms of noise rejection and robustness to external disturbances.\",\"PeriodicalId\":310452,\"journal\":{\"name\":\"2022 International Symposium on Electromobility (ISEM)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Symposium on Electromobility (ISEM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISEM55847.2022.9976737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Electromobility (ISEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISEM55847.2022.9976737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了提高四轴飞行器的姿态控制性能，提出了一种基于强化学习的控制策略。通过使用最近邻策略优化，通过奖励函数和与环境的交互来训练智能体。在此训练过程中得到的控制算法与无人机比赛中最常用的姿态控制算法比例-积分-导数控制进行了仿真和测试。由此产生的控制策略与基线对应策略相当，并且在某些情况下，在噪声抑制和对外部干扰的鲁棒性方面优于基准策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Attitude Control for Quadcopters using Reinforcement Learning

In this paper, a novel control strategy based on Reinforcement Learning is presented to achieve better performance of attitude control for quadcopters. By using Proximal Policy Optimization, the agent is trained via a reward function and interaction with the environment. The control algorithm obtained from this training process is simulated and tested against proportional-integral-derivative control, being the most common attitude control algorithm used in drone races. The resulting control policies were comparable to the baseline counterpart and, in some cases, outperformed it in terms of noise rejection and robustness to external disturbances.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Symposium on Electromobility (ISEM)

自引率

0.00%

发文量