{"title":"Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning","authors":"Mohamed Mokhtar, A. El-Badawy","doi":"10.1109/ICUAS57906.2023.10156126","DOIUrl":null,"url":null,"abstract":"A deep reinforcement learning-based control framework has been proposed in this paper to achieve autonomous navigation and control of a quadrotor. Cascaded reinforcement learning agents form the control framework. First, a path following (PF) agent controls the quadrotor’s tracking behavior by directly mapping environment states into motor commands. The second agent modifies the desired path to avoid any detected obstacles along the path. The obstacle avoidance (OA) agent achieves this task by adding an offset distance deflection to the tracking error before sending it to the path-following agent. Generalization of the obstacle avoidance behavior in three-dimensional space was achieved by the usage of frame transformation. The two agents were trained using the \"Twin Delayed Deep Deterministic Policy Gradient\" (TD3) algorithm, and the developed framework succeeded in avoiding multiple obstacles of different sizes and configurations in simulation.","PeriodicalId":379073,"journal":{"name":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUAS57906.2023.10156126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A deep reinforcement learning-based control framework has been proposed in this paper to achieve autonomous navigation and control of a quadrotor. Cascaded reinforcement learning agents form the control framework. First, a path following (PF) agent controls the quadrotor’s tracking behavior by directly mapping environment states into motor commands. The second agent modifies the desired path to avoid any detected obstacles along the path. The obstacle avoidance (OA) agent achieves this task by adding an offset distance deflection to the tracking error before sending it to the path-following agent. Generalization of the obstacle avoidance behavior in three-dimensional space was achieved by the usage of frame transformation. The two agents were trained using the "Twin Delayed Deep Deterministic Policy Gradient" (TD3) algorithm, and the developed framework succeeded in avoiding multiple obstacles of different sizes and configurations in simulation.