基于超参数优化的强化学习AI无人机控制器仿真

2020 IEEE 10th International Conference on System Engineering and Technology (ICSET) Pub Date : 2020-11-09 DOI:10.1109/ICSET51301.2020.9265381

Mohamad Hafiz Abu Bakar, Abu Ubaidah bin Shamsudin, R. A. Rahim

{"title":"基于超参数优化的强化学习AI无人机控制器仿真","authors":"Mohamad Hafiz Abu Bakar, Abu Ubaidah bin Shamsudin, R. A. Rahim","doi":"10.1109/ICSET51301.2020.9265381","DOIUrl":null,"url":null,"abstract":"Drone is one of the latest drone technologies that grows with multiple applications; one of the critical applications is for fire-fighting drones such as water hose carrying for firefighting. One of the main challenges of the drone technologies is the non-linear dynamic movement caused by a variety of fire conditions. One solution is to use a nonlinear controller such as Reinforcement Learning. In this paper, Reinforcement Learning has been applied as their key control system to improve the conventional approach, which is the agent (drone) that will interact with the environment without need of the controller for the flying process. This paper is introduced an optimization method for the hyperparameter in order to achieve a better reward. In addition, we only concentrate on the learning rate (alpha) and potential reward factor discount (gamma) for optimization in this paper. From this optimization, the better performance and response from our result by using alpha = 0.1 & gamma = 0.8 with reward produced 6100 and it takes 49 seconds in the learning process.","PeriodicalId":299530,"journal":{"name":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Simulation of Drone Controller using Reinforcement Learning AI with Hyperparameter Optimization\",\"authors\":\"Mohamad Hafiz Abu Bakar, Abu Ubaidah bin Shamsudin, R. A. Rahim\",\"doi\":\"10.1109/ICSET51301.2020.9265381\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Drone is one of the latest drone technologies that grows with multiple applications; one of the critical applications is for fire-fighting drones such as water hose carrying for firefighting. One of the main challenges of the drone technologies is the non-linear dynamic movement caused by a variety of fire conditions. One solution is to use a nonlinear controller such as Reinforcement Learning. In this paper, Reinforcement Learning has been applied as their key control system to improve the conventional approach, which is the agent (drone) that will interact with the environment without need of the controller for the flying process. This paper is introduced an optimization method for the hyperparameter in order to achieve a better reward. In addition, we only concentrate on the learning rate (alpha) and potential reward factor discount (gamma) for optimization in this paper. From this optimization, the better performance and response from our result by using alpha = 0.1 & gamma = 0.8 with reward produced 6100 and it takes 49 seconds in the learning process.\",\"PeriodicalId\":299530,\"journal\":{\"name\":\"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)\",\"volume\":\"132 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSET51301.2020.9265381\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSET51301.2020.9265381","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

无人机是最新的无人机技术之一，随着多种应用的发展而发展;其中一个关键的应用是消防无人机，如用于消防的水带。无人机技术面临的主要挑战之一是由各种火灾条件引起的非线性动态运动。一个解决方案是使用非线性控制器，如强化学习。本文将强化学习作为其关键控制系统，以改进传统的方法，即智能体(无人机)在飞行过程中无需控制器即可与环境交互。本文介绍了一种优化超参数的方法，以获得更好的奖励。此外，我们在本文中只关注优化的学习率(alpha)和潜在奖励因子折扣(gamma)。从这个优化中，我们使用alpha = 0.1 & gamma = 0.8和奖励产生的更好的性能和响应产生6100，并且在学习过程中需要49秒。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Simulation of Drone Controller using Reinforcement Learning AI with Hyperparameter Optimization

Drone is one of the latest drone technologies that grows with multiple applications; one of the critical applications is for fire-fighting drones such as water hose carrying for firefighting. One of the main challenges of the drone technologies is the non-linear dynamic movement caused by a variety of fire conditions. One solution is to use a nonlinear controller such as Reinforcement Learning. In this paper, Reinforcement Learning has been applied as their key control system to improve the conventional approach, which is the agent (drone) that will interact with the environment without need of the controller for the flying process. This paper is introduced an optimization method for the hyperparameter in order to achieve a better reward. In addition, we only concentrate on the learning rate (alpha) and potential reward factor discount (gamma) for optimization in this paper. From this optimization, the better performance and response from our result by using alpha = 0.1 & gamma = 0.8 with reward produced 6100 and it takes 49 seconds in the learning process.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)

自引率

0.00%

发文量