Simulation of Drone Controller using Reinforcement Learning AI with Hyperparameter Optimization

Mohamad Hafiz Abu Bakar, Abu Ubaidah bin Shamsudin, R. A. Rahim
{"title":"Simulation of Drone Controller using Reinforcement Learning AI with Hyperparameter Optimization","authors":"Mohamad Hafiz Abu Bakar, Abu Ubaidah bin Shamsudin, R. A. Rahim","doi":"10.1109/ICSET51301.2020.9265381","DOIUrl":null,"url":null,"abstract":"Drone is one of the latest drone technologies that grows with multiple applications; one of the critical applications is for fire-fighting drones such as water hose carrying for firefighting. One of the main challenges of the drone technologies is the non-linear dynamic movement caused by a variety of fire conditions. One solution is to use a nonlinear controller such as Reinforcement Learning. In this paper, Reinforcement Learning has been applied as their key control system to improve the conventional approach, which is the agent (drone) that will interact with the environment without need of the controller for the flying process. This paper is introduced an optimization method for the hyperparameter in order to achieve a better reward. In addition, we only concentrate on the learning rate (alpha) and potential reward factor discount (gamma) for optimization in this paper. From this optimization, the better performance and response from our result by using alpha = 0.1 & gamma = 0.8 with reward produced 6100 and it takes 49 seconds in the learning process.","PeriodicalId":299530,"journal":{"name":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 10th International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSET51301.2020.9265381","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Drone is one of the latest drone technologies that grows with multiple applications; one of the critical applications is for fire-fighting drones such as water hose carrying for firefighting. One of the main challenges of the drone technologies is the non-linear dynamic movement caused by a variety of fire conditions. One solution is to use a nonlinear controller such as Reinforcement Learning. In this paper, Reinforcement Learning has been applied as their key control system to improve the conventional approach, which is the agent (drone) that will interact with the environment without need of the controller for the flying process. This paper is introduced an optimization method for the hyperparameter in order to achieve a better reward. In addition, we only concentrate on the learning rate (alpha) and potential reward factor discount (gamma) for optimization in this paper. From this optimization, the better performance and response from our result by using alpha = 0.1 & gamma = 0.8 with reward produced 6100 and it takes 49 seconds in the learning process.
基于超参数优化的强化学习AI无人机控制器仿真
无人机是最新的无人机技术之一,随着多种应用的发展而发展;其中一个关键的应用是消防无人机,如用于消防的水带。无人机技术面临的主要挑战之一是由各种火灾条件引起的非线性动态运动。一个解决方案是使用非线性控制器,如强化学习。本文将强化学习作为其关键控制系统,以改进传统的方法,即智能体(无人机)在飞行过程中无需控制器即可与环境交互。本文介绍了一种优化超参数的方法,以获得更好的奖励。此外,我们在本文中只关注优化的学习率(alpha)和潜在奖励因子折扣(gamma)。从这个优化中,我们使用alpha = 0.1 & gamma = 0.8和奖励产生的更好的性能和响应产生6100,并且在学习过程中需要49秒。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信