The Effects of Rewards on Autonomous Unmanned Aerial Vehicle (UAV) Operations Using Reinforcement Learning

Hemali Virani, Dahai Liu, Dennis A. Vincenzi
{"title":"The Effects of Rewards on Autonomous Unmanned Aerial Vehicle (UAV) Operations Using Reinforcement Learning","authors":"Hemali Virani, Dahai Liu, Dennis A. Vincenzi","doi":"10.1142/S2301385021500187","DOIUrl":null,"url":null,"abstract":"The effects of rewards on the ability of an autonomous UAV controlled by a Reinforcement Learning agent to accomplish a target localization task were investigated. It was shown that with an increase in the reward obtained by a learning agent upon correct detection, systems would become more risk-tolerant, efficient and have a tendency to locate targets faster with an increase in the sensor sensitivity after systems achieve steady-state performance.","PeriodicalId":164619,"journal":{"name":"Unmanned Syst.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Unmanned Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S2301385021500187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The effects of rewards on the ability of an autonomous UAV controlled by a Reinforcement Learning agent to accomplish a target localization task were investigated. It was shown that with an increase in the reward obtained by a learning agent upon correct detection, systems would become more risk-tolerant, efficient and have a tendency to locate targets faster with an increase in the sensor sensitivity after systems achieve steady-state performance.
基于强化学习的奖励对自主无人机(UAV)操作的影响
研究了奖励对由强化学习代理控制的自主无人机完成目标定位任务能力的影响。结果表明,随着学习智能体在正确检测后获得的奖励的增加,在系统达到稳态性能后,随着传感器灵敏度的增加,系统的风险容忍度和效率会提高,并倾向于更快地定位目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信