Self-Reflective Learning Strategy for Persistent Autonomy of Aerial Manipulators

IF 1 Q4 AUTOMATION & CONTROL SYSTEMS
Xu Zhou, Jiucai Zhang, Xiaoli Zhang
{"title":"Self-Reflective Learning Strategy for Persistent Autonomy of Aerial Manipulators","authors":"Xu Zhou, Jiucai Zhang, Xiaoli Zhang","doi":"10.1115/dscc2019-9086","DOIUrl":null,"url":null,"abstract":"\n Autonomous aerial manipulators have great potentials to assist humans or even fully automate manual labor-intensive tasks such as aerial cleaning, aerial transportation, infrastructure repair, and agricultural inspection and sampling. Reinforcement learning holds the promise of enabling persistent autonomy of aerial manipulators because it can adapt to different situations by automatically learning optimal policies from the interactions between the aerial manipulator and environments. However, the learning process itself could experience failures that can practically endanger the safety of aerial manipulators and hence hinder persistent autonomy. In order to solve this problem, we propose for the aerial manipulator a self-reflective learning strategy that can smartly and safely finding optimal policies for different new situations. This self-reflective manner consists of three steps: identifying the appearance of new situations, re-seeking the optimal policy with reinforcement learning, and evaluating the termination of self-reflection. Numerical simulations demonstrate, compared with conventional learning-based autonomy, our strategy can significantly reduce failures while still can finish the given task.","PeriodicalId":41412,"journal":{"name":"Mechatronic Systems and Control","volume":null,"pages":null},"PeriodicalIF":1.0000,"publicationDate":"2019-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechatronic Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/dscc2019-9086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous aerial manipulators have great potentials to assist humans or even fully automate manual labor-intensive tasks such as aerial cleaning, aerial transportation, infrastructure repair, and agricultural inspection and sampling. Reinforcement learning holds the promise of enabling persistent autonomy of aerial manipulators because it can adapt to different situations by automatically learning optimal policies from the interactions between the aerial manipulator and environments. However, the learning process itself could experience failures that can practically endanger the safety of aerial manipulators and hence hinder persistent autonomy. In order to solve this problem, we propose for the aerial manipulator a self-reflective learning strategy that can smartly and safely finding optimal policies for different new situations. This self-reflective manner consists of three steps: identifying the appearance of new situations, re-seeking the optimal policy with reinforcement learning, and evaluating the termination of self-reflection. Numerical simulations demonstrate, compared with conventional learning-based autonomy, our strategy can significantly reduce failures while still can finish the given task.
空中机械臂持续自主的自反思学习策略
自主空中机械手具有很大的潜力,可以辅助人类,甚至完全自动化人工劳动密集型任务,如空中清洁、空中运输、基础设施维修、农业检验和抽样。强化学习有望实现空中机械臂的持久自治,因为它可以通过从空中机械臂与环境之间的相互作用中自动学习最佳策略来适应不同的情况。然而,学习过程本身可能会出现故障,这实际上会危及空中操纵器的安全,从而阻碍持续的自主性。为了解决这一问题,我们提出了一种能够针对不同新情况智能安全地寻找最优策略的空中机械臂自反射学习策略。这种自我反思方式包括三个步骤:识别新情况的出现,通过强化学习重新寻求最优策略,以及评估自我反思的终止。数值模拟表明,与传统的基于学习的自主学习相比,我们的策略在完成给定任务的同时显著减少了失败。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Mechatronic Systems and Control
Mechatronic Systems and Control AUTOMATION & CONTROL SYSTEMS-
CiteScore
1.40
自引率
66.70%
发文量
27
期刊介绍: This international journal publishes both theoretical and application-oriented papers on various aspects of mechatronic systems, modelling, design, conventional and intelligent control, and intelligent systems. Application areas of mechatronics may include robotics, transportation, energy systems, manufacturing, sensors, actuators, and automation. Techniques of artificial intelligence may include soft computing (fuzzy logic, neural networks, genetic algorithms/evolutionary computing, probabilistic methods, etc.). Techniques may cover frequency and time domains, linear and nonlinear systems, and deterministic and stochastic processes. Hybrid techniques of mechatronics that combine conventional and intelligent methods are also included. First published in 1972, this journal originated with an emphasis on conventional control systems and computer-based applications. Subsequently, with rapid advances in the field and in view of the widespread interest and application of soft computing in control systems, this latter aspect was integrated into the journal. Now the area of mechatronics is included as the main focus. A unique feature of the journal is its pioneering role in bridging the gap between conventional systems and intelligent systems, with an equal emphasis on theory and practical applications, including system modelling, design and instrumentation. It appears four times per year.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信