Online learning of optimal robot behavior for object manipulation using mode switching

T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko
{"title":"Online learning of optimal robot behavior for object manipulation using mode switching","authors":"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko","doi":"10.1109/ROSE.2012.6402625","DOIUrl":null,"url":null,"abstract":"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.","PeriodicalId":306272,"journal":{"name":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROSE.2012.6402625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.
基于模式切换的最优机器人行为在线学习
本文提出了一种涉及未知机器人动力学和环境的物体操作的最优机器人运动学习方法。环境的动态是由机器人通过在线学习的经验获得的。提出了一种结合模型识别的强化学习框架。在学习框架的基础上,提出了一种通过检测“动态切换”来分解任务的有效运动获取思想,即模式切换。对象操作分为接近对象和推动对象向目标移动两种模式。这使得机器人能够在减少试验次数的同时学习动作,并通过整合模式(每个模式都是单独学习的)来更灵活地行动。在轮式机器人的仿真中对所提出的学习方法进行了验证。结果表明,采用模式切换规划思想可以实现适当的再接近和再推运动,使目标物体准确移动到目标位置。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信