T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko
{"title":"Online learning of optimal robot behavior for object manipulation using mode switching","authors":"T. Sekiguchi, Yuichi Kobayashi, A. Shimizu, T. Kaneko","doi":"10.1109/ROSE.2012.6402625","DOIUrl":null,"url":null,"abstract":"This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.","PeriodicalId":306272,"journal":{"name":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Symposium on Robotic and Sensors Environments Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROSE.2012.6402625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper presents an optimal robot motion learning method that involves object manipulation where dynamics of robots and environment are unknown. The dynamics of the environment is acquired by the robot's experience through online learning. A reinforcement learning framework which incorporates model identification is proposed. Based on the learning framework, an idea of effective motion acquisition is proposed through decomposing the task by detecting `switching of dynamics', which is called mode-switching. Object manipulation is divided into two modes, approaching to the object and pushing it toward the goal. This enables the robot to learn motions while reducing number of trials and to behave more dexterously by integrating modes, each of which was learned separately. The proposed learning method is evaluated in simulation of a wheeled robot. It was shown that appropriate motion for re-approaching and re-pushing to accurately move the object to the goal can be realized using the proposed idea of planning with mode switching.