{"title":"Gaussian process based model predictive controller for imitation learning","authors":"V. Joukov, D. Kulić","doi":"10.1109/HUMANOIDS.2017.8246971","DOIUrl":null,"url":null,"abstract":"Humans still outperform robots in most manipulation and locomotion tasks. Research suggests that humans minimize a task specific cost function when performing movements. In this paper we present a Gaussian Process based method to learn the underlying cost function, without making assumptions on its structure, and reproduce the demonstrated movement on a robot using a linear model predictive control framework. We show that the learned cost function can be used to prioritize between tracking and additional cost functions based on exemplar variance, and satisfy task and joint space constraints. Tuning the weighting between learned position and velocity costs produces trajectories of the desired shape even in the presence of constraints. The approach is validated in simulation with a simple 2dof manipulator showing joint and task space tracking and with a 4dof manipulator reproducing trajectories based on a human handwriting dataset.","PeriodicalId":143992,"journal":{"name":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HUMANOIDS.2017.8246971","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Humans still outperform robots in most manipulation and locomotion tasks. Research suggests that humans minimize a task specific cost function when performing movements. In this paper we present a Gaussian Process based method to learn the underlying cost function, without making assumptions on its structure, and reproduce the demonstrated movement on a robot using a linear model predictive control framework. We show that the learned cost function can be used to prioritize between tracking and additional cost functions based on exemplar variance, and satisfy task and joint space constraints. Tuning the weighting between learned position and velocity costs produces trajectories of the desired shape even in the presence of constraints. The approach is validated in simulation with a simple 2dof manipulator showing joint and task space tracking and with a 4dof manipulator reproducing trajectories based on a human handwriting dataset.