{"title":"基于高斯过程的模仿学习模型预测控制器","authors":"V. Joukov, D. Kulić","doi":"10.1109/HUMANOIDS.2017.8246971","DOIUrl":null,"url":null,"abstract":"Humans still outperform robots in most manipulation and locomotion tasks. Research suggests that humans minimize a task specific cost function when performing movements. In this paper we present a Gaussian Process based method to learn the underlying cost function, without making assumptions on its structure, and reproduce the demonstrated movement on a robot using a linear model predictive control framework. We show that the learned cost function can be used to prioritize between tracking and additional cost functions based on exemplar variance, and satisfy task and joint space constraints. Tuning the weighting between learned position and velocity costs produces trajectories of the desired shape even in the presence of constraints. The approach is validated in simulation with a simple 2dof manipulator showing joint and task space tracking and with a 4dof manipulator reproducing trajectories based on a human handwriting dataset.","PeriodicalId":143992,"journal":{"name":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Gaussian process based model predictive controller for imitation learning\",\"authors\":\"V. Joukov, D. Kulić\",\"doi\":\"10.1109/HUMANOIDS.2017.8246971\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Humans still outperform robots in most manipulation and locomotion tasks. Research suggests that humans minimize a task specific cost function when performing movements. In this paper we present a Gaussian Process based method to learn the underlying cost function, without making assumptions on its structure, and reproduce the demonstrated movement on a robot using a linear model predictive control framework. We show that the learned cost function can be used to prioritize between tracking and additional cost functions based on exemplar variance, and satisfy task and joint space constraints. Tuning the weighting between learned position and velocity costs produces trajectories of the desired shape even in the presence of constraints. The approach is validated in simulation with a simple 2dof manipulator showing joint and task space tracking and with a 4dof manipulator reproducing trajectories based on a human handwriting dataset.\",\"PeriodicalId\":143992,\"journal\":{\"name\":\"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HUMANOIDS.2017.8246971\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HUMANOIDS.2017.8246971","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Gaussian process based model predictive controller for imitation learning
Humans still outperform robots in most manipulation and locomotion tasks. Research suggests that humans minimize a task specific cost function when performing movements. In this paper we present a Gaussian Process based method to learn the underlying cost function, without making assumptions on its structure, and reproduce the demonstrated movement on a robot using a linear model predictive control framework. We show that the learned cost function can be used to prioritize between tracking and additional cost functions based on exemplar variance, and satisfy task and joint space constraints. Tuning the weighting between learned position and velocity costs produces trajectories of the desired shape even in the presence of constraints. The approach is validated in simulation with a simple 2dof manipulator showing joint and task space tracking and with a 4dof manipulator reproducing trajectories based on a human handwriting dataset.