{"title":"Continuous and Incremental Learning in physical Human-Robot Cooperation using Probabilistic Movement Primitives","authors":"Daniel Schäle, M. Stoelen, E. Kyrkjebø","doi":"10.1109/RO-MAN53752.2022.9900547","DOIUrl":null,"url":null,"abstract":"For a successful deployment of physical Human-Robot Cooperation (pHRC), humans need to be able to teach robots new motor skills quickly. Probabilistic movement primitives (ProMPs) are a promising method to encode a robot’s motor skills learned from human demonstrations in pHRC settings. However, most algorithms to learn ProMPs from human demonstrations operate in batch mode, which is not ideal in pHRC when we want humans and robots to work together from even the first demonstration. In this paper, we propose a new learning algorithm to learn ProMPs incre-mentally and continuously in pHRC settings. Our algorithm incorporates new demonstrations sequentially as they arrive, allowing humans to observe the robot’s learning progress and incrementally shape the robot’s motor skill. A built-in forgetting factor allows for corrective demonstrations resulting from the human’s learning curve or changes in task constraints. We compare the performance of our algorithm to existing batch ProMP algorithms on reference data generated from a pick-and-place task at our lab. Furthermore, we demonstrate how the forgetting factor allows us to adapt to changes in the task. The incremental learning algorithm presented in this paper has the potential to lead to a more intuitive learning progress and to establish a successful cooperation between human and robot faster than training in batch mode.","PeriodicalId":250997,"journal":{"name":"2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 31st IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RO-MAN53752.2022.9900547","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
For a successful deployment of physical Human-Robot Cooperation (pHRC), humans need to be able to teach robots new motor skills quickly. Probabilistic movement primitives (ProMPs) are a promising method to encode a robot’s motor skills learned from human demonstrations in pHRC settings. However, most algorithms to learn ProMPs from human demonstrations operate in batch mode, which is not ideal in pHRC when we want humans and robots to work together from even the first demonstration. In this paper, we propose a new learning algorithm to learn ProMPs incre-mentally and continuously in pHRC settings. Our algorithm incorporates new demonstrations sequentially as they arrive, allowing humans to observe the robot’s learning progress and incrementally shape the robot’s motor skill. A built-in forgetting factor allows for corrective demonstrations resulting from the human’s learning curve or changes in task constraints. We compare the performance of our algorithm to existing batch ProMP algorithms on reference data generated from a pick-and-place task at our lab. Furthermore, we demonstrate how the forgetting factor allows us to adapt to changes in the task. The incremental learning algorithm presented in this paper has the potential to lead to a more intuitive learning progress and to establish a successful cooperation between human and robot faster than training in batch mode.
为了成功地部署物理人机合作(pHRC),人类需要能够快速教会机器人新的运动技能。概率运动原语(Probabilistic movement primitives, ProMPs)是一种很有前途的方法,可以对机器人在pHRC环境中从人类演示中学习到的运动技能进行编码。然而,大多数从人类演示中学习promp的算法都是以批处理模式运行的,当我们希望人类和机器人从第一次演示开始就一起工作时,这在pHRC中并不理想。在本文中,我们提出了一种新的学习算法,用于在pHRC环境中增量和连续地学习promp。我们的算法结合了新的演示顺序,因为他们到达,允许人类观察机器人的学习进度,并逐步塑造机器人的运动技能。内置的遗忘因素允许由于人类的学习曲线或任务限制的变化而产生的纠正性演示。我们将算法的性能与现有的批量ProMP算法在实验室的拾取任务生成的参考数据上进行了比较。此外,我们还展示了遗忘因素如何使我们适应任务中的变化。本文提出的增量学习算法有可能导致更直观的学习过程,并比批处理模式更快地建立人与机器人之间的成功合作。