基于模仿学习和模型预测控制的自主家政机器人算法设计

2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML) Pub Date : 2022-03-01 DOI:10.1109/CACML55074.2022.00024

Fangyu Zhu, Zhe Wu

{"title":"基于模仿学习和模型预测控制的自主家政机器人算法设计","authors":"Fangyu Zhu, Zhe Wu","doi":"10.1109/CACML55074.2022.00024","DOIUrl":null,"url":null,"abstract":"Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.","PeriodicalId":137505,"journal":{"name":"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algorithmic Design of Autonomous Housekeeping Robots through Imitation Learning and Model Predictive Control\",\"authors\":\"Fangyu Zhu, Zhe Wu\",\"doi\":\"10.1109/CACML55074.2022.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.\",\"PeriodicalId\":137505,\"journal\":{\"name\":\"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACML55074.2022.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACML55074.2022.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

智能机器人越来越多地融入人类的日常生活，从工作到休闲。例如，自动驾驶汽车正在公共道路上进行测试，智能移动机器人被部署在酒店或博物馆的大厅里为顾客提供帮助。在这个项目中，我们将设计算法自主家政机器人来帮助人们做家务。为了实现智能和高效的运动规划，使机器人能够在不与人类或静态障碍物(如家中家具)碰撞的情况下执行给定任务，我们使用了模仿学习和模型预测控制(MPC)的组合。首先，我们将使用MPC来生成和收集随机生成的机器人、障碍物和目标位置初始条件下的多个最优动作。在此基础上，我们利用模仿学习从MPC生成的最优策略中学习策略网络。此外，我们还采用了数据聚合(DAgger)的概念来进一步提高学习性能。实验结果验证了算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Algorithmic Design of Autonomous Housekeeping Robots through Imitation Learning and Model Predictive Control

Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)

自引率

0.00%

发文量