{"title":"基于模仿学习和模型预测控制的自主家政机器人算法设计","authors":"Fangyu Zhu, Zhe Wu","doi":"10.1109/CACML55074.2022.00024","DOIUrl":null,"url":null,"abstract":"Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.","PeriodicalId":137505,"journal":{"name":"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algorithmic Design of Autonomous Housekeeping Robots through Imitation Learning and Model Predictive Control\",\"authors\":\"Fangyu Zhu, Zhe Wu\",\"doi\":\"10.1109/CACML55074.2022.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.\",\"PeriodicalId\":137505,\"journal\":{\"name\":\"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACML55074.2022.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACML55074.2022.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Algorithmic Design of Autonomous Housekeeping Robots through Imitation Learning and Model Predictive Control
Intelligent robots are more and more adopted into humans' regular life, from work to leisure. For instance, autonomous vehicles are running on public roads for testing, intelligent moving robots are deployed in hotel or museum lobbies to help customers. In this project, we will design algorithmic autonomous housekeeping robots to help people with housework. To enable intelligent and efficient motion planning that allows the robots to execute given tasks without colliding with humans or static obstacles (such as furniture at home), we use a combination of imitation learning and model predictive control (MPC). First, we will use MPC to generate and collect multiple optimal actions for randomly generated initial conditions of the robots, obstacles and target locations. Based on that, we use imitation learning to learn a policy network from the optimal policies generated by MPC. Moreover, we also adopt the concept of data aggregation (DAgger) to further improve the learning performance. The experimental results verify the effectiveness of our algorithms.