Offline reinforcement learning application in robotic manipulation with a COG method case
Yanpeng Huo, Yuning Liang
Proceedings of the 6th International Conference on Control Engineering and Artificial Intelligence, 2022-03-11
DOI: 10.1145/3522749.3523075 (https://doi.org/10.1145/3522749.3523075)
Artificial intelligence is now applied across many industrial fields. Reinforcement learning (RL) is an active topic in both artificial intelligence and robotics, and it is an important learning method for robotic manipulation. RL training can be divided into online learning and offline learning. Offline RL algorithms have great potential for turning large datasets into powerful decision engines. Most robot applications collect data from scratch for each new task; combining offline learning with online learning addresses this problem and makes training more efficient and convenient. This paper aims to give a clear introduction to the application of offline reinforcement learning in robotic manipulation. It first presents the basic formulation of reinforcement learning in two parts: the Markov decision process (MDP) and one solution method, policy gradients. It then analyzes the COG algorithm, an application of offline learning to robotic manipulation, examining how offline learning uses prior data to learn new robotic skills and how the method addresses specific problems in robotics, such as sample efficiency. The results show that offline learning has important research value in robotic manipulation: it reduces training time, makes the training process more efficient, and shows clear advantages in addressing sample efficiency in robotics.
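For orientation, the two ingredients the abstract names, the Markov decision process and policy gradients, can be written in their standard textbook form. This is only a sketch: the symbols below (policy parameters \theta, horizon T, discount \gamma, trajectory \tau) are common conventions assumed here, and the paper's own notation may differ.

```latex
% Standard textbook notation (an assumption; not taken from the paper).
% MDP: state space S, action space A, transition kernel P,
% reward function r, discount factor gamma in [0, 1).
\[
  \mathcal{M} = (\mathcal{S}, \mathcal{A}, P, r, \gamma), \qquad
  J(\theta) = \mathbb{E}_{\tau \sim \pi_\theta}\!\left[ R(\tau) \right], \qquad
  R(\tau) = \sum_{t=0}^{T} \gamma^{t}\, r(s_t, a_t)
\]
% Policy gradient in its basic (REINFORCE) form: the gradient of the
% expected return with respect to the policy parameters theta.
\[
  \nabla_\theta J(\theta)
  = \mathbb{E}_{\tau \sim \pi_\theta}\!\left[
      \sum_{t=0}^{T} \nabla_\theta \log \pi_\theta(a_t \mid s_t)\, R(\tau)
    \right]
\]
```

The expectation is taken over trajectories sampled from the current policy \pi_\theta, so this estimator in its plain form relies on freshly collected online data. Offline reinforcement learning instead learns from a previously collected dataset, which is the setting the abstract describes.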