Boyu Li, Tao Jin, Yuanheng Zhu, Haoran Li, Yingnian Wu, Dongbin Zhao
{"title":"基于深度强化学习的运动目标射击控制策略","authors":"Boyu Li, Tao Jin, Yuanheng Zhu, Haoran Li, Yingnian Wu, Dongbin Zhao","doi":"10.1109/ICCSS53909.2021.9722012","DOIUrl":null,"url":null,"abstract":"Robots are playing a more and more important role in people’s production and life, recently. However, robot control in dynamic environment is still a difficulty. With the great breakthrough of deep reinforcement learning in the field of video games, this method is also extended to the field of robots. Due to the gap between the simulation environment and the real environment, the deep reinforcement learning algorithm trained in the simulation environment is difficult to be applied to the real environment. Aiming at the gimbal control with two degrees of freedom (DOF), a pipline combining system identification and deep reinforcement learning is proposed. On the one hand, the shooting accuracy of the gimbal to moving objects is improved through deep reinforcement learning algorithm. On the other hand, the gap between simulation and reality is reduced through system identification. The method is verified in the RoboMaster University AI Challenge (RMUA) shooting system. The results show that the shooting accuracy is better than the classical control method.","PeriodicalId":435816,"journal":{"name":"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)","volume":"111 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Moving Target Shooting Control Policy Based on Deep Reinforcement Learning\",\"authors\":\"Boyu Li, Tao Jin, Yuanheng Zhu, Haoran Li, Yingnian Wu, Dongbin Zhao\",\"doi\":\"10.1109/ICCSS53909.2021.9722012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Robots are playing a more and more important role in people’s production and life, recently. However, robot control in dynamic environment is still a difficulty. With the great breakthrough of deep reinforcement learning in the field of video games, this method is also extended to the field of robots. Due to the gap between the simulation environment and the real environment, the deep reinforcement learning algorithm trained in the simulation environment is difficult to be applied to the real environment. Aiming at the gimbal control with two degrees of freedom (DOF), a pipline combining system identification and deep reinforcement learning is proposed. On the one hand, the shooting accuracy of the gimbal to moving objects is improved through deep reinforcement learning algorithm. On the other hand, the gap between simulation and reality is reduced through system identification. The method is verified in the RoboMaster University AI Challenge (RMUA) shooting system. The results show that the shooting accuracy is better than the classical control method.\",\"PeriodicalId\":435816,\"journal\":{\"name\":\"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)\",\"volume\":\"111 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSS53909.2021.9722012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Information, Cybernetics, and Computational Social Systems (ICCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSS53909.2021.9722012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Moving Target Shooting Control Policy Based on Deep Reinforcement Learning
Robots are playing a more and more important role in people’s production and life, recently. However, robot control in dynamic environment is still a difficulty. With the great breakthrough of deep reinforcement learning in the field of video games, this method is also extended to the field of robots. Due to the gap between the simulation environment and the real environment, the deep reinforcement learning algorithm trained in the simulation environment is difficult to be applied to the real environment. Aiming at the gimbal control with two degrees of freedom (DOF), a pipline combining system identification and deep reinforcement learning is proposed. On the one hand, the shooting accuracy of the gimbal to moving objects is improved through deep reinforcement learning algorithm. On the other hand, the gap between simulation and reality is reduced through system identification. The method is verified in the RoboMaster University AI Challenge (RMUA) shooting system. The results show that the shooting accuracy is better than the classical control method.