An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion

Recep Özalp, Çağrı Kaymak, Özal Yıldırım, A. Uçar, Y. Demir, C. Güzeliş
{"title":"An Implementation of Vision Based Deep Reinforcement Learning for Humanoid Robot Locomotion","authors":"Recen Özaln, Çağrı Kaymak, Özal Yıldırım, A. Uçar, Y. Demir, C. Güzelı̇ş","doi":"10.1109/INISTA.2019.8778209","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) exhibits a promising approach for controlling humanoid robot locomotion. However, only values relating sensors such as IMU, gyroscope, and GPS are not sufficient robots to learn their locomotion skills. In this article, we aim to show the success of vision based DRL. We propose a new vision based deep reinforcement learning algorithm for the locomotion of the Robotis-op2 humanoid robot for the first time. In experimental setup, we construct the locomotion of humanoid robot in a specific environment in the Webots software. We use Double Dueling Q Networks (D3QN) and Deep Q Networks (DQN) that are a kind of reinforcement learning algorithm. We present the performance of vision based DRL algorithm on a locomotion experiment. The experimental results show that D3QN is better than DQN in that stable locomotion and fast training and the vision based DRL algorithms will be successfully able to use at the other complex environments and applications.","PeriodicalId":262143,"journal":{"name":"2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications (INISTA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications (INISTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INISTA.2019.8778209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Deep reinforcement learning (DRL) is a promising approach for controlling humanoid robot locomotion. However, values from sensors such as the IMU, gyroscope, and GPS alone are not sufficient for robots to learn locomotion skills. In this article, we aim to demonstrate the success of vision-based DRL. We propose, for the first time, a new vision-based deep reinforcement learning algorithm for the locomotion of the Robotis-op2 humanoid robot. In the experimental setup, we construct the locomotion task for the humanoid robot in a specific environment in the Webots simulation software. We use Dueling Double Deep Q-Networks (D3QN) and Deep Q-Networks (DQN), which are reinforcement learning algorithms. We present the performance of the vision-based DRL algorithms in a locomotion experiment. The experimental results show that D3QN outperforms DQN in terms of stable locomotion and faster training, and suggest that vision-based DRL algorithms can be successfully applied to other complex environments and applications.
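The paper itself does not include code; as a rough illustration of the network family it compares, the sketch below shows a dueling Q-network head over a small convolutional encoder for camera observations, with the double-DQN bootstrap target in a helper function. The layer sizes, the assumed 84x84 input, the action count, and all names (DuelingQNetwork, double_dqn_target) are assumptions made for illustration only, not details taken from the paper.

```python
# Minimal, illustrative sketch (not the authors' code): a dueling Q-network over
# image observations, as used in D3QN-style agents. Layer sizes, the 84x84 input,
# and the action count are assumptions chosen only for illustration.
import torch
import torch.nn as nn


class DuelingQNetwork(nn.Module):
    def __init__(self, num_actions: int, in_channels: int = 3):
        super().__init__()
        # Small convolutional encoder for camera frames (assumed 84x84 input).
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        feat_dim = 64 * 7 * 7  # flattened size for an 84x84 input
        # Dueling heads: state value V(s) and per-action advantages A(s, a).
        self.value = nn.Sequential(
            nn.Linear(feat_dim, 512), nn.ReLU(), nn.Linear(512, 1))
        self.advantage = nn.Sequential(
            nn.Linear(feat_dim, 512), nn.ReLU(), nn.Linear(512, num_actions))

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        feats = self.encoder(obs)
        v = self.value(feats)
        a = self.advantage(feats)
        # Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)
        return v + a - a.mean(dim=1, keepdim=True)


def double_dqn_target(online: DuelingQNetwork, target: DuelingQNetwork,
                      reward: torch.Tensor, next_obs: torch.Tensor,
                      done: torch.Tensor, gamma: float = 0.99) -> torch.Tensor:
    """Double-DQN bootstrap target: select the next action with the online
    network, evaluate it with the target network."""
    with torch.no_grad():
        next_actions = online(next_obs).argmax(dim=1, keepdim=True)
        next_q = target(next_obs).gather(1, next_actions).squeeze(1)
        return reward + gamma * (1.0 - done) * next_q
```

In a plain DQN, by contrast, the next action would be both selected and evaluated with the target network; the dueling head and the decoupled action selection are the two changes that distinguish the D3QN-style agent compared in the paper.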