{"title":"使用分层强化学习的虚拟生物实时运动生成","authors":"Keisuke Ogaki, Masayoshi Nakamura","doi":"10.1145/3214822.3214826","DOIUrl":null,"url":null,"abstract":"Describing the motions of imaginary original creatures is an essential part of animations and computer games. One approach to generate such motions involves finding an optimal motion for approaching a goal by using the creatures' body and motor skills. Currently, researchers are employing deep reinforcement learning (DeepRL) to find such optimal motions. Some end-to-end DeepRL approaches learn the policy function, which outputs target pose for each joint according to the environment. In our study, we employed a hierarchical approach with a separate DeepRL decision maker and simple exploration-based sequence maker, and an action token, through which these two layers can communicate. By optimizing these two functions independently, we can achieve a light, fast-learning system available on mobile devices. In addition, we propose another technique to learn the policy at a faster pace with the help of a heuristic rule. By treating the heuristic rule as an additional action token, we can naturally incorporate it via Q-learning. The experimental results show that creatures can achieve better performance with the use of both heuristics and DeepRL than by using them independently.","PeriodicalId":225677,"journal":{"name":"ACM SIGGRAPH 2018 Studio","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-time motion generation for imaginary creatures using hierarchical reinforcement learning\",\"authors\":\"Keisuke Ogaki, Masayoshi Nakamura\",\"doi\":\"10.1145/3214822.3214826\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Describing the motions of imaginary original creatures is an essential part of animations and computer games. One approach to generate such motions involves finding an optimal motion for approaching a goal by using the creatures' body and motor skills. Currently, researchers are employing deep reinforcement learning (DeepRL) to find such optimal motions. Some end-to-end DeepRL approaches learn the policy function, which outputs target pose for each joint according to the environment. In our study, we employed a hierarchical approach with a separate DeepRL decision maker and simple exploration-based sequence maker, and an action token, through which these two layers can communicate. By optimizing these two functions independently, we can achieve a light, fast-learning system available on mobile devices. In addition, we propose another technique to learn the policy at a faster pace with the help of a heuristic rule. By treating the heuristic rule as an additional action token, we can naturally incorporate it via Q-learning. 
The experimental results show that creatures can achieve better performance with the use of both heuristics and DeepRL than by using them independently.\",\"PeriodicalId\":225677,\"journal\":{\"name\":\"ACM SIGGRAPH 2018 Studio\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM SIGGRAPH 2018 Studio\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3214822.3214826\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGGRAPH 2018 Studio","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3214822.3214826","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Describing the motions of original, imaginary creatures is an essential part of animation and computer games. One approach to generating such motions is to find an optimal motion for approaching a goal given the creature's body and motor skills. Researchers currently employ deep reinforcement learning (DeepRL) to find such optimal motions. Some end-to-end DeepRL approaches learn a policy function that outputs a target pose for each joint according to the environment. In our study, we employed a hierarchical approach consisting of a DeepRL-based decision maker, a simple exploration-based sequence maker, and action tokens through which these two layers communicate. By optimizing the two functions independently, we obtain a lightweight, fast-learning system that runs on mobile devices. In addition, we propose a technique for learning the policy faster with the help of a heuristic rule: by treating the rule as an additional action token, we can incorporate it naturally via Q-learning. Experimental results show that creatures perform better when heuristics and DeepRL are combined than when either is used alone.
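
The following is a minimal sketch of the two ideas above: a hierarchical split between a token-level decision maker and a lower-level sequence maker, and a heuristic rule exposed as one extra action token so that Q-learning can decide when to trust it. This is not the authors' implementation; tabular Q-learning stands in for the paper's DeepRL decision maker, and all names (`sequence_maker`, `heuristic_rule`, the toy `env` interface) and hyperparameters are hypothetical.

```python
import random
from collections import defaultdict

# --- hypothetical sizes and hyperparameters (not from the paper) ---
N_MOTION_TOKENS = 8                 # tokens the sequence maker understands
HEURISTIC_TOKEN = N_MOTION_TOKENS   # one extra token: "defer to the heuristic rule"
ACTIONS = list(range(N_MOTION_TOKENS + 1))
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1

# Q-values per (hashable) state, one entry per token, heuristic included.
q_table = defaultdict(lambda: [0.0] * len(ACTIONS))

def choose_token(state):
    """Upper layer (decision maker): epsilon-greedy over all tokens,
    including the heuristic one."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    values = q_table[state]
    return max(ACTIONS, key=lambda a: values[a])

def q_update(state, token, reward, next_state):
    """Tabular Q-learning update. The heuristic token is updated exactly
    like any motion token, so the learner discovers where the rule helps."""
    target = reward + GAMMA * max(q_table[next_state])
    q_table[state][token] += ALPHA * (target - q_table[state][token])

def run_step(state, env, heuristic_rule, sequence_maker):
    """One decision step. The lower layer turns the chosen token into
    joint-level motion; `env`, `heuristic_rule`, and `sequence_maker`
    are stand-ins for the paper's components."""
    token = choose_token(state)
    if token == HEURISTIC_TOKEN:
        motion = heuristic_rule(state)         # hand-written rule
    else:
        motion = sequence_maker(token, state)  # exploration-based sequence
    next_state, reward = env.apply(motion)     # assumed env interface
    q_update(state, token, reward, next_state)
    return next_state
```

Because the heuristic token sits in the same action space as the motion tokens, no special machinery is needed: in states where the hand-written rule earns high reward its Q-value rises, and the learned policy falls back on ordinary motion tokens elsewhere.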