{"title":"基于深度强化学习的基于agent的盒子操作动画研究","authors":"Hsiang-Yu Yang, Chien-Chou Wong, Sai-Keung Wong","doi":"10.6688/JISE.20210537(3).0003","DOIUrl":null,"url":null,"abstract":"This paper focuses on push-manipulation in an agent-based animation. A policy is learned in a learning session in which an agent perceives its own internal state and the surrounding environment and determines its actions. In each time step, the agent performs an action. Then it receives a reward that is a combination of different types of reward terms, including forward progress, orientation progress, collision avoidance, and finish time. Based on the received reward, the policy is improved gradually. We develop a system that controls an agent to transport a box. We investigate the effects of each reward term and study the impacts of various inputs on the performance of the agent in environments with obstacles. The inputs include the number of rays for perceiving the environment, obstacle settings, and box sizes. We performed some experiments and analyzed our findings in details. The experiment results show that the behaviors of agents are affected by the reward terms and various inputs in certain aspects, such as the movement smoothness of the agents, wandering about the box, loss of orientation, sensitivity about collision avoidance, and pushing styles.","PeriodicalId":50177,"journal":{"name":"Journal of Information Science and Engineering","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Study on Agent-Based Box-Manipulation Animation Using Deep Reinforcement Learning\",\"authors\":\"Hsiang-Yu Yang, Chien-Chou Wong, Sai-Keung Wong\",\"doi\":\"10.6688/JISE.20210537(3).0003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on push-manipulation in an agent-based animation. A policy is learned in a learning session in which an agent perceives its own internal state and the surrounding environment and determines its actions. In each time step, the agent performs an action. Then it receives a reward that is a combination of different types of reward terms, including forward progress, orientation progress, collision avoidance, and finish time. Based on the received reward, the policy is improved gradually. We develop a system that controls an agent to transport a box. We investigate the effects of each reward term and study the impacts of various inputs on the performance of the agent in environments with obstacles. The inputs include the number of rays for perceiving the environment, obstacle settings, and box sizes. We performed some experiments and analyzed our findings in details. The experiment results show that the behaviors of agents are affected by the reward terms and various inputs in certain aspects, such as the movement smoothness of the agents, wandering about the box, loss of orientation, sensitivity about collision avoidance, and pushing styles.\",\"PeriodicalId\":50177,\"journal\":{\"name\":\"Journal of Information Science and Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2021-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Information Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.6688/JISE.20210537(3).0003\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.6688/JISE.20210537(3).0003","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A Study on Agent-Based Box-Manipulation Animation Using Deep Reinforcement Learning
This paper focuses on push-manipulation in an agent-based animation. A policy is learned in a learning session in which an agent perceives its own internal state and the surrounding environment and determines its actions. In each time step, the agent performs an action. Then it receives a reward that is a combination of different types of reward terms, including forward progress, orientation progress, collision avoidance, and finish time. Based on the received reward, the policy is improved gradually. We develop a system that controls an agent to transport a box. We investigate the effects of each reward term and study the impacts of various inputs on the performance of the agent in environments with obstacles. The inputs include the number of rays for perceiving the environment, obstacle settings, and box sizes. We performed some experiments and analyzed our findings in details. The experiment results show that the behaviors of agents are affected by the reward terms and various inputs in certain aspects, such as the movement smoothness of the agents, wandering about the box, loss of orientation, sensitivity about collision avoidance, and pushing styles.
期刊介绍:
The Journal of Information Science and Engineering is dedicated to the dissemination of information on computer science, computer engineering, and computer systems. This journal encourages articles on original research in the areas of computer hardware, software, man-machine interface, theory and applications. tutorial papers in the above-mentioned areas, and state-of-the-art papers on various aspects of computer systems and applications.