基于奖励塑造的足球任务合作行为习得

Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence Pub Date : 2021-03-05 DOI:10.1145/3461353.3461360

Takashi Abe, R. Orihara, Y. Sei, Yasuyuki Tahara, Akihiko Ohsuga

{"title":"基于奖励塑造的足球任务合作行为习得","authors":"Takashi Abe, R. Orihara, Y. Sei, Yasuyuki Tahara, Akihiko Ohsuga","doi":"10.1145/3461353.3461360","DOIUrl":null,"url":null,"abstract":"In this research, soccer task is investigated among the numerous tasks of deep reinforcement learning. The soccer task requires cooperative behavior. However, it is difficult for the agents to acquire the behavior, because a reward is sparsely given. Moreover, the behaviors of the allies and opponents must be considered by the agents. In addition, in the soccer task, if the agents attempt to acquire high-level cooperative behavior from low-level movements, such as ball kicking, a huge amount of time will be needed to learn a model. In this research, we conduct experiments in which reward shaping is incorporated into deep reinforcement learning. This enables the agents to efficiently acquire cooperative behavior from low-level movements in a soccer task. The findings of this research indicate that reward shaping with a designer's domain knowledge positively influences the agent's attempt to acquire cooperative behavior from low-level movements.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Acquisition of Cooperative Behavior in a Soccer Task Using Reward Shaping\",\"authors\":\"Takashi Abe, R. Orihara, Y. Sei, Yasuyuki Tahara, Akihiko Ohsuga\",\"doi\":\"10.1145/3461353.3461360\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this research, soccer task is investigated among the numerous tasks of deep reinforcement learning. The soccer task requires cooperative behavior. However, it is difficult for the agents to acquire the behavior, because a reward is sparsely given. Moreover, the behaviors of the allies and opponents must be considered by the agents. In addition, in the soccer task, if the agents attempt to acquire high-level cooperative behavior from low-level movements, such as ball kicking, a huge amount of time will be needed to learn a model. In this research, we conduct experiments in which reward shaping is incorporated into deep reinforcement learning. This enables the agents to efficiently acquire cooperative behavior from low-level movements in a soccer task. The findings of this research indicate that reward shaping with a designer's domain knowledge positively influences the agent's attempt to acquire cooperative behavior from low-level movements.\",\"PeriodicalId\":114871,\"journal\":{\"name\":\"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3461353.3461360\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3461353.3461360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在本研究中，足球任务是深度强化学习的众多任务之一。足球任务需要合作行为。然而，由于奖励是稀疏的，代理很难获得这种行为。此外，代理人还必须考虑盟友和对手的行为。此外，在足球任务中，如果智能体试图从低级动作(如踢球)中获得高级合作行为，则需要大量的时间来学习模型。在本研究中，我们进行了将奖励形成纳入深度强化学习的实验。这使得智能体能够有效地从足球任务中的低级动作中获得合作行为。本研究结果表明，基于设计师领域知识的奖励塑造正向影响代理从低级动作获得合作行为的尝试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Acquisition of Cooperative Behavior in a Soccer Task Using Reward Shaping

In this research, soccer task is investigated among the numerous tasks of deep reinforcement learning. The soccer task requires cooperative behavior. However, it is difficult for the agents to acquire the behavior, because a reward is sparsely given. Moreover, the behaviors of the allies and opponents must be considered by the agents. In addition, in the soccer task, if the agents attempt to acquire high-level cooperative behavior from low-level movements, such as ball kicking, a huge amount of time will be needed to learn a model. In this research, we conduct experiments in which reward shaping is incorporated into deep reinforcement learning. This enables the agents to efficiently acquire cooperative behavior from low-level movements in a soccer task. The findings of this research indicate that reward shaping with a designer's domain knowledge positively influences the agent's attempt to acquire cooperative behavior from low-level movements.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence

自引率

0.00%

发文量