Cautious Curiosity: A Novel Approach to a Human-Like Gameplay Agent

Chujin Zhou, Tiago Machado, Casper Harteveld
{"title":"Cautious Curiosity: A Novel Approach to a Human-Like Gameplay Agent","authors":"Chujin Zhou, Tiago Machado, Casper Harteveld","doi":"10.1609/aiide.v19i1.27533","DOIUrl":null,"url":null,"abstract":"We introduce a new reward function direction for intrinsically motivated reinforcement learning to mimic human behavior in the context of computer games. Similar to previous research, we focus on so-called ``curiosity agents'', which are agents whose intrinsic reward is based on the concept of curiosity. We designed our novel intrinsic reward, which we call ``Cautious Curiosity'' (CC) based on (1) a theory that proposes curiosity as a psychological definition called information gap, and (2) a recent study showing that the relationship between curiosity and information gap is an inverted U-curve. In this work, we compared our agent using the classic game Super Mario Bros. with (1) a random agent, (2) an agent based on the Asynchronous Advantage Actor Critic algorithm (A3C), (3) an agent based on the Intrinsic Curiosity Module (ICM), and (4) an average human player. We also asked participants (n = 100) to watch videos of these agents and rate how human-like they are. The main contribution of this work is that we present a reward function that, as perceived by humans, induces an agent to play a computer game similarly to a human, while maintaining its competitiveness and being more believable compared to other agents.","PeriodicalId":498041,"journal":{"name":"Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/aiide.v19i1.27533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We introduce a new reward function direction for intrinsically motivated reinforcement learning to mimic human behavior in the context of computer games. Similar to previous research, we focus on so-called ``curiosity agents'', which are agents whose intrinsic reward is based on the concept of curiosity. We designed our novel intrinsic reward, which we call ``Cautious Curiosity'' (CC) based on (1) a theory that proposes curiosity as a psychological definition called information gap, and (2) a recent study showing that the relationship between curiosity and information gap is an inverted U-curve. In this work, we compared our agent using the classic game Super Mario Bros. with (1) a random agent, (2) an agent based on the Asynchronous Advantage Actor Critic algorithm (A3C), (3) an agent based on the Intrinsic Curiosity Module (ICM), and (4) an average human player. We also asked participants (n = 100) to watch videos of these agents and rate how human-like they are. The main contribution of this work is that we present a reward function that, as perceived by humans, induces an agent to play a computer game similarly to a human, while maintaining its competitiveness and being more believable compared to other agents.
谨慎的好奇心:一种创造类人玩法代理的新方法
我们为内在动机强化学习引入了一个新的奖励函数方向,以模拟计算机游戏环境中的人类行为。与之前的研究类似,我们关注的是所谓的“好奇心代理”,这是基于好奇心概念的内在奖励的代理。我们设计了新的内在奖励,我们称之为“谨慎好奇心”(CC),这是基于(1)一个将好奇心作为一种心理学定义的理论,称为信息缺口,以及(2)最近的一项研究表明好奇心和信息缺口之间的关系是倒u型曲线。在这项工作中,我们将使用经典游戏《超级马里奥兄弟》的智能体与(1)随机智能体、(2)基于异步优势参与者评论算法(A3C)的智能体、(3)基于内在好奇心模块(ICM)的智能体以及(4)普通人类玩家进行了比较。我们还要求参与者(n = 100)观看这些代理的视频,并评价它们与人类的相似程度。这项工作的主要贡献是,我们提出了一个奖励函数,在人类感知的情况下,诱导智能体像人类一样玩电脑游戏,同时保持其竞争力,与其他智能体相比更可信。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信