Investigating the Performance of Various Deep Neural Networks-based Approaches Designed to Identify Game Events in Gameplay Footage

IF 2.3 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Proceedings of the ACM on computer graphics and interactive techniques Pub Date : 2022-05-04 DOI:10.1145/3522624

Matheus Prado Prandini Faria, E. Julia, M. Z. Nascimento, Rita Maria Silva Julia

{"title":"Investigating the Performance of Various Deep Neural Networks-based Approaches Designed to Identify Game Events in Gameplay Footage","authors":"Matheus Prado Prandini Faria, E. Julia, M. Z. Nascimento, Rita Maria Silva Julia","doi":"10.1145/3522624","DOIUrl":null,"url":null,"abstract":"Video games, in addition to representing an extremely relevant field of entertainment and market, have been widely used as a case study in artificial intelligence for representing a problem with a high degree of complexity. In such studies, the investigation of approaches that endow player agents with the ability to retrieve relevant information from game scenes stands out, since such information can be very useful to improve their learning ability. This work proposes and analyses new deep learning-based models to identify game events occurring in Super Mario Bros gameplay footage. The architecture of each model is composed of a feature extractor convolutional neural network (CNN) and a classifier neural network (NN). The extracting CNN aims to produce a feature-based representation for game scenes and submit it to the classifier, so that the latter can identify the game event present in each scene. The models differ from each other according to the following elements: the type of the CNN; the type of the NN classifier; and the type of the game scene representation at the CNN input, being either single frames, or chunks, which are n-sequential frames (in this paper 6 frames were used per chunk) grouped into a single input. The main contribution of this article is to demonstrate the greater performance reached by the models which combines the chunk representation for the game scenes with the resources of the classifier recurrent neural networks (RNN).","PeriodicalId":74536,"journal":{"name":"Proceedings of the ACM on computer graphics and interactive techniques","volume":" ","pages":"1 - 17"},"PeriodicalIF":2.3000,"publicationDate":"2022-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM on computer graphics and interactive techniques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3522624","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Video games, in addition to representing an extremely relevant field of entertainment and market, have been widely used as a case study in artificial intelligence for representing a problem with a high degree of complexity. In such studies, the investigation of approaches that endow player agents with the ability to retrieve relevant information from game scenes stands out, since such information can be very useful to improve their learning ability. This work proposes and analyses new deep learning-based models to identify game events occurring in Super Mario Bros gameplay footage. The architecture of each model is composed of a feature extractor convolutional neural network (CNN) and a classifier neural network (NN). The extracting CNN aims to produce a feature-based representation for game scenes and submit it to the classifier, so that the latter can identify the game event present in each scene. The models differ from each other according to the following elements: the type of the CNN; the type of the NN classifier; and the type of the game scene representation at the CNN input, being either single frames, or chunks, which are n-sequential frames (in this paper 6 frames were used per chunk) grouped into a single input. The main contribution of this article is to demonstrate the greater performance reached by the models which combines the chunk representation for the game scenes with the resources of the classifier recurrent neural networks (RNN).

查看原文本刊更多论文

研究各种基于深度神经网络的方法的性能，这些方法旨在识别游戏玩法镜头中的游戏事件

电子游戏除了代表了一个与娱乐和市场极其相关的领域外，还被广泛用作人工智能的案例研究，因为它代表了一个高度复杂的问题。在这些研究中，对赋予玩家代理从游戏场景中检索相关信息的能力的方法的研究非常突出，因为这些信息对于提高他们的学习能力非常有用。这项工作提出并分析了新的基于深度学习的模型，以识别发生在超级马里奥兄弟游戏玩法镜头中的游戏事件。每个模型的体系结构由特征提取器卷积神经网络(CNN)和分类器神经网络(NN)组成。提取CNN的目的是对游戏场景产生基于特征的表示，并提交给分类器，使分类器能够识别每个场景中存在的游戏事件。模型之间的差异主要体现在以下几个方面:CNN的类型;NN分类器的类型;以及CNN输入的游戏场景表示类型，要么是单帧，要么是块，即n个连续帧(在本文中每个块使用6帧)分组到单个输入中。本文的主要贡献是展示了将游戏场景的块表示与分类器递归神经网络(RNN)的资源相结合的模型所达到的更高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ACM on computer graphics and interactive techniques

CiteScore

2.90

自引率

0.00%

发文量