Real-Time Voice-Controlled Game Interaction using Convolutional Neural Networks

Dania Maryam Waqar, T. Gunawan, M. Kartiwi, R. Ahmad
{"title":"Real-Time Voice-Controlled Game Interaction using Convolutional Neural Networks","authors":"Dania Maryam Waqar, T. Gunawan, M. Kartiwi, R. Ahmad","doi":"10.1109/ICSIMA50015.2021.9526318","DOIUrl":null,"url":null,"abstract":"Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.","PeriodicalId":404811,"journal":{"name":"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)","volume":"199 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIMA50015.2021.9526318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.
使用卷积神经网络的实时语音控制游戏交互
语音识别由于其在唤醒词识别、情感识别、命令识别和互动游戏等几乎每个领域的广泛应用而越来越受欢迎。最近,人们对在游戏行业中使用语音越来越感兴趣。语音控制互动让更多用户更容易接触到游戏。然而,使用语音控制游戏需要实时处理,以避免不必要的延迟。本文提出了一种基于卷积神经网络(CNN)的语音命令识别方法来控制流行的蛇类游戏。首先,为上、下、左、右语音命令的有限数据集准备了训练、验证和测试。其次,提出了一种最优的基于MFCC和cnn的语音命令识别方法来识别四种语音命令。结果表明,该算法的识别准确率高达96.5%,能够检测到所有4个命令。最后,将该算法与基于python的蛇类游戏相结合。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信