Dania Maryam Waqar, T. Gunawan, M. Kartiwi, R. Ahmad
{"title":"使用卷积神经网络的实时语音控制游戏交互","authors":"Dania Maryam Waqar, T. Gunawan, M. Kartiwi, R. Ahmad","doi":"10.1109/ICSIMA50015.2021.9526318","DOIUrl":null,"url":null,"abstract":"Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.","PeriodicalId":404811,"journal":{"name":"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)","volume":"199 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Real-Time Voice-Controlled Game Interaction using Convolutional Neural Networks\",\"authors\":\"Dania Maryam Waqar, T. Gunawan, M. Kartiwi, R. Ahmad\",\"doi\":\"10.1109/ICSIMA50015.2021.9526318\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.\",\"PeriodicalId\":404811,\"journal\":{\"name\":\"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)\",\"volume\":\"199 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIMA50015.2021.9526318\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIMA50015.2021.9526318","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-Time Voice-Controlled Game Interaction using Convolutional Neural Networks
Speech recognition has gained growing popularity due to its wide applications in almost every field, ranging from wake-word recognition, emotion recognition, command recognition, and interactive game. Recently, there is a growing interest in using voice in the gaming industry. Voice-controlled interaction made gaming much more accessible to a wider audience. However, the use of voice to control games requires real-time processing to avoid unwanted delay. This paper proposes speech command recognition using Convolutional Neural Networks (CNN) to control the popular snake game. First, the limited dataset for Up, Down, Left, Right speech commands was prepared for training, validation, and testing. Second, an optimum MFCC and CNN-based speech command recognition were proposed to recognize the four speech command. Results showed that our proposed algorithm could achieve high recognition accuracy of 96.5% and was able to detect all four commands. Finally, the proposed algorithm is integrated with a Python-based snake game.