Implementation Voice Command System for Soccer Robot ERSOW
Muhammad Rizal Prasetyo, Iwan Kumianto Wibowo, M. Bachtiar, Renardi Adryantoro Priambudi, Khoirul Anwar, Putu Bagus Kertha Segara
2020 International Electronics Symposium (IES), published 2020-09-01. DOI: https://doi.org/10.1109/IES50839.2020.9231941
Abstract
ERSOW is a wheeled soccer robot that competes in the Middle Size League (MSL) category of the Indonesian Wheeled Robot Soccer Contest division (Wheeled KRSBI). A wheeled soccer robot uses artificial intelligence (AI) to kick the ball, receive the ball, feed the ball, recognize the ball, recognize the opponent, recognize the goal, receive instructions from the base station, and so forth. This research focuses on giving instructions to ERSOW through the base station using a voice command system. The system takes speech as input in the form of analog signals. Speech recognition is performed with a deep speech package, which produces text as output. The system runs on the Robot Operating System (ROS). With a model trained on 13 speakers, tests by one speaker at different distances show an average Word Error Rate (WER) of 0.46% and a Word Accuracy (W Acc) of 99.54%. Tests with five different speakers show an average WER of 4.37% and W Acc of 95.63% for trained speakers, and an average WER of 28.25% and W Acc of 71.75% for non-trained speakers. The base station implementation shows a simulation of the robot responding when the user gives instructions by voice.
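
The abstract reports WER and W Acc without stating how they are computed. The sketch below assumes the conventional definitions: WER is the word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words, and W Acc is 1 - WER, which matches the reported pairs (for example, 0.46% and 99.54%). The function names and the sample commands are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words (assumed definition)."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def word_accuracy(reference: str, hypothesis: str) -> float:
    """W Acc = 1 - WER (assumed definition; can be negative with many insertions)."""
    return 1.0 - word_error_rate(reference, hypothesis)


if __name__ == "__main__":
    # Hypothetical spoken command and recognizer output.
    spoken = "pass the ball to robot two"
    recognized = "pass ball to robot too"
    print(f"WER   = {word_error_rate(spoken, recognized):.2%}")
    print(f"W Acc = {word_accuracy(spoken, recognized):.2%}")
```

In the hypothetical example, one deleted word ("the") and one substituted word ("two" recognized as "too") against six reference words give a WER of 2/6.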