Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain
{"title":"使用孟加拉语语音命令识别和人脸检测的数字个人助理","authors":"Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain","doi":"10.1109/RAAICON48939.2019.47","DOIUrl":null,"url":null,"abstract":"Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.","PeriodicalId":102214,"journal":{"name":"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A Digital Personal Assistant using Bangla Voice Command Recognition and Face Detection\",\"authors\":\"Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain\",\"doi\":\"10.1109/RAAICON48939.2019.47\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.\",\"PeriodicalId\":102214,\"journal\":{\"name\":\"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAAICON48939.2019.47\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAAICON48939.2019.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Digital Personal Assistant using Bangla Voice Command Recognition and Face Detection
Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.