使用孟加拉语语音命令识别和人脸检测的数字个人助理

Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain
{"title":"使用孟加拉语语音命令识别和人脸检测的数字个人助理","authors":"Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain","doi":"10.1109/RAAICON48939.2019.47","DOIUrl":null,"url":null,"abstract":"Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.","PeriodicalId":102214,"journal":{"name":"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A Digital Personal Assistant using Bangla Voice Command Recognition and Face Detection\",\"authors\":\"Dipankar Gupta, Emam Hossain, Mohammad Shahadat Hossain, Karl Andersson, S. Hossain\",\"doi\":\"10.1109/RAAICON48939.2019.47\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.\",\"PeriodicalId\":102214,\"journal\":{\"name\":\"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAAICON48939.2019.47\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Robotics, Automation, Artificial-intelligence and Internet-of-Things (RAAICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAAICON48939.2019.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

虽然语音识别在过去的几十年里一直是研究人员的共同兴趣,但在孟加拉语语音识别方面的工作却很少。在本研究中,我们开发了一款能够识别连续孟加拉语语音指令的残疾人数字个人助理。我们采用交叉相关技术,将孟加拉语语音命令的能量与预先录制的参考信号进行比较。在识别孟加拉语命令后,它执行该命令指定的任务。还可以使用用户的面部运动来控制鼠标光标。我们在三种不同的环境(有噪声、适度和无噪声)中验证了我们的模型,以便模型可以自然地运行。我们还将我们的模型与MFCC和DTW的组合模型以及另一个将相互关联与LPC相结合的模型进行了比较。结果表明,与其他两种技术相比,该模型具有更高的精度和更短的响应时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Digital Personal Assistant using Bangla Voice Command Recognition and Face Detection
Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信