Object Detection using Speech Recognition

Impact factor (IF): 0.3
Chetana B. Thaokar, Gayatri Ladsawangikar, Tanaya Wadibhasme, Sandeep Sureka
DOI: 10.47164/ijngc.v13i5.974 (https://doi.org/10.47164/ijngc.v13i5.974)
Journal: International Journal of Next-Generation Computing
Publication date: 2022-11-26
Publication type: Journal Article
Citations: 0

Abstract

Object detection underpins many practical applications, including autonomous navigation, visual systems, and face recognition. This paper combines object detection with speech recognition to help visually impaired people locate a specific object using voice commands. People who are blind or visually impaired can move more independently when they are aware of their surroundings. A model has been implemented using the OpenCV libraries, and good results have been obtained. The paper also reviews object detection with region-based convolutional neural network (CNN) learning systems for practical applications. It examines object identification using the YOLOv4 detection technique and walks through the detection process, including a speech recognition system that transcribes spoken language into text.
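The abstract does not give implementation details, so the following is only a minimal sketch of how a transcribed voice command might be matched against detector output. It assumes the speech recognizer has already produced a text transcript and that a detector (for example, a YOLOv4 model run through OpenCV) has already produced labelled boxes. The names `Detection`, `target_from_command`, `find_object`, and `describe`, the 0.5 confidence threshold, and the left/ahead/right wording are all hypothetical and do not come from the paper.

```python
import re
from dataclasses import dataclass


@dataclass
class Detection:
    """One detector output (hypothetical structure, not from the paper)."""
    label: str         # class name, e.g. "bottle"
    confidence: float  # detector score in [0, 1]
    box: tuple         # (x, y, width, height) in pixels


def target_from_command(transcript, known_labels):
    """Return the longest known label that appears as whole words in the transcript."""
    text = transcript.lower()
    # Try longer labels first so multi-word names such as "cell phone" are preferred.
    for label in sorted(known_labels, key=len, reverse=True):
        if re.search(r"\b" + re.escape(label.lower()) + r"\b", text):
            return label
    return None


def find_object(transcript, detections, known_labels, min_confidence=0.5):
    """Return (target label, matching detections sorted by descending confidence)."""
    target = target_from_command(transcript, known_labels)
    if target is None:
        return None, []
    hits = [d for d in detections
            if d.label == target and d.confidence >= min_confidence]
    hits.sort(key=lambda d: d.confidence, reverse=True)
    return target, hits


def describe(target, hits, frame_width):
    """Build a short sentence that a text-to-speech step could read aloud."""
    if target is None:
        return "No known object name was recognised in the command."
    if not hits:
        return f"No {target} found."
    x, _, width, _ = hits[0].box
    centre = x + width / 2
    # Assumed convention: split the frame into three equal vertical bands.
    if centre < frame_width / 3:
        side = "on the left"
    elif centre > 2 * frame_width / 3:
        side = "on the right"
    else:
        side = "ahead"
    return f"{target} detected {side}."
```

Whole-word matching is used here so that a command such as "open the cupboard" does not trigger a search for "cup". A real system would likely need a more forgiving mapping from spoken phrases to class names.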
Source journal
International Journal of Next-Generation Computing (Computer Science, Theory & Methods)
Self-citation rate: 66.70%
Articles published: 60