Chetana B. Thaokar, Gayatri Ladsawangikar, Tanaya Wadibhasme, Sandeep Sureka
{"title":"Object Detection using Speech Recognition","authors":"Chetana B. Thaokar, Gayatri Ladsawangikar, Tanaya Wadibhasme, Sandeep Sureka","doi":"10.47164/ijngc.v13i5.974","DOIUrl":null,"url":null,"abstract":"Nearly all practical applications, including autonomous navigation, visual systems, face recognition, and more, rely on object detection. In this paper, object detection and speech recognition are combined to help visually impaired people who want to use voice commands to find a certain object. People who are blind or visually challenged can move more independently if they are aware of their surroundings. With the use of the OpenCV libraries, a model has been implemented, and good results have been obtained. In this paper, a thorough review of object detection employing region-based conventional neural network (CNN)- based learning systems for practical applications has been conducted. This study examines the various object identification processes utilizing YOLOV4 object detection techniques and talks through detection, including a speech recognition system that was created by transcribing spoken language into text.","PeriodicalId":42021,"journal":{"name":"International Journal of Next-Generation Computing","volume":"41 1","pages":""},"PeriodicalIF":0.3000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Next-Generation Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47164/ijngc.v13i5.974","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nearly all practical applications, including autonomous navigation, visual systems, face recognition, and more, rely on object detection. In this paper, object detection and speech recognition are combined to help visually impaired people who want to use voice commands to find a certain object. People who are blind or visually challenged can move more independently if they are aware of their surroundings. With the use of the OpenCV libraries, a model has been implemented, and good results have been obtained. In this paper, a thorough review of object detection employing region-based conventional neural network (CNN)- based learning systems for practical applications has been conducted. This study examines the various object identification processes utilizing YOLOV4 object detection techniques and talks through detection, including a speech recognition system that was created by transcribing spoken language into text.