Object Recognition based on Deep Learning Algorithms using Embedded IoT with Interactive Interface

Swapna Borde, Chandan Patil, Chinmay Sonawane, Mankrit Singh
{"title":"Object Recognition based on Deep Learning Algorithms using Embedded IoT with Interactive Interface","authors":"Swapna Borde, Chandan Patil, Chinmay Sonawane, Mankrit Singh","doi":"10.1109/ICICCS56967.2023.10142821","DOIUrl":null,"url":null,"abstract":"Object detection is one of the most popular applications of machine learning in the modern era. With the growth of IOT in recent times, dedicated devices offering real-time object detection have seen overwhelming demand and applications in many sectors e.g. security, healthcare, workplace etc. Different algorithms and approaches have been implemented and studied in terms of object detection. To refer to a few the YOLO family, RCNN family, SSD etc. This research study compares the performance of YOLO and Faster RCNN based on a custom dataset containing different objects and items. The IOU of each data point (image) is calculated and compared. YOLO performs better for a small margin. In this research study, a language learning interactive model is demonstrated based on object detection(YOLO), NLP, python, flask and IoT (Raspberry Pi.) The web application is running on is divided into two parts, learning and practice. The learning part has a raspberry pi device which has a camera module that captures real-time footage, recognizes the object and reads out its name in the language the user is learning. The practice has the same setup, with the application asking the user to show a particular object. Points are awarded for correct guesses, making the learning process more interactive and involving.","PeriodicalId":219272,"journal":{"name":"2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS)","volume":"182 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICCS56967.2023.10142821","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Object detection is one of the most popular applications of machine learning in the modern era. With the growth of IOT in recent times, dedicated devices offering real-time object detection have seen overwhelming demand and applications in many sectors e.g. security, healthcare, workplace etc. Different algorithms and approaches have been implemented and studied in terms of object detection. To refer to a few the YOLO family, RCNN family, SSD etc. This research study compares the performance of YOLO and Faster RCNN based on a custom dataset containing different objects and items. The IOU of each data point (image) is calculated and compared. YOLO performs better for a small margin. In this research study, a language learning interactive model is demonstrated based on object detection(YOLO), NLP, python, flask and IoT (Raspberry Pi.) The web application is running on is divided into two parts, learning and practice. The learning part has a raspberry pi device which has a camera module that captures real-time footage, recognizes the object and reads out its name in the language the user is learning. The practice has the same setup, with the application asking the user to show a particular object. Points are awarded for correct guesses, making the learning process more interactive and involving.
基于深度学习算法的嵌入式物联网对象识别与交互界面
目标检测是现代机器学习最流行的应用之一。随着近年来物联网的发展,提供实时对象检测的专用设备在安全,医疗保健,工作场所等许多领域都有巨大的需求和应用。在目标检测方面,已经实现和研究了不同的算法和方法。可以参考几个YOLO家族、RCNN家族、SSD等。本研究基于包含不同对象和项目的自定义数据集,比较了YOLO和Faster RCNN的性能。计算并比较每个数据点(图像)的IOU。YOLO在小范围内表现更好。在本研究中,展示了一个基于对象检测(YOLO)、自然语言处理(NLP)、python、flask和物联网(Raspberry Pi)的语言学习交互模型。web应用程序的运行分为两个部分,学习和实践。学习部分有一个树莓派设备,它有一个摄像头模块,可以捕捉实时镜头,识别物体,并用用户正在学习的语言读出它的名字。实践有相同的设置,应用程序要求用户显示一个特定的对象。正确的猜测会获得分数,使学习过程更具互动性和参与性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信