DEEP LEARNING FOR VISUAL RECOGNITION

G.Vijaya Lakshmi, B.Amrutha Varshini, Ch.Goutham Naidu, P.Viswanth Reddy, A.Raheem Khan
{"title":"DEEP LEARNING FOR VISUAL RECOGNITION","authors":"G.Vijaya Lakshmi, B.Amrutha Varshini, Ch.Goutham Naidu, P.Viswanth Reddy, A.Raheem Khan","doi":"10.36713/epra16464","DOIUrl":null,"url":null,"abstract":"The design and development of an advanced object detection system are presented in this work, which was guided by a thorough literature review and feasibility assessment. The literature review emphasises how object detection techniques have evolved, highlighting the shift from conventional techniques to deep learning approaches. Important developments are highlighted, such as feature pyramid networks, anchor-based localization, and region-based and single-stage detectors. Furthermore, offered are insights about assessment metrics, transfer learning, and data augmentation. A feasibility study assesses the suggested systems operational, technological, and financial viability and finds that it is highly feasible in each of these areas. The architecture of the system is modular and scalable, including backend services, data management, an object detection engine, and a user interface among its constituent parts. Specifications for both functional and non-functional needs are provided, which direct the development of the system. The development phases, resource allocation, development process, and quality assurance procedures are all outlined in the implementation plan. Through the integration of deep learning techniques, the suggested system seeks to achieve high-performance object identification capabilities that are appropriate for a variety of applications while being scalable, reliable, and user-friendly.","PeriodicalId":114964,"journal":{"name":"EPRA International Journal of Research & Development (IJRD)","volume":"113 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EPRA International Journal of Research & Development (IJRD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36713/epra16464","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The design and development of an advanced object detection system are presented in this work, which was guided by a thorough literature review and feasibility assessment. The literature review emphasises how object detection techniques have evolved, highlighting the shift from conventional techniques to deep learning approaches. Important developments are highlighted, such as feature pyramid networks, anchor-based localization, and region-based and single-stage detectors. Furthermore, offered are insights about assessment metrics, transfer learning, and data augmentation. A feasibility study assesses the suggested systems operational, technological, and financial viability and finds that it is highly feasible in each of these areas. The architecture of the system is modular and scalable, including backend services, data management, an object detection engine, and a user interface among its constituent parts. Specifications for both functional and non-functional needs are provided, which direct the development of the system. The development phases, resource allocation, development process, and quality assurance procedures are all outlined in the implementation plan. Through the integration of deep learning techniques, the suggested system seeks to achieve high-performance object identification capabilities that are appropriate for a variety of applications while being scalable, reliable, and user-friendly.
用于视觉识别的深度学习
本作品介绍了先进物体检测系统的设计和开发过程,并以全面的文献综述和可行性评估为指导。文献综述强调了物体检测技术的发展,突出了从传统技术到深度学习方法的转变。重点介绍了一些重要的发展,如特征金字塔网络、基于锚点的定位、基于区域的检测器和单级检测器。此外,还提供了有关评估指标、迁移学习和数据增强的见解。可行性研究对所建议的系统在操作、技术和财务方面的可行性进行了评估,发现该系统在这些方面都非常可行。该系统的结构是模块化和可扩展的,包括后台服务、数据管理、对象检测引擎和用户界面等组成部分。系统提供了功能性和非功能性需求规格,指导系统的开发。实施计划中概述了开发阶段、资源分配、开发流程和质量保证程序。通过整合深度学习技术,所建议的系统力求实现高性能的对象识别能力,适用于各种应用,同时具有可扩展性、可靠性和用户友好性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信