研究视觉图像识别和分类的最新方法

V. P. Lysechko, B. I. Sadovnykov, O. M. Komar, О. S. Zhuchenko
{"title":"研究视觉图像识别和分类的最新方法","authors":"V. P. Lysechko, B. I. Sadovnykov, O. M. Komar, О. S. Zhuchenko","doi":"10.15588/1607-3274-2024-1-13","DOIUrl":null,"url":null,"abstract":"Context. The paper provides an overview of current methods for recognizing and classifying visual images in static images or video stream. The paper will discuss various approaches, including machine learning, current problems of these methods and possible improvements. The biggest challenges of the visual image retrieval and classification task are discussed. The main emphasis is placed on the review of such promising algorithms as SSD, YOLO, R-CNN, an overview of the principles of these methods, network architectures. \nObjective. The aim of the work is to analyze existing studies and find the best algorithm for recognizing and classifying visual images for further activities. \nMethod. Primary method is to compare different factors of algorithms in order to select the most perspective one. There are different marks to compare, like image processing speed, accuracy. There are a number of studies and publications that propose methods and algorithms for solving the problem of finding and classifying images in an image [3–6]. It should be noted that most promising approaches are based on machine learning methods. It is worth noting that the proposed methods have drawbacks due to the imperfect implementation of the Faster R-CNN, YOLO, SSD algorithms for working with streaming video. The impact of these drawbacks can be significantly reduced by applying the following solutions: development of combined identification methods, processing of edge cases – tracking the position of identified objects, using the difference between video frames, additional preliminary preparation of input data. Another major area for improvement is the optimization of methods to work with real-time video data, as most current methods focus on images. \nResults. As an outcome of the current research we have found an optimal algorithm for further researches and optimizations. \nConclusions. Analysis of existent papers and researches has demonstrated the most promising algorithm for further optimizations and experiments. Also current approaches still have some space for further. The next step is to take the chosen algorithm and investigate possibilities to enhance it.","PeriodicalId":518330,"journal":{"name":"Radio Electronics, Computer Science, Control","volume":"23 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A RESEARCH OF THE LATEST APPROACHES TO VISUAL IMAGE RECOGNITION AND CLASSIFICATION\",\"authors\":\"V. P. Lysechko, B. I. Sadovnykov, O. M. Komar, О. S. Zhuchenko\",\"doi\":\"10.15588/1607-3274-2024-1-13\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Context. The paper provides an overview of current methods for recognizing and classifying visual images in static images or video stream. The paper will discuss various approaches, including machine learning, current problems of these methods and possible improvements. The biggest challenges of the visual image retrieval and classification task are discussed. The main emphasis is placed on the review of such promising algorithms as SSD, YOLO, R-CNN, an overview of the principles of these methods, network architectures. \\nObjective. The aim of the work is to analyze existing studies and find the best algorithm for recognizing and classifying visual images for further activities. \\nMethod. Primary method is to compare different factors of algorithms in order to select the most perspective one. There are different marks to compare, like image processing speed, accuracy. There are a number of studies and publications that propose methods and algorithms for solving the problem of finding and classifying images in an image [3–6]. It should be noted that most promising approaches are based on machine learning methods. It is worth noting that the proposed methods have drawbacks due to the imperfect implementation of the Faster R-CNN, YOLO, SSD algorithms for working with streaming video. The impact of these drawbacks can be significantly reduced by applying the following solutions: development of combined identification methods, processing of edge cases – tracking the position of identified objects, using the difference between video frames, additional preliminary preparation of input data. Another major area for improvement is the optimization of methods to work with real-time video data, as most current methods focus on images. \\nResults. As an outcome of the current research we have found an optimal algorithm for further researches and optimizations. \\nConclusions. Analysis of existent papers and researches has demonstrated the most promising algorithm for further optimizations and experiments. Also current approaches still have some space for further. The next step is to take the chosen algorithm and investigate possibilities to enhance it.\",\"PeriodicalId\":518330,\"journal\":{\"name\":\"Radio Electronics, Computer Science, Control\",\"volume\":\"23 2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Radio Electronics, Computer Science, Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15588/1607-3274-2024-1-13\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radio Electronics, Computer Science, Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15588/1607-3274-2024-1-13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景。本文概述了当前对静态图像或视频流中的视觉图像进行识别和分类的方法。本文将讨论包括机器学习在内的各种方法、这些方法目前存在的问题以及可能的改进。还将讨论视觉图像检索和分类任务所面临的最大挑战。主要重点是回顾 SSD、YOLO、R-CNN 等有前途的算法,概述这些方法的原理和网络架构。目标。这项工作的目的是分析现有研究,为进一步的活动找到识别和分类视觉图像的最佳算法。方法。主要方法是对算法的不同因素进行比较,以选择最有前景的算法。需要比较的指标有很多,如图像处理速度、准确性等。有许多研究和出版物都提出了解决图像中图像查找和分类问题的方法和算法[3-6]。值得注意的是,大多数有前途的方法都是基于机器学习方法的。值得注意的是,由于 Faster R-CNN、YOLO 和 SSD 算法在处理流媒体视频方面的实施不完善,所提出的方法存在缺陷。通过采用以下解决方案,可以显著减少这些缺点的影响:开发组合识别方法、处理边缘情况--跟踪已识别物体的位置、使用视频帧之间的差值、对输入数据进行额外的初步准备。另一个需要改进的主要领域是优化处理实时视频数据的方法,因为目前的大多数方法都侧重于图像。成果。作为当前研究的一项成果,我们找到了一种最佳算法,可供进一步研究和优化。结论。对现有论文和研究的分析表明,最有前途的算法有待进一步优化和实验。此外,当前的方法仍有进一步发展的空间。下一步是采用所选算法,并研究改进它的可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A RESEARCH OF THE LATEST APPROACHES TO VISUAL IMAGE RECOGNITION AND CLASSIFICATION
Context. The paper provides an overview of current methods for recognizing and classifying visual images in static images or video stream. The paper will discuss various approaches, including machine learning, current problems of these methods and possible improvements. The biggest challenges of the visual image retrieval and classification task are discussed. The main emphasis is placed on the review of such promising algorithms as SSD, YOLO, R-CNN, an overview of the principles of these methods, network architectures. Objective. The aim of the work is to analyze existing studies and find the best algorithm for recognizing and classifying visual images for further activities. Method. Primary method is to compare different factors of algorithms in order to select the most perspective one. There are different marks to compare, like image processing speed, accuracy. There are a number of studies and publications that propose methods and algorithms for solving the problem of finding and classifying images in an image [3–6]. It should be noted that most promising approaches are based on machine learning methods. It is worth noting that the proposed methods have drawbacks due to the imperfect implementation of the Faster R-CNN, YOLO, SSD algorithms for working with streaming video. The impact of these drawbacks can be significantly reduced by applying the following solutions: development of combined identification methods, processing of edge cases – tracking the position of identified objects, using the difference between video frames, additional preliminary preparation of input data. Another major area for improvement is the optimization of methods to work with real-time video data, as most current methods focus on images. Results. As an outcome of the current research we have found an optimal algorithm for further researches and optimizations. Conclusions. Analysis of existent papers and researches has demonstrated the most promising algorithm for further optimizations and experiments. Also current approaches still have some space for further. The next step is to take the chosen algorithm and investigate possibilities to enhance it.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信