Research on Neural Networks Integration for Object Classification in Video Analysis Systems

2020 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) Pub Date : 2020-05-01 DOI:10.1109/ICIEAM48468.2020.9112011

I. Fomin, Vladislav Burin, A. Bakhshiev

{"title":"Research on Neural Networks Integration for Object Classification in Video Analysis Systems","authors":"I. Fomin, Vladislav Burin, A. Bakhshiev","doi":"10.1109/ICIEAM48468.2020.9112011","DOIUrl":null,"url":null,"abstract":"Object recognition with the help of outdoor video surveillance cameras is an important task in the context of ensuring the security at enterprises, public places and even private premises. There have long existed systems that allow detecting moving objects in the image sequence from a video surveillance system. Such a system is partially considered in this research. It detects moving objects using a background model, which has certain problems. Due to this some objects are missed or detected falsely. We propose to combine the moving objects detection results with the classification, using a deep neural network. This will allow determining whether a detected object belongs to a certain class, sorting out false detections, discarding the unnecessary ones (sometimes individual classes are unwanted), to divide detected people into the employees in the uniform and all others, etc. The authors perform a network training in the Keras developer-friendly environment that provides for quick building, changing and training of network architectures. The performance of the Keras integration into a video analysis system, using direct Python script execution techniques, is between 6 and 52 ms, while the precision is between 59.1% and 97.2% for different architectures. The integration, made by freezing a selected network architecture with weights, is selected after testing. After that, frozen architecture can be imported into video analysis using the TensorFlow interface for C++. The performance of such type of integration is between 3 and 49 ms. The precision is between 63.4% and 97.8% for different architectures.","PeriodicalId":285590,"journal":{"name":"2020 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)","volume":"58 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEAM48468.2020.9112011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Object recognition with the help of outdoor video surveillance cameras is an important task in the context of ensuring the security at enterprises, public places and even private premises. There have long existed systems that allow detecting moving objects in the image sequence from a video surveillance system. Such a system is partially considered in this research. It detects moving objects using a background model, which has certain problems. Due to this some objects are missed or detected falsely. We propose to combine the moving objects detection results with the classification, using a deep neural network. This will allow determining whether a detected object belongs to a certain class, sorting out false detections, discarding the unnecessary ones (sometimes individual classes are unwanted), to divide detected people into the employees in the uniform and all others, etc. The authors perform a network training in the Keras developer-friendly environment that provides for quick building, changing and training of network architectures. The performance of the Keras integration into a video analysis system, using direct Python script execution techniques, is between 6 and 52 ms, while the precision is between 59.1% and 97.2% for different architectures. The integration, made by freezing a selected network architecture with weights, is selected after testing. After that, frozen architecture can be imported into video analysis using the TensorFlow interface for C++. The performance of such type of integration is between 3 and 49 ms. The precision is between 63.4% and 97.8% for different architectures.

查看原文本刊更多论文

视频分析系统中目标分类的神经网络集成研究

利用户外视频监控摄像机进行物体识别，是保障企业、公共场所乃至私人场所安全的重要任务。长期以来，已有系统允许从视频监控系统的图像序列中检测移动物体。本研究部分考虑了这一系统。它使用背景模型来检测移动物体，这有一定的问题。由于这个原因，一些对象被遗漏或错误地检测到。我们提出使用深度神经网络将运动目标检测结果与分类相结合。这将允许确定检测到的对象是否属于某个类别，分类错误的检测，丢弃不必要的检测(有时单个类别是不需要的)，将检测到的人员分为穿制服的员工和所有其他人，等等。作者在Keras开发人员友好的环境中执行网络训练，该环境提供了快速构建，更改和训练网络架构。Keras集成到视频分析系统中，使用直接的Python脚本执行技术，性能在6到52毫秒之间，而不同架构的精度在59.1%到97.2%之间。通过将选定的网络体系结构与权重冻结来进行集成，并在测试后进行选择。之后，可以使用c++的TensorFlow接口将冻结架构导入到视频分析中。这种类型集成的性能在3到49毫秒之间。对于不同的体系结构，精度在63.4%到97.8%之间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)

自引率

0.00%

发文量