Edge Artificial Intelligence: A Multi-Camera Video Surveillance Application

Volume 7: 17th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA) Pub Date : 2021-08-17 DOI:10.1115/detc2021-70738

Daniele Berardini, A. Mancini, P. Zingaretti, S. Moccia

{"title":"Edge Artificial Intelligence: A Multi-Camera Video Surveillance Application","authors":"Daniele Berardini, A. Mancini, P. Zingaretti, S. Moccia","doi":"10.1115/detc2021-70738","DOIUrl":null,"url":null,"abstract":"\n Nowadays, video surveillance has a crucial role. Analyzing surveillance videos is, however, a time consuming and tiresome procedure. In the last years, artificial intelligence paved the way for automatic and accurate surveillance-video analysis. In parallel to the development of artificial-intelligence methodologies, edge computing is becoming an active field of research with the final goal to provide cost-effective and real time deployment of the developed methodologies. In this work, we present an edge artificial intelligence application to video surveillance. Our approach relies on a set of four IP cameras, which acquire video frames that are processed on the edge using the NVIDIA® Jetson Nano. A state-of-the-art deep-learning model, called Single Shot multibox Detector (SSD) MobileNetV2 network, is used to perform object and people detection in real-time. The proposed infrastructure obtained an inference speed of ∼10.0 Frames per Second (FPS) for each parallel video stream. These results prompt the possibility of translating our work into a real word scenario. The integration of the presented application into a wider monitoring system with a central unit could bring benefits to the overall infrastructure. Indeed our application could send only video-related high-level information to the central unit, allowing it to combine information with data coming from other sensing devices without unuseful data overload. This would ensure a fast response in case of emergency or detected anomalies. We hope this work will contribute to stimulate the research in the field of edge artificial intelligence for video surveillance.","PeriodicalId":221388,"journal":{"name":"Volume 7: 17th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 7: 17th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-70738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Nowadays, video surveillance has a crucial role. Analyzing surveillance videos is, however, a time consuming and tiresome procedure. In the last years, artificial intelligence paved the way for automatic and accurate surveillance-video analysis. In parallel to the development of artificial-intelligence methodologies, edge computing is becoming an active field of research with the final goal to provide cost-effective and real time deployment of the developed methodologies. In this work, we present an edge artificial intelligence application to video surveillance. Our approach relies on a set of four IP cameras, which acquire video frames that are processed on the edge using the NVIDIA® Jetson Nano. A state-of-the-art deep-learning model, called Single Shot multibox Detector (SSD) MobileNetV2 network, is used to perform object and people detection in real-time. The proposed infrastructure obtained an inference speed of ∼10.0 Frames per Second (FPS) for each parallel video stream. These results prompt the possibility of translating our work into a real word scenario. The integration of the presented application into a wider monitoring system with a central unit could bring benefits to the overall infrastructure. Indeed our application could send only video-related high-level information to the central unit, allowing it to combine information with data coming from other sensing devices without unuseful data overload. This would ensure a fast response in case of emergency or detected anomalies. We hope this work will contribute to stimulate the research in the field of edge artificial intelligence for video surveillance.

查看原文本刊更多论文

边缘人工智能:多摄像头视频监控应用

如今，视频监控扮演着至关重要的角色。然而，分析监控视频是一个耗时且令人厌烦的过程。在过去的几年里，人工智能为自动和准确的监控视频分析铺平了道路。与人工智能方法的发展并行，边缘计算正在成为一个活跃的研究领域，其最终目标是为开发的方法提供成本效益和实时部署。在这项工作中，我们提出了一种边缘人工智能在视频监控中的应用。我们的方法依赖于一组四个IP摄像机，它们获取视频帧，并使用NVIDIA®Jetson Nano在边缘进行处理。最先进的深度学习模型，称为单镜头多盒检测器(SSD) MobileNetV2网络，用于实时检测物体和人员。所提出的基础设施为每个并行视频流获得了每秒10.0帧(FPS)的推理速度。这些结果提示了将我们的工作转化为真实世界场景的可能性。将所介绍的应用程序集成到具有中央单元的更广泛的监视系统中可以为整个基础设施带来好处。实际上，我们的应用程序只能向中央单元发送与视频相关的高级信息，允许它将信息与来自其他传感设备的数据结合起来，而不会产生无用的数据过载。这将确保在发生紧急情况或发现异常情况时迅速作出反应。我们希望这项工作将有助于促进视频监控边缘人工智能领域的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Volume 7: 17th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)

自引率

0.00%

发文量