{"title":"闭路电视摄像机中基于深度神经网络的实时暴力活动检测","authors":"D. S, Sushant Govindraj, S. N. Omkar","doi":"10.1109/CONECCT52877.2021.9622739","DOIUrl":null,"url":null,"abstract":"In surveillance systems one of the most difficult and critical tasks is detecting violence, most video surveillance systems are faced with the challenges of false alarms and working in a real-time environment. This paper introduces a lightweight system that can be employed in real-time systems. The system proposed uses OpenPose for multi-person 2D pose estimation, YoloV3 for person detection, and a CNN to classify the violent action. OpenPose unlike other systems uses a bottom-up approach which decouples the running time from the number of people in the frame, YoloV3 deployed for person detection has a very fast processing time and the CNN used was trained on the skeleton dataset that was generated and had very good accuracy. We also propose an skeleton image dataset of three action categories, namely kicking, punching, and non-violent.","PeriodicalId":164499,"journal":{"name":"2021 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-time Violence Activity Detection Using Deep Neural Networks in a CCTV camera\",\"authors\":\"D. S, Sushant Govindraj, S. N. Omkar\",\"doi\":\"10.1109/CONECCT52877.2021.9622739\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In surveillance systems one of the most difficult and critical tasks is detecting violence, most video surveillance systems are faced with the challenges of false alarms and working in a real-time environment. This paper introduces a lightweight system that can be employed in real-time systems. The system proposed uses OpenPose for multi-person 2D pose estimation, YoloV3 for person detection, and a CNN to classify the violent action. OpenPose unlike other systems uses a bottom-up approach which decouples the running time from the number of people in the frame, YoloV3 deployed for person detection has a very fast processing time and the CNN used was trained on the skeleton dataset that was generated and had very good accuracy. We also propose an skeleton image dataset of three action categories, namely kicking, punching, and non-violent.\",\"PeriodicalId\":164499,\"journal\":{\"name\":\"2021 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONECCT52877.2021.9622739\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONECCT52877.2021.9622739","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-time Violence Activity Detection Using Deep Neural Networks in a CCTV camera
In surveillance systems one of the most difficult and critical tasks is detecting violence, most video surveillance systems are faced with the challenges of false alarms and working in a real-time environment. This paper introduces a lightweight system that can be employed in real-time systems. The system proposed uses OpenPose for multi-person 2D pose estimation, YoloV3 for person detection, and a CNN to classify the violent action. OpenPose unlike other systems uses a bottom-up approach which decouples the running time from the number of people in the frame, YoloV3 deployed for person detection has a very fast processing time and the CNN used was trained on the skeleton dataset that was generated and had very good accuracy. We also propose an skeleton image dataset of three action categories, namely kicking, punching, and non-violent.