基于姿态估计和动作分析的危险行为识别

Proceedings of the 4th International Symposium on Signal Processing Systems Pub Date : 2022-03-25 DOI:10.1145/3532342.3532343

Xiaojun Bai, Z. Wang, Zhiying Zhang

{"title":"基于姿态估计和动作分析的危险行为识别","authors":"Xiaojun Bai, Z. Wang, Zhiying Zhang","doi":"10.1145/3532342.3532343","DOIUrl":null,"url":null,"abstract":"In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dangerous Behavior Recognition Based on Pose Estimation and Action Analysis\",\"authors\":\"Xiaojun Bai, Z. Wang, Zhiying Zhang\",\"doi\":\"10.1145/3532342.3532343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.\",\"PeriodicalId\":398859,\"journal\":{\"name\":\"Proceedings of the 4th International Symposium on Signal Processing Systems\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Symposium on Signal Processing Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3532342.3532343\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Symposium on Signal Processing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3532342.3532343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了识别公共场所打斗等危险行为，提出了一种结合人类后估计和行为分析的深度学习模型，用于行为识别。该模型以视频帧序列为输入，首先利用HRNet作为骨干网络对人体关节进行检测，生成人体姿态帧序列，然后利用递归神经网络对姿态帧序列进行动作分析，从而判断该视频中是否存在危险行为。第一部分对HRNet进行了优化，减小了HRNet的参数，提高了关节定位的精度。第二部分介绍了Cubic LSTM作为动作识别的综合模型，该模型从时间序列和空间序列两方面分析人体关节的运动，从而获得更好的推理分数。实验表明，该方法的识别准确率可达92.8%。最后，在此基础上开发了一个危险行为识别系统，该系统利用监控主机对摄像机捕获的视频帧进行分析，自动识别危险行为，为公安工作服务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Dangerous Behavior Recognition Based on Pose Estimation and Action Analysis

In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 4th International Symposium on Signal Processing Systems

自引率

0.00%

发文量