{"title":"基于姿态估计和动作分析的危险行为识别","authors":"Xiaojun Bai, Z. Wang, Zhiying Zhang","doi":"10.1145/3532342.3532343","DOIUrl":null,"url":null,"abstract":"In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dangerous Behavior Recognition Based on Pose Estimation and Action Analysis\",\"authors\":\"Xiaojun Bai, Z. Wang, Zhiying Zhang\",\"doi\":\"10.1145/3532342.3532343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.\",\"PeriodicalId\":398859,\"journal\":{\"name\":\"Proceedings of the 4th International Symposium on Signal Processing Systems\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Symposium on Signal Processing Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3532342.3532343\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Symposium on Signal Processing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3532342.3532343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dangerous Behavior Recognition Based on Pose Estimation and Action Analysis
In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.