{"title":"基于深度学习的室内流媒体视频异常活动识别","authors":"D. Kumar, Srinivasan Ramapriya Sailaja","doi":"10.23919/ITUK53220.2021.9662095","DOIUrl":null,"url":null,"abstract":"Human activity recognition has emerged as a challenging research domain for video analysis. The major issue for abnormal activity recognition in a streaming video is the presence of the large spatio-temporal data along with the constraints of communication networks affecting the quality of received data for analysis. In this paper, we propose a deep learning-based system to identify abnormal human activities using a combination of Skeleton Activity Forecasting (SAF) and a Bi-LSTM network. The generated skeleton joint points of a human subject are used for the pose estimation. The skeleton tracking and regions of interest points are estimated on a streaming video from an IP networked camera. The extracted interest points and their corresponding features are optimized and used to classify them as normal, abnormal or suspicious actions. The proposed system complies with Recommendation ITU-T H.627 “Signalling and protocols for a video surveillance system” and has been experimented and evaluated over benchmarked data sets for the recognition of human actions. The system performance attains a precision of 85.6% and an accuracy of 97.2% in recognizing different actions.","PeriodicalId":423554,"journal":{"name":"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Abnormal Activity Recognition using Deep Learning in Streaming Video for Indoor Application\",\"authors\":\"D. Kumar, Srinivasan Ramapriya Sailaja\",\"doi\":\"10.23919/ITUK53220.2021.9662095\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Human activity recognition has emerged as a challenging research domain for video analysis. The major issue for abnormal activity recognition in a streaming video is the presence of the large spatio-temporal data along with the constraints of communication networks affecting the quality of received data for analysis. In this paper, we propose a deep learning-based system to identify abnormal human activities using a combination of Skeleton Activity Forecasting (SAF) and a Bi-LSTM network. The generated skeleton joint points of a human subject are used for the pose estimation. The skeleton tracking and regions of interest points are estimated on a streaming video from an IP networked camera. The extracted interest points and their corresponding features are optimized and used to classify them as normal, abnormal or suspicious actions. The proposed system complies with Recommendation ITU-T H.627 “Signalling and protocols for a video surveillance system” and has been experimented and evaluated over benchmarked data sets for the recognition of human actions. The system performance attains a precision of 85.6% and an accuracy of 97.2% in recognizing different actions.\",\"PeriodicalId\":423554,\"journal\":{\"name\":\"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ITUK53220.2021.9662095\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 ITU Kaleidoscope: Connecting Physical and Virtual Worlds (ITU K)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ITUK53220.2021.9662095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abnormal Activity Recognition using Deep Learning in Streaming Video for Indoor Application
Human activity recognition has emerged as a challenging research domain for video analysis. The major issue for abnormal activity recognition in a streaming video is the presence of the large spatio-temporal data along with the constraints of communication networks affecting the quality of received data for analysis. In this paper, we propose a deep learning-based system to identify abnormal human activities using a combination of Skeleton Activity Forecasting (SAF) and a Bi-LSTM network. The generated skeleton joint points of a human subject are used for the pose estimation. The skeleton tracking and regions of interest points are estimated on a streaming video from an IP networked camera. The extracted interest points and their corresponding features are optimized and used to classify them as normal, abnormal or suspicious actions. The proposed system complies with Recommendation ITU-T H.627 “Signalling and protocols for a video surveillance system” and has been experimented and evaluated over benchmarked data sets for the recognition of human actions. The system performance attains a precision of 85.6% and an accuracy of 97.2% in recognizing different actions.