{"title":"Occlusion resolving inside public crowded scenes based on social deep learning model","authors":"A. S. Elons, Magdy Abol-Ela","doi":"10.1109/INTELCIS.2017.8260050","DOIUrl":null,"url":null,"abstract":"Past decade, the field of video analytics has been rapidly developing specially for crowd scenes. The advances in computational resources inspired researchers to build reliable video analytics systems that works real. The main root for any video analytics system is threat activity localization inside video streams. One major issue toward achieving that objective is Occlusion due to crowd intensity. In this paper, a hybrid deep learning model that exploits Convolution Neural Network (CNN) and Social Long Short-Term Memory (LSTM) for real-time video streaming analytics. The experiments were conducted on public available dataset UCY which contains 2 main scenes with 786 persons and 55 actions. The results concluded the superiority of Social LSTM over conventional LSTM and Mean Square Error (MSE) does not exceed 0.25.","PeriodicalId":321315,"journal":{"name":"2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INTELCIS.2017.8260050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Past decade, the field of video analytics has been rapidly developing specially for crowd scenes. The advances in computational resources inspired researchers to build reliable video analytics systems that works real. The main root for any video analytics system is threat activity localization inside video streams. One major issue toward achieving that objective is Occlusion due to crowd intensity. In this paper, a hybrid deep learning model that exploits Convolution Neural Network (CNN) and Social Long Short-Term Memory (LSTM) for real-time video streaming analytics. The experiments were conducted on public available dataset UCY which contains 2 main scenes with 786 persons and 55 actions. The results concluded the superiority of Social LSTM over conventional LSTM and Mean Square Error (MSE) does not exceed 0.25.