{"title":"Visual Attention based Cognitive Informative Frame Extraction Method for Smart Crowd Surveillance","authors":"Elizabeth B. Varghese, S. Thampi","doi":"10.1109/21CW48944.2021.9532519","DOIUrl":null,"url":null,"abstract":"In a smart surveillance system, the amount of video data has increased exponentially due to the increase in the number of monitoring devices and IoT sensors. To make smart and real-time decisions without latency in communication from these voluminous data is a tedious task. In this context, selecting informative frames from the video is of great importance that helps to extract only the salient features for further processing without latency and bandwidth constraints. In this paper, we are proposing a fast and reliable method for selecting informative frames from video sequences based on the human cognition process of visual attention to preserve the Spatio-temporal properties of the video. The proposed method extracts the informative frames using the frame informative score calculated based on visual attention maps, superpixel segmentation, and temporal information. Since our purpose is for analyzing crowd behavior from video data in a smart environment, we take two publicly available crowd video datasets for our experiments. The results show that the proposed approach is successful in extracting relevant video frames in linear time by preserving their spatial and temporal properties. We also analyze the feasibility of the proposed method in a fog computing-based simulated IoT framework, and it has been verified that the proposed cognitive approach could efficiently address the concerns of latency and bandwidth in smart surveillance environments.","PeriodicalId":239334,"journal":{"name":"2021 IEEE Conference on Norbert Wiener in the 21st Century (21CW)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Conference on Norbert Wiener in the 21st Century (21CW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/21CW48944.2021.9532519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In a smart surveillance system, the amount of video data has increased exponentially due to the increase in the number of monitoring devices and IoT sensors. To make smart and real-time decisions without latency in communication from these voluminous data is a tedious task. In this context, selecting informative frames from the video is of great importance that helps to extract only the salient features for further processing without latency and bandwidth constraints. In this paper, we are proposing a fast and reliable method for selecting informative frames from video sequences based on the human cognition process of visual attention to preserve the Spatio-temporal properties of the video. The proposed method extracts the informative frames using the frame informative score calculated based on visual attention maps, superpixel segmentation, and temporal information. Since our purpose is for analyzing crowd behavior from video data in a smart environment, we take two publicly available crowd video datasets for our experiments. The results show that the proposed approach is successful in extracting relevant video frames in linear time by preserving their spatial and temporal properties. We also analyze the feasibility of the proposed method in a fog computing-based simulated IoT framework, and it has been verified that the proposed cognitive approach could efficiently address the concerns of latency and bandwidth in smart surveillance environments.