{"title":"视频序列中的语义背景估计","authors":"A. Savakis, Aadeesh Milind Shringarpure","doi":"10.1109/SPIN.2018.8474279","DOIUrl":null,"url":null,"abstract":"We present a method that estimates the scene background in videos by utilizing semantic segmentation to extract foreground objects, such as people or cars, and stitching background regions to reconstruct the background. Inspired by recent developments in deep learning, we utilize semantic segmentation based on Conditional Random Field as Recurrent Neural Networks (CRF as RNN) to detect the regions of important objects in each frame and generate a foreground-background map. We use these segmentation maps to extract the background regions from each frame and then stitch them over consecutive frames to obtain the full background for the video sequence. Our foreground/background estimation approach has potential applications in change detection, video surveillance, video compression and video privacy. We illustrate the effectiveness of our method on example videos from the Change Detection (CDNET) dataset.","PeriodicalId":184596,"journal":{"name":"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Semantic Background Estimation in Video Sequences\",\"authors\":\"A. Savakis, Aadeesh Milind Shringarpure\",\"doi\":\"10.1109/SPIN.2018.8474279\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a method that estimates the scene background in videos by utilizing semantic segmentation to extract foreground objects, such as people or cars, and stitching background regions to reconstruct the background. Inspired by recent developments in deep learning, we utilize semantic segmentation based on Conditional Random Field as Recurrent Neural Networks (CRF as RNN) to detect the regions of important objects in each frame and generate a foreground-background map. We use these segmentation maps to extract the background regions from each frame and then stitch them over consecutive frames to obtain the full background for the video sequence. Our foreground/background estimation approach has potential applications in change detection, video surveillance, video compression and video privacy. We illustrate the effectiveness of our method on example videos from the Change Detection (CDNET) dataset.\",\"PeriodicalId\":184596,\"journal\":{\"name\":\"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPIN.2018.8474279\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIN.2018.8474279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
本文提出了一种利用语义分割提取前景对象(如人或车),并拼接背景区域重建背景的视频场景背景估计方法。受深度学习最新发展的启发,我们利用基于条件随机场的语义分割作为递归神经网络(CRF as RNN)来检测每帧中重要物体的区域,并生成前景-背景图。我们使用这些分割映射从每帧中提取背景区域,然后将它们拼接到连续的帧中,以获得视频序列的完整背景。我们的前景/背景估计方法在变化检测、视频监控、视频压缩和视频隐私方面具有潜在的应用前景。我们对来自变化检测(CDNET)数据集的示例视频演示了我们的方法的有效性。
We present a method that estimates the scene background in videos by utilizing semantic segmentation to extract foreground objects, such as people or cars, and stitching background regions to reconstruct the background. Inspired by recent developments in deep learning, we utilize semantic segmentation based on Conditional Random Field as Recurrent Neural Networks (CRF as RNN) to detect the regions of important objects in each frame and generate a foreground-background map. We use these segmentation maps to extract the background regions from each frame and then stitch them over consecutive frames to obtain the full background for the video sequence. Our foreground/background estimation approach has potential applications in change detection, video surveillance, video compression and video privacy. We illustrate the effectiveness of our method on example videos from the Change Detection (CDNET) dataset.