{"title":"基于显著性过渡和运动目标信息的监控视频总结","authors":"M. Salehin, M. Paul","doi":"10.1109/DICTA.2015.7371311","DOIUrl":null,"url":null,"abstract":"Everyday an enormous amount of video is captured by surveillance system for various purposes around the whole world. However, this is almost impossible for human to analyze the vast majority of video data. In this paper, a video summarization method is introduced combining foreground object, motion, and visual attention cue. Foreground objects typically provide important information about video contents. Additionally, object motion is naturally more attractive to human being. Moreover, visual attention cue indicates the human's attraction label for key frame determination. Using these features, supervised classifier support vector machine (SVM) is applied to obtain the key frames from a surveillance video. Extensive experimental results show that the proposed method performs superior to the state-of-the-art method using publicly available BL-7F surveillance video dataset.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"58 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Summarizing Surveillance Video by Saliency Transition and Moving Object Information\",\"authors\":\"M. Salehin, M. Paul\",\"doi\":\"10.1109/DICTA.2015.7371311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Everyday an enormous amount of video is captured by surveillance system for various purposes around the whole world. However, this is almost impossible for human to analyze the vast majority of video data. In this paper, a video summarization method is introduced combining foreground object, motion, and visual attention cue. Foreground objects typically provide important information about video contents. Additionally, object motion is naturally more attractive to human being. Moreover, visual attention cue indicates the human's attraction label for key frame determination. Using these features, supervised classifier support vector machine (SVM) is applied to obtain the key frames from a surveillance video. Extensive experimental results show that the proposed method performs superior to the state-of-the-art method using publicly available BL-7F surveillance video dataset.\",\"PeriodicalId\":214897,\"journal\":{\"name\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"58 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2015.7371311\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2015.7371311","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Summarizing Surveillance Video by Saliency Transition and Moving Object Information
Everyday an enormous amount of video is captured by surveillance system for various purposes around the whole world. However, this is almost impossible for human to analyze the vast majority of video data. In this paper, a video summarization method is introduced combining foreground object, motion, and visual attention cue. Foreground objects typically provide important information about video contents. Additionally, object motion is naturally more attractive to human being. Moreover, visual attention cue indicates the human's attraction label for key frame determination. Using these features, supervised classifier support vector machine (SVM) is applied to obtain the key frames from a surveillance video. Extensive experimental results show that the proposed method performs superior to the state-of-the-art method using publicly available BL-7F surveillance video dataset.