Mingyang Ma, Shaohui Mei, Junhui Hou, Shuai Wan, Zhiyong Wang
{"title":"基于相邻帧时间协同表示的视频摘要","authors":"Mingyang Ma, Shaohui Mei, Junhui Hou, Shuai Wan, Zhiyong Wang","doi":"10.1109/ISPACS.2017.8266466","DOIUrl":null,"url":null,"abstract":"The ever increasing volume of video content demands to develop efficient and effective video summarization (VS) techniques to manage the video data. Recent developments on sparse representation have demonstrated prospective results for VS. In this paper, in consideration of visual similarity of adjacent frames, we formulate the video summarization problem with a temporal collaborative representation (TCR) model, in which the adjacent frames instead of an individual frame are taken into consideration to avoid selecting transitional frames. In addition, a greedy iterative algorithm is designed for model optimization. Experimental results on a benchmark dataset with various types of videos demonstrate that the proposed algorithms can not only outperform the state of the art, but also reduce the probability of selecting transitional frames.","PeriodicalId":166414,"journal":{"name":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Video summarization via temporal collaborative representation of adjacent frames\",\"authors\":\"Mingyang Ma, Shaohui Mei, Junhui Hou, Shuai Wan, Zhiyong Wang\",\"doi\":\"10.1109/ISPACS.2017.8266466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The ever increasing volume of video content demands to develop efficient and effective video summarization (VS) techniques to manage the video data. Recent developments on sparse representation have demonstrated prospective results for VS. In this paper, in consideration of visual similarity of adjacent frames, we formulate the video summarization problem with a temporal collaborative representation (TCR) model, in which the adjacent frames instead of an individual frame are taken into consideration to avoid selecting transitional frames. In addition, a greedy iterative algorithm is designed for model optimization. Experimental results on a benchmark dataset with various types of videos demonstrate that the proposed algorithms can not only outperform the state of the art, but also reduce the probability of selecting transitional frames.\",\"PeriodicalId\":166414,\"journal\":{\"name\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPACS.2017.8266466\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS.2017.8266466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Video summarization via temporal collaborative representation of adjacent frames
The ever increasing volume of video content demands to develop efficient and effective video summarization (VS) techniques to manage the video data. Recent developments on sparse representation have demonstrated prospective results for VS. In this paper, in consideration of visual similarity of adjacent frames, we formulate the video summarization problem with a temporal collaborative representation (TCR) model, in which the adjacent frames instead of an individual frame are taken into consideration to avoid selecting transitional frames. In addition, a greedy iterative algorithm is designed for model optimization. Experimental results on a benchmark dataset with various types of videos demonstrate that the proposed algorithms can not only outperform the state of the art, but also reduce the probability of selecting transitional frames.