Video summarization by learning submodular mixtures of objectives

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2015-06-07 DOI:10.1109/CVPR.2015.7298928

Michael Gygli, H. Grabner, L. Gool

引用次数: 389

Abstract

We present a novel method for summarizing raw, casually captured videos. The objective is to create a short summary that still conveys the story. It should thus be both, interesting and representative for the input video. Previous methods often used simplified assumptions and only optimized for one of these goals. Alternatively, they used handdefined objectives that were optimized sequentially by making consecutive hard decisions. This limits their use to a particular setting. Instead, we introduce a new method that (i) uses a supervised approach in order to learn the importance of global characteristics of a summary and (ii) jointly optimizes for multiple objectives and thus creates summaries that posses multiple properties of a good summary. Experiments on two challenging and very diverse datasets demonstrate the effectiveness of our method, where we outperform or match current state-of-the-art.

查看原文本刊更多论文

视频摘要通过学习子模块混合目标

我们提出了一种新的方法来总结原始的，随意捕获的视频。目标是创建一个简短的总结，仍然传达故事。因此，对于输入视频来说，它应该既有趣又具有代表性。以前的方法通常使用简化的假设，并且只针对其中一个目标进行优化。或者，他们使用手动定义的目标，通过做出连续的艰难决定来依次优化目标。这限制了它们在特定环境中的使用。相反，我们引入了一种新方法，该方法(i)使用监督方法来学习摘要的全局特征的重要性，(ii)针对多个目标共同优化，从而创建具有良好摘要的多个属性的摘要。在两个具有挑战性和非常多样化的数据集上的实验证明了我们方法的有效性，我们的方法优于或匹配当前最先进的技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量