{"title":"An EM algorithm for video summarization, generative model approach","authors":"Xavier Orriols, Xavier Binefa","doi":"10.1109/ICCV.2001.937645","DOIUrl":null,"url":null,"abstract":"In this paper, we address the visual video summarization problem in a Bayesian framework in order to detect and describe the underlying temporal transformation symmetries in a video sequence. Given a set of time correlated frames, we attempt to extract a reduced number of image-like data structures which are semantically meaningful and that have the ability of representing the sequence evolution. To this end, we present a generative model which involves jointly the representation and the evolution of appearance. Applying Linear Dynamical System theory to this problem, we discuss how the temporal information is encoded yielding a manner of grouping the iconic representations of the video sequence in terms of invariance. The formulation of this problem is driven in terms of a probabilistic approach, which affords a measure of perceptual similarity taking both learned appearance and time evolution models into account.","PeriodicalId":429441,"journal":{"name":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2001.937645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 27
Abstract
In this paper, we address the visual video summarization problem in a Bayesian framework in order to detect and describe the underlying temporal transformation symmetries in a video sequence. Given a set of time-correlated frames, we attempt to extract a small number of semantically meaningful, image-like data structures that can represent the evolution of the sequence. To this end, we present a generative model that jointly captures the representation of appearance and its temporal evolution. Applying Linear Dynamical System theory to this problem, we discuss how the temporal information is encoded, which yields a way of grouping the iconic representations of the video sequence in terms of invariance. The problem is formulated in probabilistic terms, affording a measure of perceptual similarity that takes both the learned appearance model and the time evolution model into account.
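The abstract does not spell out the estimation procedure, so the following is only an illustrative sketch of the kind of machinery the paper builds on: EM for a standard linear dynamical system (Kalman-filter/RTS-smoother E-step, closed-form M-step, in the spirit of Ghahramani and Hinton, 1996), applied to low-dimensional appearance coefficients y_t that would stand in for projected video frames. The function names (kalman_smoother, em_lds), the latent dimension, and the synthetic data are all assumptions for illustration, not the authors' formulation.

```python
# Sketch: EM for a linear dynamical system x_t = A x_{t-1} + w, y_t = C x_t + v,
# with w ~ N(0, Q), v ~ N(0, R). Observations y_t are assumed to be
# low-dimensional appearance coefficients of the video frames (hypothetical).
import numpy as np

def kalman_smoother(Y, A, C, Q, R, mu0, P0):
    """E-step: forward Kalman filter + backward RTS smoother.
    Y: (T, p) observations. Returns smoothed means, covariances,
    and lag-one cross-covariances."""
    T, p = Y.shape
    k = A.shape[0]
    xf = np.zeros((T, k)); Pf = np.zeros((T, k, k))   # filtered estimates
    xp = np.zeros((T, k)); Pp = np.zeros((T, k, k))   # one-step predictions
    x, P = mu0, P0
    for t in range(T):
        if t > 0:
            x, P = A @ xf[t-1], A @ Pf[t-1] @ A.T + Q
        xp[t], Pp[t] = x, P
        S = C @ P @ C.T + R
        K = P @ C.T @ np.linalg.inv(S)
        xf[t] = x + K @ (Y[t] - C @ x)
        Pf[t] = (np.eye(k) - K @ C) @ P
    xs = xf.copy(); Ps = Pf.copy()
    Pcross = np.zeros((T, k, k))                       # Cov(x_t, x_{t-1} | Y)
    for t in range(T - 2, -1, -1):
        J = Pf[t] @ A.T @ np.linalg.inv(Pp[t+1])
        xs[t] = xf[t] + J @ (xs[t+1] - xp[t+1])
        Ps[t] = Pf[t] + J @ (Ps[t+1] - Pp[t+1]) @ J.T
        Pcross[t+1] = Ps[t+1] @ J.T                    # common approximation
    return xs, Ps, Pcross

def em_lds(Y, k, n_iter=20, seed=0):
    """Fit LDS parameters (A, C, Q, R) to observations Y by EM."""
    rng = np.random.default_rng(seed)
    T, p = Y.shape
    A = np.eye(k); C = rng.standard_normal((p, k)) * 0.1
    Q = np.eye(k); R = np.eye(p); mu0 = np.zeros(k); P0 = np.eye(k)
    for _ in range(n_iter):
        xs, Ps, Pc = kalman_smoother(Y, A, C, Q, R, mu0, P0)
        Ext = Ps + np.einsum('ti,tj->tij', xs, xs)     # E[x_t x_t^T]
        S11 = Ext[1:].sum(0)
        S00 = Ext[:-1].sum(0)
        S10 = (Pc[1:] + np.einsum('ti,tj->tij', xs[1:], xs[:-1])).sum(0)
        # M-step: closed-form parameter updates
        A = S10 @ np.linalg.inv(S00)
        Q = (S11 - A @ S10.T) / (T - 1)
        Syx = Y.T @ xs                                  # sum_t y_t E[x_t]^T
        C = Syx @ np.linalg.inv(Ext.sum(0))
        R = (Y.T @ Y - C @ Syx.T) / T
        mu0, P0 = xs[0], Ps[0]
    return A, C, Q, R, xs

if __name__ == "__main__":
    # Toy check on a synthetic slowly-drifting sequence standing in for
    # projected frame coefficients.
    rng = np.random.default_rng(1)
    Y = np.cumsum(rng.standard_normal((200, 5)), axis=0) * 0.1
    A, C, Q, R, states = em_lds(Y, k=3)
    print("Estimated transition matrix A:\n", A.round(2))
```

In this kind of setup, the learned dynamics matrix A encodes how appearance evolves in time, while C maps latent states back to image-like representations; grouping frames by which dynamics explain them well is one plausible route to the invariance-based summaries the abstract describes.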