一种视频摘要的EM算法，生成模型方法

Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001 Pub Date : 2001-07-07 DOI:10.1109/ICCV.2001.937645

Xavier Orriols, Xavier Binefa

{"title":"一种视频摘要的EM算法，生成模型方法","authors":"Xavier Orriols, Xavier Binefa","doi":"10.1109/ICCV.2001.937645","DOIUrl":null,"url":null,"abstract":"In this paper, we address the visual video summarization problem in a Bayesian framework in order to detect and describe the underlying temporal transformation symmetries in a video sequence. Given a set of time correlated frames, we attempt to extract a reduced number of image-like data structures which are semantically meaningful and that have the ability of representing the sequence evolution. To this end, we present a generative model which involves jointly the representation and the evolution of appearance. Applying Linear Dynamical System theory to this problem, we discuss how the temporal information is encoded yielding a manner of grouping the iconic representations of the video sequence in terms of invariance. The formulation of this problem is driven in terms of a probabilistic approach, which affords a measure of perceptual similarity taking both learned appearance and time evolution models into account.","PeriodicalId":429441,"journal":{"name":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":"{\"title\":\"An EM algorithm for video summarization, generative model approach\",\"authors\":\"Xavier Orriols, Xavier Binefa\",\"doi\":\"10.1109/ICCV.2001.937645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we address the visual video summarization problem in a Bayesian framework in order to detect and describe the underlying temporal transformation symmetries in a video sequence. Given a set of time correlated frames, we attempt to extract a reduced number of image-like data structures which are semantically meaningful and that have the ability of representing the sequence evolution. To this end, we present a generative model which involves jointly the representation and the evolution of appearance. Applying Linear Dynamical System theory to this problem, we discuss how the temporal information is encoded yielding a manner of grouping the iconic representations of the video sequence in terms of invariance. The formulation of this problem is driven in terms of a probabilistic approach, which affords a measure of perceptual similarity taking both learned appearance and time evolution models into account.\",\"PeriodicalId\":429441,\"journal\":{\"name\":\"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"27\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCV.2001.937645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCV.2001.937645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

摘要

在本文中，我们在贝叶斯框架中解决视觉视频摘要问题，以检测和描述视频序列中潜在的时间变换对称性。给定一组时间相关的帧，我们试图提取数量较少的类图像数据结构，这些数据结构在语义上有意义，并且具有表示序列演化的能力。为此，我们提出了一种结合表象和表象演化的生成模型。将线性动力系统理论应用于这个问题，我们讨论了如何对时间信息进行编码，从而产生一种根据不变性对视频序列的标志性表示进行分组的方式。这个问题的表述是根据概率方法驱动的，它提供了一种感知相似性的度量，同时考虑了学习的外观和时间进化模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An EM algorithm for video summarization, generative model approach

In this paper, we address the visual video summarization problem in a Bayesian framework in order to detect and describe the underlying temporal transformation symmetries in a video sequence. Given a set of time correlated frames, we attempt to extract a reduced number of image-like data structures which are semantically meaningful and that have the ability of representing the sequence evolution. To this end, we present a generative model which involves jointly the representation and the evolution of appearance. Applying Linear Dynamical System theory to this problem, we discuss how the temporal information is encoded yielding a manner of grouping the iconic representations of the video sequence in terms of invariance. The formulation of this problem is driven in terms of a probabilistic approach, which affords a measure of perceptual similarity taking both learned appearance and time evolution models into account.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001

自引率

0.00%

发文量