On model-based clustering of video scenes using scenelets

Proceedings. IEEE International Conference on Multimedia and Expo Pub Date : 2002-11-07 DOI:10.1109/ICME.2002.1035778

Hong Lu, Yap-Peng Tan

引用次数: 5

Abstract

We propose in this paper a model-based approach to clustering video scenes based on scenelets. We define a video scenelet as a short consecutive sample of frames of a video sequence. The approach makes use of an unsupervised method to represent scenelets of a video with a concise Gaussian mixture model and cluster them into different video scenes according to their visual similarities. In particular the expectation-maximization algorithm is employed to estimate the unknown model parameters, and Bayesian information criterion is used to determine the optimal number and model of scene clusters in a principled manner. This approach is fundamentally different from many existing video clustering methods, as it does not require explicit knowledge of shot boundaries. Instead, the shot boundaries can also be obtained as a by-product of the scene clustering process. The proposed methods have been tested with various types of sports videos and promising results are reported in this paper.

查看原文本刊更多论文

基于模型的视频场景聚类研究

本文提出了一种基于场景集的视频场景聚类方法。我们将视频场景定义为视频序列中短的连续帧样本。该方法利用一种无监督的方法，用简洁的高斯混合模型来表示视频场景，并根据视觉相似性将其聚类到不同的视频场景中。其中，利用期望最大化算法对未知模型参数进行估计，并利用贝叶斯信息准则有原则地确定场景聚类的最优数量和模型。这种方法从根本上不同于许多现有的视频聚类方法，因为它不需要明确的镜头边界知识。相反，镜头边界也可以作为场景聚类过程的副产品得到。本文对所提出的方法进行了各种类型的运动视频的测试，并报告了令人满意的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. IEEE International Conference on Multimedia and Expo

自引率

0.00%

发文量