A factor graph framework for semantic indexing and retrieval in video

2000 Proceedings Workshop on Content-based Access of Image and Video Libraries Pub Date : 2000-06-16 DOI:10.1109/IVL.2000.853836

M. Naphade, I. Kozintsev, T. Huang, K. Ramchandran

引用次数: 55

Abstract

This paper proposes a novel framework for semantic indexing and retrieval in digital video. The components of the framework are probabilistic multimedia objects (multijects) and a network of such objects (multinets). The main contribution of this paper is a novel application of a factor graph framework to model the interactions in a network of multijects (multinet) at a semantic level. Factor graphs are statistical graphical models that provide an efficient framework for exact and approximate inference via the sum-product algorithm. Incorporating the statistical interactions between the concepts using factor graphs enhances the detection probability of individual multijects and provides a unified framework for integrating multiple modalities and supports inference of unobservable concepts based on their relation with observable concepts. Our experiments reveal significant performance improvement using the inference on the factor graph models.

查看原文本刊更多论文

视频语义索引与检索的因子图框架

本文提出了一种新的数字视频语义索引与检索框架。该框架的组件是概率多媒体对象(multijects)和这种对象(multiinets)的网络。本文的主要贡献是一个新的应用因子图框架，在语义层面上对多对象网络(多网络)中的交互建模。因子图是统计图形模型，它通过和积算法为精确和近似推理提供了一个有效的框架。利用因子图将概念之间的统计交互作用结合起来，提高了单个多对象的检测概率，为整合多个模态提供了统一的框架，并支持基于不可观测概念与可观测概念之间关系的不可观测概念推理。我们的实验表明，使用因子图模型的推理可以显著提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2000 Proceedings Workshop on Content-based Access of Image and Video Libraries

自引率

0.00%

发文量