Multimodal late fusion bag of features applied to scene detection

Brazilian Symposium on Multimedia and the Web Pub Date : 2013-11-05 DOI:10.1145/2526188.2526202

Bruno Lorenço Lopes, R. Goularte

引用次数: 0

Abstract

Recent advances in technology have increased the availability of video data, creating a strong requirement for efficient systems to manage those materials. To make efficient use of video information, first, the data has to be automatic segmented into smaller, manageable and understandable units, like scenes. This paper presents a new, multimodal video scene segmentation technique. The proposed approach is to combine Bag of Features based techniques (visual and aural) in order to explore the latent semantic obtained by them in complementary way, improving scene segmentation. The results achieved showed to be promising.

查看原文本刊更多论文

多模态后期融合包特征应用于场景检测

最近技术的进步增加了视频数据的可用性，强烈要求有效的系统来管理这些材料。为了有效地利用视频信息，首先，必须将数据自动分割成更小的、可管理和可理解的单元，如场景。本文提出了一种新的多模态视频场景分割技术。本文提出的方法是将基于Bag of Features的技术(视觉和听觉)相结合，以互补的方式挖掘它们所获得的潜在语义，从而改进场景分割。取得的结果显示是有希望的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Brazilian Symposium on Multimedia and the Web

自引率

0.00%

发文量