Multi-Modal Video Concept Extraction Using Co-Training

Rong Yan, M. Naphade
DOI: 10.1109/ICME.2005.1521473
Published in: 2005 IEEE International Conference on Multimedia and Expo, July 6, 2005
Citations: 14

Abstract

For large-scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. A major obstacle to this is the insufficiency of labeled training samples. Semi-supervised learning algorithms such as co-training may help by incorporating a large amount of unlabeled data, allowing the redundant information across views to improve learning performance. Although co-training has been successfully applied in several domains, it has not previously been used to detect video concepts. In this paper, we extend co-training to the domain of video concept detection and investigate different co-training strategies as well as their effects on detection accuracy. We demonstrate performance based on the guidelines of the TRECVID '03 semantic concept extraction task.
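The co-training idea the abstract describes can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes two synthetic feature "views" per sample (standing in for, e.g., visual and text modalities) and uses a toy nearest-centroid classifier in place of the detectors the authors would have used. In each round, the per-view classifiers label the unlabeled samples they are most confident about, and those samples join the shared labeled pool.

```python
# Hedged illustrative sketch of co-training, NOT the paper's method.
# Assumptions: two views per sample, a toy nearest-centroid classifier,
# confidence measured as the distance margin between the two closest classes.
import math
import random

def train_centroid(X, y):
    """Fit per-class feature centroids; return predict(x) -> (label, margin)."""
    groups = {}
    for xi, yi in zip(X, y):
        groups.setdefault(yi, []).append(xi)
    cents = {c: [sum(col) / len(pts) for col in zip(*pts)]
             for c, pts in groups.items()}
    def predict(x):
        dists = {c: math.dist(x, m) for c, m in cents.items()}
        ranked = sorted(dists, key=dists.get)
        margin = dists[ranked[1]] - dists[ranked[0]] if len(ranked) > 1 else 1.0
        return ranked[0], margin
    return predict

def co_train(L1, L2, labels, U1, U2, rounds=3, k=2):
    """L1/L2: labeled samples in views 1/2; U1/U2: unlabeled; k: picks per round."""
    L1, L2, labels = list(L1), list(L2), list(labels)
    unlabeled = list(range(len(U1)))
    for _ in range(rounds):
        if not unlabeled:
            break
        f1 = train_centroid(L1, labels)
        f2 = train_centroid(L2, labels)
        # Score each unlabeled sample by its most confident view's prediction.
        scored = []
        for i in unlabeled:
            p1, m1 = f1(U1[i])
            p2, m2 = f2(U2[i])
            scored.append((max(m1, m2), i, p1 if m1 >= m2 else p2))
        scored.sort(reverse=True)
        # Promote the k most confident samples into the labeled pool.
        for _, i, lab in scored[:k]:
            L1.append(U1[i]); L2.append(U2[i]); labels.append(lab)
            unlabeled.remove(i)
    return train_centroid(L1, labels), train_centroid(L2, labels)

# Toy usage: two well-separated classes, each seen through two noisy views.
random.seed(0)
def pt(cx, cy):
    return (cx + random.uniform(-1, 1), cy + random.uniform(-1, 1))

L1 = [pt(0, 0), pt(5, 5)]; L2 = [pt(0, 0), pt(5, 5)]; labels = [0, 1]
U1 = [pt(0, 0), pt(5, 5), pt(0, 0), pt(5, 5)]
U2 = [pt(0, 0), pt(5, 5), pt(0, 0), pt(5, 5)]
f1, f2 = co_train(L1, L2, labels, U1, U2)
```

The key design choice co-training relies on, and which the paper investigates for video, is that the two views are redundantly sufficient: each view alone can classify the sample, so a confident label from one view is useful training signal for the other.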