Stanford I2V: a news video dataset for query-by-image experiments

Proceedings of the 6th ACM Multimedia Systems Conference Pub Date : 2015-03-18 DOI:10.1145/2713168.2713197

A. Araújo, J. Chaves, David M. Chen, Roland Angst, B. Girod

引用次数: 27

Abstract

Reproducible research in the area of visual search depends on the availability of large annotated datasets. In this paper, we address the problem of querying a video database by images that might share some contents with one or more video clips. We present a new large dataset, called Stanford I2V. We have collected more than 3; 800 hours of newscast videos and annotated more than 200 ground-truth queries. In the following, the dataset is described in detail, the collection methodology is outlined and retrieval performance for a benchmark algorithm is presented. These results may serve as a baseline for future research and provide an example of the intended use of the Stanford I2V dataset. The dataset can be downloaded at http://purl.stanford.edu/zx935qw7203.

查看原文本刊更多论文

Stanford I2V:用于图像查询实验的新闻视频数据集

视觉搜索领域的可重复性研究依赖于大型注释数据集的可用性。在本文中，我们解决了通过图像查询视频数据库的问题，这些图像可能与一个或多个视频片段共享某些内容。我们提出了一个新的大型数据集，叫做Stanford I2V。我们已经收集了3个以上;800小时的新闻视频和超过200个真实问题的注释。在下面，详细描述了数据集，概述了收集方法，并介绍了基准算法的检索性能。这些结果可以作为未来研究的基线，并提供斯坦福I2V数据集预期使用的示例。该数据集可从http://purl.stanford.edu/zx935qw7203下载。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 6th ACM Multimedia Systems Conference

自引率

0.00%

发文量