Leveraging an image folksonomy and the Signature Quadratic Form Distance for semantic-based detection of near-duplicate video clips

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI:10.1109/ICME.2011.6011937

Hyun-seok Min, J. Choi, W. D. Neve, Yong Man Ro

引用次数: 2

Abstract

Being able to detect near-duplicate video clips (NDVCs) is a prerequisite for a plethora of multimedia applications. Given the observation that content transformations tend to preserve semantic information, techniques for NDVC detection may benefit from the use of a semantic approach. This paper discusses how an image folksonomy (i.e., community-contributed images and metadata) and the Signature Quadratic Form Distance (SQFD) can be leveraged for the purpose of identifying NDVCs. Experimental results obtained for the MIRFLICKR-25000 image set and the TRECVID 2009 video set indicate that an image folksonomy and SQFD can be successfully used for detecting NDVCs. In addition, our findings show that model-free NDVC detection (i.e., NDVC detection using an image folksonomy) has a higher semantic coverage than model-based NDVC detection (i.e., NDVC detection using the VIREO-374 semantic concept models).

查看原文本刊更多论文

利用图像大众分类法和签名二次形式距离进行基于语义的近重复视频剪辑检测

能够检测近重复视频剪辑(NDVCs)是大量多媒体应用程序的先决条件。考虑到内容转换倾向于保留语义信息，NDVC检测技术可能会受益于使用语义方法。本文讨论了如何利用图像大众分类法(即社区贡献的图像和元数据)和签名二次形式距离(SQFD)来识别NDVCs。MIRFLICKR-25000图像集和TRECVID 2009视频集的实验结果表明，图像大众分类和SQFD可以成功地用于NDVCs检测。此外，我们的研究结果表明，无模型的NDVC检测(即使用图像大众分类法的NDVC检测)比基于模型的NDVC检测(即使用VIREO-374语义概念模型的NDVC检测)具有更高的语义覆盖率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 IEEE International Conference on Multimedia and Expo

自引率

0.00%

发文量