使用空间编码技术的图中图复制检测

Automated Information Extraction in Media Production Pub Date : 2011-12-01 DOI:10.1145/2072552.2072559

S. Purushotham, Q. Tian, C.-C. Jay Kuo

{"title":"使用空间编码技术的图中图复制检测","authors":"S. Purushotham, Q. Tian, C.-C. Jay Kuo","doi":"10.1145/2072552.2072559","DOIUrl":null,"url":null,"abstract":"Picture-in-Picture (PiP) is a special video transformation where one or more videos is scaled and spatially embedded in a host video. PiP is a very useful service to watch two or more videos simultaneously, however it can be exploited to visually hide one video inside another video. Today's copy detection techniques can be easily fooled by PiP, which is reflected in the poor results in the yearly TRECVID competitions. Inspired by the promise of spatial coding in partial image matching, we propose a generalized spatial coding representation in which both the relative position and relative orientation is embedded in the spatial code. In this paper, we will provide novel formulation for spatial verification problem and introduce polynomial and non-polynomial algorithms to efficiently address the spatial verification problem. Our initial experiment results on TRECVID and MSRA datasets shows that our proposed spatial verification algorithms provide around 20% improvement over the classical hierarchical bag-of-words approach.","PeriodicalId":280321,"journal":{"name":"Automated Information Extraction in Media Production","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Picture-in-picture copy detection using spatial coding techniques\",\"authors\":\"S. Purushotham, Q. Tian, C.-C. Jay Kuo\",\"doi\":\"10.1145/2072552.2072559\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Picture-in-Picture (PiP) is a special video transformation where one or more videos is scaled and spatially embedded in a host video. PiP is a very useful service to watch two or more videos simultaneously, however it can be exploited to visually hide one video inside another video. Today's copy detection techniques can be easily fooled by PiP, which is reflected in the poor results in the yearly TRECVID competitions. Inspired by the promise of spatial coding in partial image matching, we propose a generalized spatial coding representation in which both the relative position and relative orientation is embedded in the spatial code. In this paper, we will provide novel formulation for spatial verification problem and introduce polynomial and non-polynomial algorithms to efficiently address the spatial verification problem. Our initial experiment results on TRECVID and MSRA datasets shows that our proposed spatial verification algorithms provide around 20% improvement over the classical hierarchical bag-of-words approach.\",\"PeriodicalId\":280321,\"journal\":{\"name\":\"Automated Information Extraction in Media Production\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Automated Information Extraction in Media Production\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2072552.2072559\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automated Information Extraction in Media Production","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2072552.2072559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

画中画(PiP)是一种特殊的视频转换，其中一个或多个视频被缩放并在空间上嵌入到主机视频中。PiP是一个非常有用的服务，可以同时观看两个或多个视频，但是它可以被利用来可视化地将一个视频隐藏在另一个视频中。今天的复制检测技术很容易被PiP欺骗，这反映在每年的TRECVID比赛中成绩不佳。受空间编码在部分图像匹配中的应用前景启发，我们提出了一种广义的空间编码表示，其中相对位置和相对方向都嵌入到空间编码中。在本文中，我们将为空间验证问题提供新的公式，并引入多项式和非多项式算法来有效地解决空间验证问题。我们在TRECVID和MSRA数据集上的初步实验结果表明，我们提出的空间验证算法比经典的分层词袋方法提高了约20%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Picture-in-picture copy detection using spatial coding techniques

Picture-in-Picture (PiP) is a special video transformation where one or more videos is scaled and spatially embedded in a host video. PiP is a very useful service to watch two or more videos simultaneously, however it can be exploited to visually hide one video inside another video. Today's copy detection techniques can be easily fooled by PiP, which is reflected in the poor results in the yearly TRECVID competitions. Inspired by the promise of spatial coding in partial image matching, we propose a generalized spatial coding representation in which both the relative position and relative orientation is embedded in the spatial code. In this paper, we will provide novel formulation for spatial verification problem and introduce polynomial and non-polynomial algorithms to efficiently address the spatial verification problem. Our initial experiment results on TRECVID and MSRA datasets shows that our proposed spatial verification algorithms provide around 20% improvement over the classical hierarchical bag-of-words approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Automated Information Extraction in Media Production

自引率

0.00%

发文量