基于全局描述符的视频相似性检测时空参数评估

2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2015-11-01 DOI:10.1109/DICTA.2015.7371255

A. Rouhi

{"title":"基于全局描述符的视频相似性检测时空参数评估","authors":"A. Rouhi","doi":"10.1109/DICTA.2015.7371255","DOIUrl":null,"url":null,"abstract":"The role of partitioned colour-based global descriptors is well known in video similarity detection tasks for their inexpensive yet effective performance compared to local descriptors. They provide robust and discriminative results in content-preserving visual distortions such as strong re-encoding, pattern insertions and photometric effects. The current research evaluates the effectiveness of three spatio-temporal parameters in video similarity detection tasks. The investigated parameters are specifically colour space, frame partitioning and sampling frame rates. CRIM method (video only) is selected as the base due to its optimum performance in content-preserving visual distortions in the TRECVID/CCD (Content-based Copy Detection) 2011. An amended version of CRIM, based on normalised-average luminance is introduced to compare the results with the baseline. The performance comparison is conducted using a subset of the TRECVID/CCD 2011 dataset, affected by four types of content-preserving visual distortions: T3, T4, T5 and T6. The experimental results shows that the normalised-average luminance descriptors offer more robust and competitive performance. Although they yielded a slightly better performance at the highest sampling frame rate (all frames), compared to the baseline, they offer significantly better performance at the lower sampling frame rate. The experimental evidence also reveals that the core competency of the luminance-based descriptors is significantly improved in terms of mean processing time. This metric is generally known as a shortcoming in video processing algorithms. The effect of the number of partitions is also investigated and it has been shown that increasing the number of partitions can severely lower the efficiency of the method, without yielding a significant increase in the performance.","PeriodicalId":214897,"journal":{"name":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"229 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Evaluating Spatio-Temporal Parameters in Video Similarity Detection by Global Descriptors\",\"authors\":\"A. Rouhi\",\"doi\":\"10.1109/DICTA.2015.7371255\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The role of partitioned colour-based global descriptors is well known in video similarity detection tasks for their inexpensive yet effective performance compared to local descriptors. They provide robust and discriminative results in content-preserving visual distortions such as strong re-encoding, pattern insertions and photometric effects. The current research evaluates the effectiveness of three spatio-temporal parameters in video similarity detection tasks. The investigated parameters are specifically colour space, frame partitioning and sampling frame rates. CRIM method (video only) is selected as the base due to its optimum performance in content-preserving visual distortions in the TRECVID/CCD (Content-based Copy Detection) 2011. An amended version of CRIM, based on normalised-average luminance is introduced to compare the results with the baseline. The performance comparison is conducted using a subset of the TRECVID/CCD 2011 dataset, affected by four types of content-preserving visual distortions: T3, T4, T5 and T6. The experimental results shows that the normalised-average luminance descriptors offer more robust and competitive performance. Although they yielded a slightly better performance at the highest sampling frame rate (all frames), compared to the baseline, they offer significantly better performance at the lower sampling frame rate. The experimental evidence also reveals that the core competency of the luminance-based descriptors is significantly improved in terms of mean processing time. This metric is generally known as a shortcoming in video processing algorithms. The effect of the number of partitions is also investigated and it has been shown that increasing the number of partitions can severely lower the efficiency of the method, without yielding a significant increase in the performance.\",\"PeriodicalId\":214897,\"journal\":{\"name\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"229 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2015.7371255\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2015.7371255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

与局部描述符相比，基于颜色的分割全局描述符在视频相似度检测任务中的作用是众所周知的，因为它们的性能便宜但有效。它们在内容保留视觉扭曲(如强重新编码、模式插入和光度效应)方面提供鲁棒性和判别性结果。本研究评估了三个时空参数在视频相似度检测任务中的有效性。研究的参数包括色彩空间、帧分割和采样帧率。在TRECVID/CCD (Content-based Copy Detection，基于内容的拷贝检测)2011中，由于CRIM方法(仅视频)在内容保留视觉失真方面的性能最佳，因此选择CRIM方法作为基础。引入了一种基于归一化平均亮度的改进版CRIM，将结果与基线进行比较。使用TRECVID/CCD 2011数据集的一个子集进行性能比较，受四种类型的保留内容的视觉失真:T3, T4, T5和T6的影响。实验结果表明，归一化平均亮度描述符具有更好的鲁棒性和竞争力。尽管它们在最高采样帧率(所有帧)下的性能略好于基线，但它们在较低采样帧率下的性能明显更好。实验结果还表明，基于亮度的描述符的核心竞争力在平均处理时间方面得到了显著提高。这个度量通常被认为是视频处理算法中的一个缺点。我们还研究了分区数量的影响，结果表明，增加分区数量会严重降低该方法的效率，而不会显著提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Evaluating Spatio-Temporal Parameters in Video Similarity Detection by Global Descriptors

The role of partitioned colour-based global descriptors is well known in video similarity detection tasks for their inexpensive yet effective performance compared to local descriptors. They provide robust and discriminative results in content-preserving visual distortions such as strong re-encoding, pattern insertions and photometric effects. The current research evaluates the effectiveness of three spatio-temporal parameters in video similarity detection tasks. The investigated parameters are specifically colour space, frame partitioning and sampling frame rates. CRIM method (video only) is selected as the base due to its optimum performance in content-preserving visual distortions in the TRECVID/CCD (Content-based Copy Detection) 2011. An amended version of CRIM, based on normalised-average luminance is introduced to compare the results with the baseline. The performance comparison is conducted using a subset of the TRECVID/CCD 2011 dataset, affected by four types of content-preserving visual distortions: T3, T4, T5 and T6. The experimental results shows that the normalised-average luminance descriptors offer more robust and competitive performance. Although they yielded a slightly better performance at the highest sampling frame rate (all frames), compared to the baseline, they offer significantly better performance at the lower sampling frame rate. The experimental evidence also reveals that the core competency of the luminance-based descriptors is significantly improved in terms of mean processing time. This metric is generally known as a shortcoming in video processing algorithms. The effect of the number of partitions is also investigated and it has been shown that increasing the number of partitions can severely lower the efficiency of the method, without yielding a significant increase in the performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

自引率

0.00%

发文量