Enhanced-IPMH as a Robust Visual Descriptor from H.264/AVC and Evaluation of Parameters Effects

2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2015-11-01 DOI:10.1109/DICTA.2015.7371254

A. Rouhi


Citations: 2

Abstract



Intra-prediction Modes-based (IPM-based) descriptors are among the robust and competitive visual descriptors for near-duplicate video similarity detection in general, and for content-based copy detection (CCD) in particular. IPM-based descriptors are extracted from the compressed H.264/AVC (MPEG-4/AVC) video domain. Intra-prediction Modes (IPM) are the building blocks of the key frames (I and IDR slices) in the H.264/AVC video standard. IPM-based descriptors are generally constructed from the probability distribution of the unified intra-prediction modes of the key frames. This research introduces an enhanced version of the IPM-Histogram (IPMH) with 10 bins, called enhanced-IPMH (e-IPMH). The experiments were conducted on a subset of the TRECVID/CCD (2011) dataset, and the TREC-EVAL-Video software was used to compute the performance measures. Based on the experimental evidence, the e-IPMH is an effective and inexpensive visual feature compared to the pixel-domain global descriptors. Compared to its predecessor, IPMH, the e-IPMH shows improvement in two performance measures: Mean Reciprocal Rank (MRR) and Precision@1. However, its mean processing time shows that it is slower than IPMH, owing to its larger descriptor size. This research also conducted a series of experiments to evaluate the effect of spatio-temporal parameters on IPM-based descriptors. The scope of the experiments is limited to the content-preserving visual distortions T3, T4, T5 and T6, which are the functional scope of global visual descriptors.
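The two ideas the abstract relies on, a normalized histogram of intra-prediction modes and the MRR and Precision@1 measures, can be illustrated with a minimal Python sketch. It is not the author's implementation. The abstract does not say how the modes are unified into 10 bins, so the integer labels 0 to 9 are an assumption here, and the function names `mode_histogram`, `mean_reciprocal_rank`, and `precision_at_1` are hypothetical.

```python
from collections import Counter
from typing import List, Optional, Sequence


def mode_histogram(modes: Sequence[int], n_bins: int = 10) -> List[float]:
    """Return the normalized histogram of mode labels.

    ASSUMPTION: each block's unified intra-prediction mode has already been
    mapped to an integer label in [0, n_bins). The actual mapping used by
    e-IPMH is not given in the abstract.
    """
    if not modes:
        raise ValueError("at least one mode label is required")
    counts = Counter(modes)
    if any(label < 0 or label >= n_bins for label in counts):
        raise ValueError("mode label outside [0, n_bins)")
    total = len(modes)
    # Dividing by the total turns counts into a probability distribution.
    return [counts.get(b, 0) / total for b in range(n_bins)]


def mean_reciprocal_rank(first_hit_ranks: Sequence[Optional[int]]) -> float:
    """Mean of 1/rank over queries; ranks are 1-based.

    None means no correct result was returned for that query, which
    contributes 0 to the mean.
    """
    if not first_hit_ranks:
        raise ValueError("at least one query is required")
    total = sum(0.0 if r is None else 1.0 / r for r in first_hit_ranks)
    return total / len(first_hit_ranks)


def precision_at_1(first_hit_ranks: Sequence[Optional[int]]) -> float:
    """Fraction of queries whose top-ranked result is correct."""
    if not first_hit_ranks:
        raise ValueError("at least one query is required")
    return sum(1 for r in first_hit_ranks if r == 1) / len(first_hit_ranks)


if __name__ == "__main__":
    # Hypothetical mode labels for the blocks of one key frame.
    print(mode_histogram([0, 0, 2, 9]))
    # Hypothetical rank of the first correct match for four queries.
    ranks = [1, 2, None, 4]
    print(mean_reciprocal_rank(ranks), precision_at_1(ranks))
```

In a copy-detection setting, one such histogram would be computed per key frame (or per spatial region of it) and compared against reference histograms with a distance measure; the choice of distance is left open here because the abstract does not name one.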
