Content-Based Video Relevance Prediction with Second-Order Relevance and Attention Modeling

Proceedings of the 26th ACM international conference on Multimedia Pub Date : 2018-10-15 DOI:10.1145/3240508.3266434

Xusong Chen, Rui Zhao, Shengjie Ma, Dong Liu, Zhengjun Zha

引用次数: 6

Abstract

This paper describes our proposed method for the Content-Based Video Relevance Prediction (CBVRP) challenge. Our method is based on deep learning, i.e. we train a deep network to predict the relevance between two video sequences from their features. We explore the usage of second-order relevance, both in preparing training data, and in extending the deep network. Second-order relevance refers to e.g. the relevance between x and z if x is relevant to y and y is relevant to z. In our proposed method, we use second-order relevance to increase positive samples and decrease negative samples, when preparing training data. We further extend the deep network with an attention module, where the attention mechanism is designed for second-order relevant video sequences. We verify the effectiveness of our method on the validation set of the CBVRP challenge.

查看原文本刊更多论文

通过二阶相关性和注意力建模进行基于内容的视频相关性预测

本文介绍了我们针对基于内容的视频相关性预测（CBVRP）挑战提出的方法。我们的方法基于深度学习，即我们训练一个深度网络，根据两个视频序列的特征来预测它们之间的相关性。我们在准备训练数据和扩展深度网络时都探索了二阶相关性的用法。在我们提出的方法中，我们在准备训练数据时使用二阶相关性来增加正样本和减少负样本。我们还利用注意力模块进一步扩展了深度网络，其中的注意力机制是针对二阶相关视频序列设计的。我们在 CBVRP 挑战赛的验证集上验证了我们方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 26th ACM international conference on Multimedia

自引率

0.00%

发文量