Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video Search

Proceedings of the 19th International Conference on Content-based Multimedia Indexing Pub Date : 2022-09-14 DOI:10.1145/3549555.3549600

Varsha Devi, P. Mulhem, G. Quénot

引用次数: 0

Abstract

This paper focuses on studying the complementarity between the spaces from hybrid cross-modal state-of-the-art systems for video retrieval like [5]. We aim at investigating if these spaces really convey different features, or if they are representing the same things. We use PCA (Principal Component Analysis) to study the optimal dimensions, CCA (Canonical Correlation Analysis) to assess the similarity of the spaces, and check if such approach is in fact similar to ensemble learning. We achieve experiments on the MST-VTT corpus, and show that in fact these two spaces are indeed very similar, paving the way for new models that could enforce more dissimilar spaces.

查看原文本刊更多论文

跨模态视频搜索中潜在空间和概念空间的互补性分析

本文主要研究[5]等混合跨模态视频检索系统空间间的互补性。我们的目标是研究这些空间是否真的传达了不同的特征，或者它们是否代表了相同的东西。我们使用PCA(主成分分析)来研究最优维度，CCA(典型相关分析)来评估空间的相似性，并检查这种方法是否实际上类似于集成学习。我们在MST-VTT语料库上实现了实验，并表明事实上这两个空间确实非常相似，为可以执行更多不同空间的新模型铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 19th International Conference on Content-based Multimedia Indexing

自引率

0.00%

发文量