Multi-modal subspace learning with dropout regularization for cross-modal recognition and retrieval

2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA) Pub Date : 2016-12-01 DOI:10.1109/IPTA.2016.7821032

G. Cao, Muhammad-Adeel Waris, Alexandros Iosifidis, M. Gabbouj

引用次数: 2

Abstract

There has been a surge of efforts in cross-modal recognition and retrieval in recent multimedia research. Towards this goal, we investigate a multi-modal subspace learning algorithm together with the Dropout regularizer. Inspired by the regularization for neural networks, we propose to aritificially remove the effect of certain amount of feature bins using the probabilistic approach to prevent the linear subspace learning from over-fitting. The novel regularizer is well integrated into the multi-modal learning algorithm which maximizes the between-class scatter while minimizing the within-class scatter in the projected latent space. The new objective function can be solved efficiently as the generalized eigenvalue problem. Experimental results have shown that superior performance can be obtained in both face-sketch recognition and cross-modal retrieval applications.

查看原文本刊更多论文

基于dropout正则化的多模态子空间学习跨模态识别与检索

跨模态识别与检索是近年来多媒体研究的热点。为了实现这一目标，我们研究了一种多模态子空间学习算法和Dropout正则化器。受神经网络正则化的启发，我们提出使用概率方法人工去除一定数量的特征箱的影响，以防止线性子空间学习过度拟合。该正则化器很好地集成到多模态学习算法中，在投影潜在空间中最大化类间散点，同时最小化类内散点。新的目标函数可以作为广义特征值问题有效地求解。实验结果表明，该方法在人脸素描识别和跨模态检索中均能取得较好的效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA)

自引率

0.00%

发文量