Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification

2014 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2014-06-23 DOI:10.1109/CVPR.2014.130

Yadong Mu, G. Hua, Wei Fan, Shih-Fu Chang

{"title":"Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification","authors":"Yadong Mu, G. Hua, Wei Fan, Shih-Fu Chang","doi":"10.1109/CVPR.2014.130","DOIUrl":null,"url":null,"abstract":"This paper presents a novel algorithm which uses compact hash bits to greatly improve the efficiency of non-linear kernel SVM in very large scale visual classification problems. Our key idea is to represent each sample with compact hash bits, over which an inner product is defined to serve as the surrogate of the original nonlinear kernels. Then the problem of solving the nonlinear SVM can be transformed into solving a linear SVM over the hash bits. The proposed Hash-SVM enjoys dramatic storage cost reduction owing to the compact binary representation, as well as a (sub-)linear training complexity via linear SVM. As a critical component of Hash-SVM, we propose a novel hashing scheme for arbitrary non-linear kernels via random subspace projection in reproducing kernel Hilbert space. Our comprehensive analysis reveals a well behaved theoretic bound of the deviation between the proposed hashing-based kernel approximation and the original kernel function. We also derive requirements on the hash bits for achieving a satisfactory accuracy level. Several experiments on large-scale visual classification benchmarks are conducted, including one with over 1 million images. The results show that Hash-SVM greatly reduces the computational complexity (more than ten times faster in many cases) while keeping comparable accuracies.","PeriodicalId":319578,"journal":{"name":"2014 IEEE Conference on Computer Vision and Pattern Recognition","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"56","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Conference on Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2014.130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 56

Abstract

This paper presents a novel algorithm which uses compact hash bits to greatly improve the efficiency of non-linear kernel SVM in very large scale visual classification problems. Our key idea is to represent each sample with compact hash bits, over which an inner product is defined to serve as the surrogate of the original nonlinear kernels. Then the problem of solving the nonlinear SVM can be transformed into solving a linear SVM over the hash bits. The proposed Hash-SVM enjoys dramatic storage cost reduction owing to the compact binary representation, as well as a (sub-)linear training complexity via linear SVM. As a critical component of Hash-SVM, we propose a novel hashing scheme for arbitrary non-linear kernels via random subspace projection in reproducing kernel Hilbert space. Our comprehensive analysis reveals a well behaved theoretic bound of the deviation between the proposed hashing-based kernel approximation and the original kernel function. We also derive requirements on the hash bits for achieving a satisfactory accuracy level. Several experiments on large-scale visual classification benchmarks are conducted, including one with over 1 million images. The results show that Hash-SVM greatly reduces the computational complexity (more than ten times faster in many cases) while keeping comparable accuracies.

查看原文本刊更多论文

哈希支持向量机:大规模视觉分类的可扩展核机

本文提出了一种新颖的算法，利用紧凑的哈希位大大提高了非线性核支持向量机在超大规模视觉分类问题中的效率。我们的关键思想是用紧凑的哈希位表示每个样本，在其上定义一个内积作为原始非线性核的代理。然后将求解非线性支持向量机问题转化为求解哈希位上的线性支持向量机问题。所提出的哈希支持向量机由于其紧凑的二进制表示而大大降低了存储成本，并且通过线性支持向量机的(亚)线性训练复杂度。作为哈希支持向量机的关键组成部分，我们提出了一种基于随机子空间投影的任意非线性核的哈希算法。我们的综合分析揭示了所提出的基于哈希的核近似与原始核函数之间偏差的一个良好的理论边界。我们还推导了对哈希位的要求，以达到令人满意的精度水平。进行了几个大规模视觉分类基准实验，其中包括一个超过100万张图像的实验。结果表明，哈希支持向量机大大降低了计算复杂度(在许多情况下超过十倍)，同时保持了相当的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量