Similarity Learning for Nearest Neighbor Classification
A. M. Qamar, Éric Gaussier, J. Chevallet, Joo-Hwee Lim
2008 Eighth IEEE International Conference on Data Mining
DOI: 10.1109/ICDM.2008.81 (published 2008-12-15)
Citations: 63
Abstract
In this paper, we propose an algorithm for learning a general class of similarity measures for kNN classification. This class encompasses, among others, the standard cosine measure, as well as the Dice and Jaccard coefficients. The algorithm we propose is an extension of the voted perceptron algorithm and allows one to learn different types of similarity functions (based on diagonal, symmetric, or asymmetric similarity matrices). The results we obtained show that learning similarity measures yields significant improvements on several collections, for two prediction rules: the standard kNN rule, which was our primary goal, and a symmetric version of it.
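The abstract describes a perceptron-style approach to learning a similarity measure that is then plugged into kNN. The sketch below is only a rough illustration under our own assumptions, not the paper's algorithm: it uses a bilinear similarity s_A(x, y) = xᵀ A y with a simple perceptron-like update that pulls a same-class neighbor closer and pushes a different-class neighbor away. The class name `SimilarityPerceptron`, the helper `knn_predict`, and the update rule itself are hypothetical; the paper's actual updates, normalization (e.g., cosine/Dice/Jaccard denominators), and voting scheme differ.

```python
import numpy as np

class SimilarityPerceptron:
    """Illustrative perceptron-style learner for a bilinear similarity
    s_A(x, y) = x^T A y. A hypothetical sketch, not the paper's method."""

    def __init__(self, n_features, diagonal_only=False):
        self.A = np.eye(n_features)          # start from an identity (cosine-like) similarity
        self.diagonal_only = diagonal_only   # optionally restrict A to a diagonal matrix

    def similarity(self, x, y):
        return x @ self.A @ y

    def fit(self, X, labels, epochs=10):
        n = len(X)
        for _ in range(epochs):
            for i in range(n):
                # similarities from example i to all others under the current A
                sims = np.array([self.similarity(X[i], X[j]) for j in range(n)])
                sims[i] = -np.inf
                same = [j for j in range(n) if labels[j] == labels[i] and j != i]
                diff = [j for j in range(n) if labels[j] != labels[i]]
                if not same or not diff:
                    continue
                target = max(same, key=lambda j: sims[j])    # most similar same-class point
                impostor = max(diff, key=lambda j: sims[j])  # most similar other-class point
                # perceptron-style correction when the impostor is at least as similar
                if sims[impostor] >= sims[target]:
                    update = np.outer(X[i], X[target]) - np.outer(X[i], X[impostor])
                    if self.diagonal_only:
                        update = np.diag(np.diag(update))
                    self.A += update
        return self

def knn_predict(model, X_train, y_train, x, k=3):
    """Classify x by majority vote among the k training points most similar
    to it under the learned similarity (y_train is a NumPy array of labels)."""
    sims = np.array([model.similarity(x, xi) for xi in X_train])
    top = np.argsort(sims)[-k:]
    values, counts = np.unique(y_train[top], return_counts=True)
    return values[np.argmax(counts)]
```

A symmetric variant of the prediction rule, as mentioned in the abstract, would additionally let training points "vote" for the test point when it ranks among their own nearest neighbors; the sketch above only implements the standard kNN rule.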