Fuzzy order-equivalence for similarity measures

NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society Pub Date : 2008-05-19 DOI:10.1109/NAFIPS.2008.4531238

M. Rifqi, Marie-Jeanne Lesot, Marcin Detyniecki

引用次数: 9

Abstract

Similarity measures constitute a central component of machine learning and retrieval systems, and the choice of an appropriate measure is of major importance. In this paper, we consider this issue from the point of view of the order induced by the measures when comparing a set of objects to a given reference, i.e. the ranking from the most similar object to the least similar one. We introduce the notion of fuzzy order- equivalence, based on degrees that quantify the extent to which the induced orders differ. We define these degrees using the generalized Kendall's rank correlation, taking into account the number of order permutations as well as their positions. We then present an automatic and hierarchical classification of usual similarity measures that makes it possible to indicate, for a given number of tolerated variations, the measures that will yield rankings without significant changes; it thus provides a guideline for set data similarity measure selection.

查看原文本刊更多论文

相似性度量的模糊顺序等价

相似性度量是机器学习和检索系统的核心组成部分，选择合适的度量非常重要。在本文中，我们从一组对象与给定参考进行比较时度量所引起的顺序的角度来考虑这个问题，即从最相似的对象到最不相似的对象的排序。我们引入了模糊阶等价的概念，基于量化诱导阶差程度的度。我们使用广义肯德尔秩相关来定义这些度，同时考虑到顺序排列的数量以及它们的位置。然后，我们提出了通常的相似性度量的自动分层分类，这使得有可能指出，对于给定数量的可容忍的变化，这些度量将产生排名，而不会发生重大变化;从而为集数据相似度度量的选择提供了指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society

自引率

0.00%

发文量