{"title":"Fuzzy order-equivalence for similarity measures","authors":"M. Rifqi, Marie-Jeanne Lesot, Marcin Detyniecki","doi":"10.1109/NAFIPS.2008.4531238","DOIUrl":null,"url":null,"abstract":"Similarity measures constitute a central component of machine learning and retrieval systems, and the choice of an appropriate measure is of major importance. In this paper, we consider this issue from the point of view of the order induced by the measures when comparing a set of objects to a given reference, i.e. the ranking from the most similar object to the least similar one. We introduce the notion of fuzzy order- equivalence, based on degrees that quantify the extent to which the induced orders differ. We define these degrees using the generalized Kendall's rank correlation, taking into account the number of order permutations as well as their positions. We then present an automatic and hierarchical classification of usual similarity measures that makes it possible to indicate, for a given number of tolerated variations, the measures that will yield rankings without significant changes; it thus provides a guideline for set data similarity measure selection.","PeriodicalId":430770,"journal":{"name":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAFIPS.2008.4531238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Similarity measures constitute a central component of machine learning and retrieval systems, and the choice of an appropriate measure is of major importance. In this paper, we consider this issue from the point of view of the order induced by the measures when comparing a set of objects to a given reference, i.e. the ranking from the most similar object to the least similar one. We introduce the notion of fuzzy order- equivalence, based on degrees that quantify the extent to which the induced orders differ. We define these degrees using the generalized Kendall's rank correlation, taking into account the number of order permutations as well as their positions. We then present an automatic and hierarchical classification of usual similarity measures that makes it possible to indicate, for a given number of tolerated variations, the measures that will yield rankings without significant changes; it thus provides a guideline for set data similarity measure selection.