{"title":"Supervised Pattern Recognition Involving Skewed Feature Densities","authors":"Alexandre Benatti, Luciano da F. Costa","doi":"arxiv-2409.01213","DOIUrl":null,"url":null,"abstract":"Pattern recognition constitutes a particularly important task underlying a\ngreat deal of scientific and technologica activities. At the same time, pattern\nrecognition involves several challenges, including the choice of features to\nrepresent the data elements, as well as possible respective transformations. In\nthe present work, the classification potential of the Euclidean distance and a\ndissimilarity index based on the coincidence similarity index are compared by\nusing the k-neighbors supervised classification method respectively to features\nresulting from several types of transformations of one- and two-dimensional\nsymmetric densities. Given two groups characterized by respective densities\nwithout or with overlap, different types of respective transformations are\nobtained and employed to quantitatively evaluate the performance of k-neighbors\nmethodologies based on the Euclidean distance an coincidence similarity index.\nMore specifically, the accuracy of classifying the intersection point between\nthe densities of two adjacent groups is taken into account for the comparison.\nSeveral interesting results are described and discussed, including the enhanced\npotential of the dissimilarity index for classifying datasets with right skewed\nfeature densities, as well as the identification that the sharpness of the\ncomparison between data elements can be independent of the respective\nsupervised classification performance.","PeriodicalId":501043,"journal":{"name":"arXiv - PHYS - Physics and Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Physics and Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pattern recognition constitutes a particularly important task underlying a
great deal of scientific and technologica activities. At the same time, pattern
recognition involves several challenges, including the choice of features to
represent the data elements, as well as possible respective transformations. In
the present work, the classification potential of the Euclidean distance and a
dissimilarity index based on the coincidence similarity index are compared by
using the k-neighbors supervised classification method respectively to features
resulting from several types of transformations of one- and two-dimensional
symmetric densities. Given two groups characterized by respective densities
without or with overlap, different types of respective transformations are
obtained and employed to quantitatively evaluate the performance of k-neighbors
methodologies based on the Euclidean distance an coincidence similarity index.
More specifically, the accuracy of classifying the intersection point between
the densities of two adjacent groups is taken into account for the comparison.
Several interesting results are described and discussed, including the enhanced
potential of the dissimilarity index for classifying datasets with right skewed
feature densities, as well as the identification that the sharpness of the
comparison between data elements can be independent of the respective
supervised classification performance.