基于接收信号强度的指纹数据库中不同距离指纹相似度指标对亲缘传播聚类性能的影响

IF 2.9 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Abdulmalik Shehu Yaro;Filip Maly;Karel Maly;Pavel Prazak
{"title":"基于接收信号强度的指纹数据库中不同距离指纹相似度指标对亲缘传播聚类性能的影响","authors":"Abdulmalik Shehu Yaro;Filip Maly;Karel Maly;Pavel Prazak","doi":"10.1109/OJSP.2024.3449816","DOIUrl":null,"url":null,"abstract":"The affinity propagation clustering (APC) algorithm is popular for fingerprint database clustering because it can cluster without pre-defining the number of clusters. However, the clustering performance of the APC algorithm heavily depends on the chosen fingerprint similarity metric, with distance-based metrics being the most commonly used. Despite its popularity, the APC algorithm lacks comprehensive research on how distance-based metrics affect clustering performance. This emphasizes the need for a better understanding of how these metrics influence its clustering performance, particularly in fingerprint databases. This paper investigates the impact of various distance-based fingerprint similarity metrics on the clustering performance of the APC algorithm. It identifies the best fingerprint similarity metric for optimal clustering performance for a given fingerprint database. The analysis is conducted across five experimentally generated online fingerprint databases, utilizing seven distance-based metrics: Euclidean, squared Euclidean, Manhattan, Spearman, cosine, Canberra, and Chebyshev distances. Using the silhouette score as the performance metric, the simulation results indicate that structural characteristics of the fingerprint database, such as the distribution of fingerprint vectors, play a key role in selecting the best fingerprint similarity metric. However, Euclidean and Manhattan distances are generally the preferable choices for use as fingerprint similarity metrics for the APC algorithm across most fingerprint databases, regardless of their structural characteristics. It is recommended that other factors, such as computational intensity and the presence or absence of outliers, be considered alongside the structural characteristics of the fingerprint database when choosing the appropriate fingerprint similarity metric for maximum clustering performance.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"5 ","pages":"1005-1014"},"PeriodicalIF":2.9000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10646489","citationCount":"0","resultStr":"{\"title\":\"Impact of Varying Distance-Based Fingerprint Similarity Metrics on Affinity Propagation Clustering Performance in Received Signal Strength-Based Fingerprint Databases\",\"authors\":\"Abdulmalik Shehu Yaro;Filip Maly;Karel Maly;Pavel Prazak\",\"doi\":\"10.1109/OJSP.2024.3449816\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The affinity propagation clustering (APC) algorithm is popular for fingerprint database clustering because it can cluster without pre-defining the number of clusters. However, the clustering performance of the APC algorithm heavily depends on the chosen fingerprint similarity metric, with distance-based metrics being the most commonly used. Despite its popularity, the APC algorithm lacks comprehensive research on how distance-based metrics affect clustering performance. This emphasizes the need for a better understanding of how these metrics influence its clustering performance, particularly in fingerprint databases. This paper investigates the impact of various distance-based fingerprint similarity metrics on the clustering performance of the APC algorithm. It identifies the best fingerprint similarity metric for optimal clustering performance for a given fingerprint database. The analysis is conducted across five experimentally generated online fingerprint databases, utilizing seven distance-based metrics: Euclidean, squared Euclidean, Manhattan, Spearman, cosine, Canberra, and Chebyshev distances. Using the silhouette score as the performance metric, the simulation results indicate that structural characteristics of the fingerprint database, such as the distribution of fingerprint vectors, play a key role in selecting the best fingerprint similarity metric. However, Euclidean and Manhattan distances are generally the preferable choices for use as fingerprint similarity metrics for the APC algorithm across most fingerprint databases, regardless of their structural characteristics. It is recommended that other factors, such as computational intensity and the presence or absence of outliers, be considered alongside the structural characteristics of the fingerprint database when choosing the appropriate fingerprint similarity metric for maximum clustering performance.\",\"PeriodicalId\":73300,\"journal\":{\"name\":\"IEEE open journal of signal processing\",\"volume\":\"5 \",\"pages\":\"1005-1014\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10646489\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE open journal of signal processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10646489/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE open journal of signal processing","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10646489/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

亲和传播聚类(APC)算法在指纹数据库聚类中很受欢迎,因为它可以在不预先确定聚类数量的情况下进行聚类。然而,APC 算法的聚类性能在很大程度上取决于所选的指纹相似度指标,其中基于距离的指标最为常用。尽管 APC 算法很受欢迎,但对基于距离的指标如何影响聚类性能缺乏全面的研究。这就强调了更好地了解这些指标如何影响其聚类性能的必要性,尤其是在指纹数据库中。本文研究了各种基于距离的指纹相似度指标对 APC 算法聚类性能的影响。它为给定的指纹数据库确定了最佳聚类性能的最佳指纹相似度指标。分析在五个实验生成的在线指纹数据库中进行,使用了七种基于距离的指标:欧氏距离、欧氏平方距离、曼哈顿距离、斯皮尔曼距离、余弦距离、堪培拉距离和切比雪夫距离。模拟结果表明,以剪影得分作为性能指标,指纹数据库的结构特征(如指纹向量的分布)在选择最佳指纹相似度指标时起着关键作用。不过,在大多数指纹数据库中,无论其结构特征如何,欧几里得距离和曼哈顿距离通常都是 APC 算法用作指纹相似度度量的首选。建议在选择适当的指纹相似度指标以实现最高聚类性能时,除了考虑指纹数据库的结构特征外,还要考虑其他因素,如计算强度和是否存在异常值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Impact of Varying Distance-Based Fingerprint Similarity Metrics on Affinity Propagation Clustering Performance in Received Signal Strength-Based Fingerprint Databases
The affinity propagation clustering (APC) algorithm is popular for fingerprint database clustering because it can cluster without pre-defining the number of clusters. However, the clustering performance of the APC algorithm heavily depends on the chosen fingerprint similarity metric, with distance-based metrics being the most commonly used. Despite its popularity, the APC algorithm lacks comprehensive research on how distance-based metrics affect clustering performance. This emphasizes the need for a better understanding of how these metrics influence its clustering performance, particularly in fingerprint databases. This paper investigates the impact of various distance-based fingerprint similarity metrics on the clustering performance of the APC algorithm. It identifies the best fingerprint similarity metric for optimal clustering performance for a given fingerprint database. The analysis is conducted across five experimentally generated online fingerprint databases, utilizing seven distance-based metrics: Euclidean, squared Euclidean, Manhattan, Spearman, cosine, Canberra, and Chebyshev distances. Using the silhouette score as the performance metric, the simulation results indicate that structural characteristics of the fingerprint database, such as the distribution of fingerprint vectors, play a key role in selecting the best fingerprint similarity metric. However, Euclidean and Manhattan distances are generally the preferable choices for use as fingerprint similarity metrics for the APC algorithm across most fingerprint databases, regardless of their structural characteristics. It is recommended that other factors, such as computational intensity and the presence or absence of outliers, be considered alongside the structural characteristics of the fingerprint database when choosing the appropriate fingerprint similarity metric for maximum clustering performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.30
自引率
0.00%
发文量
0
审稿时长
22 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信