基于各种欧几里得平方和广义马哈拉诺比斯距离的生物距离分析算法,结合概率分层聚类分析和多维缩放技术

IF 2.1 2区 地球科学 Q1 ANTHROPOLOGY
Efthymia Nikita, Panos Nikitas
{"title":"基于各种欧几里得平方和广义马哈拉诺比斯距离的生物距离分析算法,结合概率分层聚类分析和多维缩放技术","authors":"Efthymia Nikita,&nbsp;Panos Nikitas","doi":"10.1007/s12520-024-02098-y","DOIUrl":null,"url":null,"abstract":"<div><p>Biodistance analysis identifies groups that exhibit biological affinity based on phenotypic data. This study proposes and evaluates the performance of algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances by combining them with probabilistic hierarchical cluster analysis (HCA) and multidimensional scaling (MDS). Four archaeological datasets of human dental metrics and/or non-metric traits were used. To analyze the data, we integrated our previous work on biodistances and developed algorithms that calculate various types of squared Euclidean and generalized Mahalanobis distances, estimate various parameters, apply modified MDS and HCA methods to compute all possible cluster probabilities, and provide MDS confidence ellipses and dendrograms with cluster probabilities. All algorithms are implemented in R. From the data analysis, we found that all distances studied are simulated very satisfactorily by the Monte-Carlo method, resulting in the estimation of accurate cluster probabilities. Examining the probabilities of expected cluster formation, we found that these probabilities are higher when calculated using generalized Mahalanobis distances than the corresponding Euclidean distances. Therefore, the cluster probabilities supported that the generalized Mahalanobis distances are better than the corresponding Euclidean distances in cluster analysis. From a methodological point of view, clustering information concerning population affinities should not be based on a single dendrogram but instead be extracted from the list of the most frequent clusters obtained from all simulated dendrograms.</p></div>","PeriodicalId":8214,"journal":{"name":"Archaeological and Anthropological Sciences","volume":"16 12","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances combined with probabilistic hierarchical cluster analysis and multidimensional scaling\",\"authors\":\"Efthymia Nikita,&nbsp;Panos Nikitas\",\"doi\":\"10.1007/s12520-024-02098-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Biodistance analysis identifies groups that exhibit biological affinity based on phenotypic data. This study proposes and evaluates the performance of algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances by combining them with probabilistic hierarchical cluster analysis (HCA) and multidimensional scaling (MDS). Four archaeological datasets of human dental metrics and/or non-metric traits were used. To analyze the data, we integrated our previous work on biodistances and developed algorithms that calculate various types of squared Euclidean and generalized Mahalanobis distances, estimate various parameters, apply modified MDS and HCA methods to compute all possible cluster probabilities, and provide MDS confidence ellipses and dendrograms with cluster probabilities. All algorithms are implemented in R. From the data analysis, we found that all distances studied are simulated very satisfactorily by the Monte-Carlo method, resulting in the estimation of accurate cluster probabilities. Examining the probabilities of expected cluster formation, we found that these probabilities are higher when calculated using generalized Mahalanobis distances than the corresponding Euclidean distances. Therefore, the cluster probabilities supported that the generalized Mahalanobis distances are better than the corresponding Euclidean distances in cluster analysis. From a methodological point of view, clustering information concerning population affinities should not be based on a single dendrogram but instead be extracted from the list of the most frequent clusters obtained from all simulated dendrograms.</p></div>\",\"PeriodicalId\":8214,\"journal\":{\"name\":\"Archaeological and Anthropological Sciences\",\"volume\":\"16 12\",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archaeological and Anthropological Sciences\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12520-024-02098-y\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ANTHROPOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archaeological and Anthropological Sciences","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s12520-024-02098-y","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

生物距离分析可根据表型数据确定表现出生物亲缘关系的群体。本研究提出并评估了基于各种欧氏平方距离和广义马哈罗诺比距离的生物距离分析算法的性能,并将其与概率分层聚类分析(HCA)和多维尺度分析(MDS)相结合。我们使用了四个人类牙齿度量和/或非度量特征的考古数据集。为了分析这些数据,我们整合了之前在生物距离方面的研究成果,并开发了一些算法来计算各种类型的欧氏平方距离和广义马哈拉诺比距离,估计各种参数,应用改进的 MDS 和 HCA 方法来计算所有可能的聚类概率,并提供 MDS 置信椭圆和带有聚类概率的树枝图。通过数据分析,我们发现所有研究的距离都能通过蒙特卡洛方法得到非常令人满意的模拟,从而估算出准确的聚类概率。在研究预期聚类形成的概率时,我们发现使用广义马哈拉诺比距离计算的概率要高于相应的欧氏距离。因此,聚类概率证明,在聚类分析中,广义马哈拉诺比距离优于相应的欧氏距离。从方法学的角度来看,有关种群亲缘关系的聚类信息不应该基于单一的树枝图,而应该从所有模拟树枝图中获得的最频繁聚类列表中提取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances combined with probabilistic hierarchical cluster analysis and multidimensional scaling

Algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances combined with probabilistic hierarchical cluster analysis and multidimensional scaling

Biodistance analysis identifies groups that exhibit biological affinity based on phenotypic data. This study proposes and evaluates the performance of algorithms for biodistance analysis based on various squared Euclidean and generalized Mahalanobis distances by combining them with probabilistic hierarchical cluster analysis (HCA) and multidimensional scaling (MDS). Four archaeological datasets of human dental metrics and/or non-metric traits were used. To analyze the data, we integrated our previous work on biodistances and developed algorithms that calculate various types of squared Euclidean and generalized Mahalanobis distances, estimate various parameters, apply modified MDS and HCA methods to compute all possible cluster probabilities, and provide MDS confidence ellipses and dendrograms with cluster probabilities. All algorithms are implemented in R. From the data analysis, we found that all distances studied are simulated very satisfactorily by the Monte-Carlo method, resulting in the estimation of accurate cluster probabilities. Examining the probabilities of expected cluster formation, we found that these probabilities are higher when calculated using generalized Mahalanobis distances than the corresponding Euclidean distances. Therefore, the cluster probabilities supported that the generalized Mahalanobis distances are better than the corresponding Euclidean distances in cluster analysis. From a methodological point of view, clustering information concerning population affinities should not be based on a single dendrogram but instead be extracted from the list of the most frequent clusters obtained from all simulated dendrograms.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Archaeological and Anthropological Sciences
Archaeological and Anthropological Sciences GEOSCIENCES, MULTIDISCIPLINARY-
CiteScore
4.80
自引率
18.20%
发文量
199
期刊介绍: Archaeological and Anthropological Sciences covers the full spectrum of natural scientific methods with an emphasis on the archaeological contexts and the questions being studied. It bridges the gap between archaeologists and natural scientists providing a forum to encourage the continued integration of scientific methodologies in archaeological research. Coverage in the journal includes: archaeology, geology/geophysical prospection, geoarchaeology, geochronology, palaeoanthropology, archaeozoology and archaeobotany, genetics and other biomolecules, material analysis and conservation science. The journal is endorsed by the German Society of Natural Scientific Archaeology and Archaeometry (GNAA), the Hellenic Society for Archaeometry (HSC), the Association of Italian Archaeometrists (AIAr) and the Society of Archaeological Sciences (SAS).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信