Sex estimation using long bones in the largest burial site of the Copper Age: Linear discriminant analysis and random forest

IF 1.5 2区 历史学 N/A ARCHAEOLOGY
{"title":"Sex estimation using long bones in the largest burial site of the Copper Age: Linear discriminant analysis and random forest","authors":"","doi":"10.1016/j.jasrep.2024.104730","DOIUrl":null,"url":null,"abstract":"<div><p>Sex estimation of the individuals in a sample is fundamental for any bioarchaeological study to define a particular demographic assemblage or to classify isolated remains. Long bones are an excellent alternative for sex estimation when the most dimorphic anatomical parts are not preserved or are highly altered. Here we propose a set of discriminant functions and classification models to estimate the sex of prehistoric individuals using linear discriminant analysis and machine learning approaches. Different osteometric variables were taken from the humeri, ulnae, radii, femurs and tibias of a sample of 109 articulated skeletons buried in the collective tomb of Camino del Molino (Region of Murcia, SE-Spain), dated to the 3rd millennium BC. Sex was estimated based on standard anthropological methods and ancient DNA analysis of a control sample. Fifty-two discriminant functions with prediction thresholds higher than 0.8 on the ROC curve were obtained using independent (22) and combined variables (30). The best LDA models for sex prediction were those based on proximal epiphyseal widths or their combination with other variables, reaching values close to 0.98 on the ROC curve. The random forest-based model obtained an accuracy of 0.94 and confirmed the importance of epiphyseal widths in sex classification. This analysis is more comprehensive than univariate LDA, as it allows for ranking the importance of bones in sex discrimination and considers correlations between long bones rather than treating them as independent observations. In contrast, applying LDA to each bone makes it easier to predict the sex of other coeval collections that do not have such a complete sample. This work aims to overcome the scarcity of methods that can be applied to sex estimation of the large volume of isolated remains from Camino del Molino and for other Mediterranean skeletal series from the Late Prehistory with high biological affinity and that share similar environmental conditions.</p></div>","PeriodicalId":48150,"journal":{"name":"Journal of Archaeological Science-Reports","volume":null,"pages":null},"PeriodicalIF":1.5000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352409X24003584/pdfft?md5=5ca7fb69d2e542dbe4efa2198e0f7ea2&pid=1-s2.0-S2352409X24003584-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Archaeological Science-Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352409X24003584","RegionNum":2,"RegionCategory":"历史学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"N/A","JCRName":"ARCHAEOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Sex estimation of the individuals in a sample is fundamental for any bioarchaeological study to define a particular demographic assemblage or to classify isolated remains. Long bones are an excellent alternative for sex estimation when the most dimorphic anatomical parts are not preserved or are highly altered. Here we propose a set of discriminant functions and classification models to estimate the sex of prehistoric individuals using linear discriminant analysis and machine learning approaches. Different osteometric variables were taken from the humeri, ulnae, radii, femurs and tibias of a sample of 109 articulated skeletons buried in the collective tomb of Camino del Molino (Region of Murcia, SE-Spain), dated to the 3rd millennium BC. Sex was estimated based on standard anthropological methods and ancient DNA analysis of a control sample. Fifty-two discriminant functions with prediction thresholds higher than 0.8 on the ROC curve were obtained using independent (22) and combined variables (30). The best LDA models for sex prediction were those based on proximal epiphyseal widths or their combination with other variables, reaching values close to 0.98 on the ROC curve. The random forest-based model obtained an accuracy of 0.94 and confirmed the importance of epiphyseal widths in sex classification. This analysis is more comprehensive than univariate LDA, as it allows for ranking the importance of bones in sex discrimination and considers correlations between long bones rather than treating them as independent observations. In contrast, applying LDA to each bone makes it easier to predict the sex of other coeval collections that do not have such a complete sample. This work aims to overcome the scarcity of methods that can be applied to sex estimation of the large volume of isolated remains from Camino del Molino and for other Mediterranean skeletal series from the Late Prehistory with high biological affinity and that share similar environmental conditions.

利用铜器时代最大墓葬遗址中的长骨进行性别估计:线性判别分析和随机森林
对样本中的个体进行性别估计是任何生物考古学研究的基础,以确定特定的人口组合或对孤立的遗骸进行分类。如果最二态的解剖部位没有保存下来或者发生了很大的变化,长骨是进行性别估计的绝佳选择。在这里,我们提出了一套判别函数和分类模型,利用线性判别分析和机器学习方法来估计史前个体的性别。我们从埋葬在卡米诺德尔莫利诺(西班牙东南部穆尔西亚地区)集体墓葬中的 109 具有关节的骸骨样本(年代为公元前三千年)的肱骨、尺骨、桡骨、股骨和胫骨中提取了不同的骨测量变量。性别是根据标准人类学方法和对照样本的古 DNA 分析进行估计的。利用独立变量(22 个)和组合变量(30 个),获得了 52 个在 ROC 曲线上预测阈值高于 0.8 的判别函数。性别预测效果最好的 LDA 模型是基于近端骺端宽度或与其他变量组合的模型,其 ROC 曲线值接近 0.98。基于随机森林的模型准确率为 0.94,证实了骺端宽度在性别分类中的重要性。这种分析比单变量 LDA 更为全面,因为它允许对骨骼在性别鉴别中的重要性进行排序,并考虑了长骨之间的相关性,而不是将它们视为独立的观察对象。相比之下,对每块骨头应用 LDA 可以更容易地预测没有如此完整样本的其他同时期采集物的性别。这项工作的目的是克服可用于对来自卡米诺-德尔-莫利诺的大量孤立遗骸进行性别估计的方法稀缺的问题,以及对具有高度生物亲缘关系和类似环境条件的史前晚期其他地中海骨骼系列进行性别估计的问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.10
自引率
12.50%
发文量
405
期刊介绍: Journal of Archaeological Science: Reports is aimed at archaeologists and scientists engaged with the application of scientific techniques and methodologies to all areas of archaeology. The journal focuses on the results of the application of scientific methods to archaeological problems and debates. It will provide a forum for reviews and scientific debate of issues in scientific archaeology and their impact in the wider subject. Journal of Archaeological Science: Reports will publish papers of excellent archaeological science, with regional or wider interest. This will include case studies, reviews and short papers where an established scientific technique sheds light on archaeological questions and debates.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信