从长骨中进行个性化性别估计的可解释机器学习。

IF 2.3 3区医学 Q1 MEDICINE, LEGAL

International Journal of Legal Medicine Pub Date : 2025-10-25 DOI:10.1007/s00414-025-03635-7

Siam Knecht, Paolo Morandini, Lucie Biehler-Gomez, Yann Ardagna, Marie Perrin, Cristina Cattaneo, Christophe Roman, Pascal Adalian

{"title":"从长骨中进行个性化性别估计的可解释机器学习。","authors":"Siam Knecht, Paolo Morandini, Lucie Biehler-Gomez, Yann Ardagna, Marie Perrin, Cristina Cattaneo, Christophe Roman, Pascal Adalian","doi":"10.1007/s00414-025-03635-7","DOIUrl":null,"url":null,"abstract":"Sex estimation is an essential task in forensic anthropology. It is not only crucial for the identification of individuals from skeletal remains, but it is also essential for improving the reliability of other methods of biological profile estimation, such as age and stature, some of which perform better when sex is taken into account. This study investigates the application of machine learning (ML) techniques to sex estimation, with a particular focus on interpretability to address the \"black box\" challenge inherent in AI models. Using a diverse dataset of long bone measurements from 2,969 individuals, 12 different ML algorithms were evaluated. Missing data were handled using iterative regression imputation, though challenges arising from incomplete datasets underscored the need for improved data handling strategies. Linear Discriminant Analysis (LDA) emerged as the most accurate approach, achieving 95.2% accuracy. A key feature of this study is the integration of SHapley Additive exPlanations (SHAP) values, which provide individualized insights into the factors influencing each prediction. This interpretability framework ensures transparency and addresses legal and scientific concerns about the admissibility of AI-generated evidence in court. Indeed, misclassifications possibilities highlight the importance of clear, understandable models in forensic applications. The study emphasizes the significance of individualized prediction, illustrated by the probability of male or female classification for each individual, as well as the impact of missing values on prediction accuracy. This research demonstrates that ML models can effectively balance accuracy with interpretability, offering personalized, actionable insights for forensic investigations. It paves the way for AI-driven methods that meet both scientific rigor and legal standards, transforming sex estimation in forensic science by providing individualized, defensible evidence suitable for court.","PeriodicalId":14071,"journal":{"name":"International Journal of Legal Medicine","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretable machine learning for individualized sex estimation from long bones.\",\"authors\":\"Siam Knecht, Paolo Morandini, Lucie Biehler-Gomez, Yann Ardagna, Marie Perrin, Cristina Cattaneo, Christophe Roman, Pascal Adalian\",\"doi\":\"10.1007/s00414-025-03635-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sex estimation is an essential task in forensic anthropology. It is not only crucial for the identification of individuals from skeletal remains, but it is also essential for improving the reliability of other methods of biological profile estimation, such as age and stature, some of which perform better when sex is taken into account. This study investigates the application of machine learning (ML) techniques to sex estimation, with a particular focus on interpretability to address the \\\"black box\\\" challenge inherent in AI models. Using a diverse dataset of long bone measurements from 2,969 individuals, 12 different ML algorithms were evaluated. Missing data were handled using iterative regression imputation, though challenges arising from incomplete datasets underscored the need for improved data handling strategies. Linear Discriminant Analysis (LDA) emerged as the most accurate approach, achieving 95.2% accuracy. A key feature of this study is the integration of SHapley Additive exPlanations (SHAP) values, which provide individualized insights into the factors influencing each prediction. This interpretability framework ensures transparency and addresses legal and scientific concerns about the admissibility of AI-generated evidence in court. Indeed, misclassifications possibilities highlight the importance of clear, understandable models in forensic applications. The study emphasizes the significance of individualized prediction, illustrated by the probability of male or female classification for each individual, as well as the impact of missing values on prediction accuracy. This research demonstrates that ML models can effectively balance accuracy with interpretability, offering personalized, actionable insights for forensic investigations. It paves the way for AI-driven methods that meet both scientific rigor and legal standards, transforming sex estimation in forensic science by providing individualized, defensible evidence suitable for court.\",\"PeriodicalId\":14071,\"journal\":{\"name\":\"International Journal of Legal Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Legal Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00414-025-03635-7\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, LEGAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Legal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00414-025-03635-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}

引用次数: 0

摘要

性别估计是法医人类学的一项重要工作。这不仅对于从骨骼遗骸中识别个体至关重要，而且对于提高其他生物轮廓估计方法的可靠性也至关重要，例如年龄和身材，其中一些方法在考虑到性别时表现得更好。本研究探讨了机器学习（ML）技术在性别估计中的应用，特别关注可解释性，以解决人工智能模型固有的“黑箱”挑战。使用来自2969个人的长骨测量的不同数据集，评估了12种不同的ML算法。使用迭代回归归算处理缺失数据，但不完整数据集带来的挑战强调需要改进数据处理策略。线性判别分析（LDA）是最准确的方法，准确率达到95.2%。本研究的一个关键特征是SHapley加性解释（SHAP）值的整合，它为影响每种预测的因素提供了个性化的见解。这一可解释性框架确保了透明度，并解决了法院对人工智能生成证据可采性的法律和科学关切。事实上，错误分类的可能性突出了清晰、可理解的模型在法医应用中的重要性。该研究强调了个性化预测的重要性，通过每个个体的男性或女性分类的概率以及缺失值对预测准确性的影响来说明。这项研究表明，机器学习模型可以有效地平衡准确性和可解释性，为法医调查提供个性化的、可操作的见解。它为既符合科学严谨性又符合法律标准的人工智能驱动方法铺平了道路，通过提供适合法庭的个性化、可辩护的证据，改变了法医学中的性别估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Interpretable machine learning for individualized sex estimation from long bones.

Sex estimation is an essential task in forensic anthropology. It is not only crucial for the identification of individuals from skeletal remains, but it is also essential for improving the reliability of other methods of biological profile estimation, such as age and stature, some of which perform better when sex is taken into account. This study investigates the application of machine learning (ML) techniques to sex estimation, with a particular focus on interpretability to address the "black box" challenge inherent in AI models. Using a diverse dataset of long bone measurements from 2,969 individuals, 12 different ML algorithms were evaluated. Missing data were handled using iterative regression imputation, though challenges arising from incomplete datasets underscored the need for improved data handling strategies. Linear Discriminant Analysis (LDA) emerged as the most accurate approach, achieving 95.2% accuracy. A key feature of this study is the integration of SHapley Additive exPlanations (SHAP) values, which provide individualized insights into the factors influencing each prediction. This interpretability framework ensures transparency and addresses legal and scientific concerns about the admissibility of AI-generated evidence in court. Indeed, misclassifications possibilities highlight the importance of clear, understandable models in forensic applications. The study emphasizes the significance of individualized prediction, illustrated by the probability of male or female classification for each individual, as well as the impact of missing values on prediction accuracy. This research demonstrates that ML models can effectively balance accuracy with interpretability, offering personalized, actionable insights for forensic investigations. It paves the way for AI-driven methods that meet both scientific rigor and legal standards, transforming sex estimation in forensic science by providing individualized, defensible evidence suitable for court.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Legal Medicine 医学-医学：法

CiteScore

5.80

自引率

9.50%

发文量

165

审稿时长

1 months

期刊介绍： The International Journal of Legal Medicine aims to improve the scientific resources used in the elucidation of crime and related forensic applications at a high level of evidential proof. The journal offers review articles tracing development in specific areas, with up-to-date analysis; original articles discussing significant recent research results; case reports describing interesting and exceptional examples; population data; letters to the editors; and technical notes, which appear in a section originally created for rapid publication of data in the dynamic field of DNA analysis.