A new robust AI/ML based model for accurate forensic age estimation using DNA methylation markers.

IF 1.5 4区 医学 Q2 MEDICINE, LEGAL
Jinsu Ann Mathew, Geetha Paul, Joe Jacob, Janesh Kumar, Neelima Dubey, Ninan Sajeeth Philip
{"title":"A new robust AI/ML based model for accurate forensic age estimation using DNA methylation markers.","authors":"Jinsu Ann Mathew, Geetha Paul, Joe Jacob, Janesh Kumar, Neelima Dubey, Ninan Sajeeth Philip","doi":"10.1007/s12024-025-00985-x","DOIUrl":null,"url":null,"abstract":"<p><p>CpG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the 5' → 3' direction. Epigenetic markers based on methylation values at CpG sites are valuable for accurate age prediction and have become essential in forensic science, supporting criminal investigations and human identification. The present study identified 12 CpG sites from a collection of 476,366 CpG sites based on the following criteria: (a) CpG sites were retained if the Pearson correlation coefficient between the methylation values and the chronological age of the individual is greater than 0.85, and (b) if the mutual correlation coefficient between a pair of selected CpG sites is greater than 0.15, only one of them is retained. The identified CpG sites are associated with genes FHL2, ELOVL2, TRIM59, PCDHB1, KLF14, C1orf132, ACSS3, and CCDC102B. To ensure that the predictive accuracy is intrinsic to the selected CpG sites and not model dependent, the identified CpG sites were passed to three different Neural network models. All models achieved comparable accuracy across diverse populations, genders, and health conditions. The model's accuracy and reliability were validated through age predictions on independent datasets. By utilizing a minimal set of CpG sites, this approach offers a robust and efficient solution for forensic age estimation, significantly enhancing the precision and reliability of forensic investigations.</p>","PeriodicalId":12449,"journal":{"name":"Forensic Science, Medicine and Pathology","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic Science, Medicine and Pathology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12024-025-00985-x","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0

Abstract

CpG sites are regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the 5' → 3' direction. Epigenetic markers based on methylation values at CpG sites are valuable for accurate age prediction and have become essential in forensic science, supporting criminal investigations and human identification. The present study identified 12 CpG sites from a collection of 476,366 CpG sites based on the following criteria: (a) CpG sites were retained if the Pearson correlation coefficient between the methylation values and the chronological age of the individual is greater than 0.85, and (b) if the mutual correlation coefficient between a pair of selected CpG sites is greater than 0.15, only one of them is retained. The identified CpG sites are associated with genes FHL2, ELOVL2, TRIM59, PCDHB1, KLF14, C1orf132, ACSS3, and CCDC102B. To ensure that the predictive accuracy is intrinsic to the selected CpG sites and not model dependent, the identified CpG sites were passed to three different Neural network models. All models achieved comparable accuracy across diverse populations, genders, and health conditions. The model's accuracy and reliability were validated through age predictions on independent datasets. By utilizing a minimal set of CpG sites, this approach offers a robust and efficient solution for forensic age estimation, significantly enhancing the precision and reliability of forensic investigations.

使用DNA甲基化标记准确估计法医年龄的新的鲁棒AI/ML模型。
CpG位点是DNA的一个区域,其中一个胞嘧啶核苷酸在5‘→3’方向上紧随一个鸟嘌呤核苷酸。基于CpG位点甲基化值的表观遗传标记对于准确的年龄预测是有价值的,并且在法医学、支持刑事调查和人类身份识别方面已经成为必不可少的。本研究根据以下标准从476,366个CpG位点中鉴定出12个CpG位点:(a)如果甲基化值与个体实年年龄之间的Pearson相关系数大于0.85,则CpG位点被保留;(b)如果一对选择的CpG位点之间的相互相关系数大于0.15,则仅保留其中一个。所鉴定的CpG位点与基因FHL2、ELOVL2、TRIM59、PCDHB1、KLF14、C1orf132、ACSS3和CCDC102B相关。为了确保预测精度与所选择的CpG位点有关,而不依赖于模型,将识别的CpG位点传递给三种不同的神经网络模型。所有模型在不同人群、性别和健康状况下都达到了相当的准确性。通过独立数据集的年龄预测,验证了模型的准确性和可靠性。通过利用最小的CpG站点集,该方法为法医年龄估计提供了一个强大而有效的解决方案,显着提高了法医调查的准确性和可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Forensic Science, Medicine and Pathology
Forensic Science, Medicine and Pathology MEDICINE, LEGAL-PATHOLOGY
CiteScore
3.90
自引率
5.60%
发文量
114
审稿时长
6-12 weeks
期刊介绍: Forensic Science, Medicine and Pathology encompasses all aspects of modern day forensics, equally applying to children or adults, either living or the deceased. This includes forensic science, medicine, nursing, and pathology, as well as toxicology, human identification, mass disasters/mass war graves, profiling, imaging, policing, wound assessment, sexual assault, anthropology, archeology, forensic search, entomology, botany, biology, veterinary pathology, and DNA. Forensic Science, Medicine, and Pathology presents a balance of forensic research and reviews from around the world to reflect modern advances through peer-reviewed papers, short communications, meeting proceedings and case reports.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信