YHP: Y-chromosome Haplogroup Predictor for predicting male lineages based on Y-STRs

IF 2.2 | Zone 3 (Medicine) | JCR Q1: MEDICINE, LEGAL
Mengyuan Song, Yuxiang Zhou, Chenxi Zhao, Feng Song, Yiping Hou
Journal: Forensic Science International
Publication date: 2024-06-18
DOI: 10.1016/j.forsciint.2024.112113
Full text: https://www.sciencedirect.com/science/article/pii/S0379073824001944
Citations: 0

Abstract

The human Y chromosome reflects the evolutionary process of males. Tracing male lineages through the Y chromosome is of great use in evolutionary, forensic, and anthropological studies. Identifying a male lineage from the specific distribution of Y haplogroups narrows the scope of an investigation, an approach that has been used in forensic scenarios. However, although existing software aids familial searching by using Y-STRs (Y-chromosome short tandem repeats) to predict Y-SNP (Y-chromosome single nucleotide polymorphism) haplogroups, it often lacks resolution. In this study, we developed YHP (Y Haplogroup Predictor), a novel software tool that offers high-resolution haplogroup inference without requiring extensive Y-SNP sequencing. Leveraging existing datasets (219 haplogroups, 4064 samples in total) and a random forest algorithm, YHP predicts haplogroups with an accuracy of 0.923 at the highest haplogroup resolution. YHP, available on GitHub (https://github.com/cissy123/YHP-Y-Haplogroup-Predictor-), supports high-resolution haplogroup prediction, haplotype mismatch analysis, and haplotype similarity comparison. Notably, it performs well in East Asian populations, benefiting from training data drawn from eight distinct East Asian ethnic populations. Moreover, it allows additional training sets to be integrated, extending its utility to diverse populations.
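
To make the idea of haplotype mismatch analysis and similarity comparison concrete, the following is a minimal sketch and not YHP's actual implementation. It assumes a Y-STR haplotype can be represented as a mapping from locus name to repeat count. The function names, the similarity definition (fraction of shared loci with identical alleles), and the allele values are all illustrative assumptions.

```python
from typing import Dict, List

# Assumption: a haplotype is a mapping of Y-STR locus name -> repeat count.
Haplotype = Dict[str, int]


def mismatched_loci(a: Haplotype, b: Haplotype) -> List[str]:
    """Return the shared loci (sorted by name) where the alleles differ."""
    shared = sorted(set(a) & set(b))
    return [locus for locus in shared if a[locus] != b[locus]]


def step_distance(a: Haplotype, b: Haplotype) -> int:
    """Sum of absolute repeat-count differences over the shared loci."""
    return sum(abs(a[locus] - b[locus]) for locus in set(a) & set(b))


def similarity(a: Haplotype, b: Haplotype) -> float:
    """Fraction of shared loci carrying identical alleles (hypothetical metric)."""
    shared = set(a) & set(b)
    if not shared:
        raise ValueError("haplotypes share no loci")
    return 1 - len(mismatched_loci(a, b)) / len(shared)


# Made-up allele values, for illustration only.
query = {"DYS19": 15, "DYS389I": 12, "DYS390": 24,
         "DYS391": 10, "DYS392": 13, "DYS393": 12}
reference = {"DYS19": 15, "DYS389I": 13, "DYS390": 24,
             "DYS391": 10, "DYS392": 11, "DYS393": 12}

print(mismatched_loci(query, reference))  # ['DYS389I', 'DYS392']
print(step_distance(query, reference))    # 3
print(round(similarity(query, reference), 3))  # 0.667
```

Counting mismatched loci and summing repeat-count differences are two common ways to compare Y-STR haplotypes; which measure YHP itself uses is not stated in the abstract.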

Source journal: Forensic Science International (category: Medicine, Legal)
CiteScore: 5.00
Self-citation rate: 9.10%
Articles published: 285
Review time: 49 days
About the journal: Forensic Science International is the flagship journal in the prestigious Forensic Science International family, publishing the most innovative, cutting-edge, and influential contributions across the forensic sciences. Fields include: forensic pathology and histochemistry, chemistry, biochemistry and toxicology, biology, serology, odontology, psychiatry, anthropology, digital forensics, the physical sciences, firearms, and document examination, as well as investigations of value to public health in its broadest sense, and the important marginal area where science and medicine interact with the law. The journal publishes: Case Reports, Commentaries, Letters to the Editor, Original Research Papers (Regular Papers), Rapid Communications, Review Articles, and Technical Notes.