Comprehensive evaluations of individual discrimination, kinship analysis, genetic relationship exploration and biogeographic origin prediction in Chinese Dongxiang group by a 60-plex DIP panel.

IF 2.7 3区 生物学
Man Chen, Wei Cui, Xiaole Bai, Yating Fang, Hongbin Yao, Xingru Zhang, Fanzhang Lei, Bofeng Zhu
{"title":"Comprehensive evaluations of individual discrimination, kinship analysis, genetic relationship exploration and biogeographic origin prediction in Chinese Dongxiang group by a 60-plex DIP panel.","authors":"Man Chen,&nbsp;Wei Cui,&nbsp;Xiaole Bai,&nbsp;Yating Fang,&nbsp;Hongbin Yao,&nbsp;Xingru Zhang,&nbsp;Fanzhang Lei,&nbsp;Bofeng Zhu","doi":"10.1186/s41065-023-00271-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Dongxiang group, as an important minority, resides in Gansu province which is located at the northwest China, forensic detection system with more loci needed to be studied to improve the application efficiency of forensic case investigation in this group.</p><p><strong>Methods: </strong>A 60-plex system including 57 autosomal deletion/insertion polymorphisms (A-DIPs), 2 Y chromosome DIPs (Y-DIPs) and the sex determination locus (Amelogenin) was explored to evaluate the forensic application efficiencies of individual discrimination, kinship analysis and biogeographic origin prediction in Gansu Dongxiang group based on the 60-plex genotype results of 233 unrelated Dongxiang individuals. The 60-plex genotype results of 4582 unrelated individuals from 33 reference populations in five different continents were also collected to analyze the genetic background of Dongxiang group and its genetic relationships with other continental populations.</p><p><strong>Results: </strong>The system showed high individual discrimination power, as the cumulative power of discrimination (CPD), cumulative power of exclusion (CPE) for trio and cumulative match probability (CMP) values were 0.99999999999999999999997297, 0.999980 and 2.7029E<sup>- 24</sup>, respectively. The system could distinguish 98.12%, 93.78%, 82.18%, 62.35% and 39.32% of full sibling pairs from unrelated individual pairs, when the likelihood ratio (LR) limits were set as 1, 10, 100, 1000 and 10,000 based on the simulated family samples, respectively. Additionally, Dongxiang group had the close genetic distances with populations in East Asia, especially showed the intimate genetic relationships with Chinese Han populations, which were concluded from the genetic affinities and genetic background analyses of Dongxiang group and 33 reference populations. In terms of the effectiveness of biogeographic origin inference, different artificial intelligent algorithms possessed different efficacies. Among them, the random forest (RF) and extreme gradient boosting (XGBoost) algorithm models could accurately predict the biogeographic origins of 99.7% and 90.59% of three and five continental individuals, respectively.</p><p><strong>Conclusion: </strong>This 60-plex system had good performance for individual discrimination, kinship analysis and biogeographic origin prediction in Dongxiang group, which could be used as a powerful tool for case investigation.</p>","PeriodicalId":12862,"journal":{"name":"Hereditas","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2023-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10052841/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hereditas","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s41065-023-00271-2","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Dongxiang group, as an important minority, resides in Gansu province which is located at the northwest China, forensic detection system with more loci needed to be studied to improve the application efficiency of forensic case investigation in this group.

Methods: A 60-plex system including 57 autosomal deletion/insertion polymorphisms (A-DIPs), 2 Y chromosome DIPs (Y-DIPs) and the sex determination locus (Amelogenin) was explored to evaluate the forensic application efficiencies of individual discrimination, kinship analysis and biogeographic origin prediction in Gansu Dongxiang group based on the 60-plex genotype results of 233 unrelated Dongxiang individuals. The 60-plex genotype results of 4582 unrelated individuals from 33 reference populations in five different continents were also collected to analyze the genetic background of Dongxiang group and its genetic relationships with other continental populations.

Results: The system showed high individual discrimination power, as the cumulative power of discrimination (CPD), cumulative power of exclusion (CPE) for trio and cumulative match probability (CMP) values were 0.99999999999999999999997297, 0.999980 and 2.7029E- 24, respectively. The system could distinguish 98.12%, 93.78%, 82.18%, 62.35% and 39.32% of full sibling pairs from unrelated individual pairs, when the likelihood ratio (LR) limits were set as 1, 10, 100, 1000 and 10,000 based on the simulated family samples, respectively. Additionally, Dongxiang group had the close genetic distances with populations in East Asia, especially showed the intimate genetic relationships with Chinese Han populations, which were concluded from the genetic affinities and genetic background analyses of Dongxiang group and 33 reference populations. In terms of the effectiveness of biogeographic origin inference, different artificial intelligent algorithms possessed different efficacies. Among them, the random forest (RF) and extreme gradient boosting (XGBoost) algorithm models could accurately predict the biogeographic origins of 99.7% and 90.59% of three and five continental individuals, respectively.

Conclusion: This 60-plex system had good performance for individual discrimination, kinship analysis and biogeographic origin prediction in Dongxiang group, which could be used as a powerful tool for case investigation.

Abstract Image

Abstract Image

Abstract Image

60 plex DIP面板对中国东乡族个体歧视、亲缘关系分析、亲缘关系探索和生物地理起源预测的综合评价。
背景:东乡族作为中国西北省份甘肃省的重要少数民族,为提高东乡族在司法案件调查中的应用效率,需要研究具有更多位点的司法鉴定系统。方法:基于233个无亲缘关系的东乡族个体60 plex基因型分析结果,构建包含57个常染色体缺失/插入多态性(A-DIPs)、2个Y染色体缺失/插入多态性(Y-DIPs)和性别决定位点(Amelogenin)的60 plex系统,评价甘肃东乡族个体鉴别、亲缘关系分析和生物地理来源预测的法医学应用效率。收集5个大陆33个参考群体4582个无亲缘关系个体的60 plex基因型,分析东乡群体的遗传背景及其与其他大陆群体的亲缘关系。结果:系统具有较高的个体辨析能力,三人组的累积辨析能力(CPD)、累积排除能力(CPE)和累积匹配概率(CMP)值分别为0.999999999999999999999999997297、0.999980和2.7029E- 24。在模拟家族样本的似然比(LR)限值分别为1、10、100、1000和10000时,系统能区分出98.12%、93.78%、82.18%、62.35%和39.32%的全同胞对和无亲缘关系个体对。通过对东乡类群与33个参考居群的遗传亲和和遗传背景分析,发现东乡类群与东亚地区居群的遗传距离较近,特别是与中国汉族居群的遗传亲缘关系较近。在生物地理起源推理的有效性方面,不同的人工智能算法具有不同的有效性。其中,随机森林(RF)和极端梯度增强(XGBoost)算法模型对3个和5个大陆个体的生物地理起源预测准确率分别为99.7%和90.59%。结论:该系统在东乡类群个体鉴别、亲缘关系分析、生物地理来源预测等方面具有较好的效果,可作为病例调查的有力工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Hereditas
Hereditas Biochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.80
自引率
3.70%
发文量
0
期刊介绍: For almost a century, Hereditas has published original cutting-edge research and reviews. As the Official journal of the Mendelian Society of Lund, the journal welcomes research from across all areas of genetics and genomics. Topics of interest include human and medical genetics, animal and plant genetics, microbial genetics, agriculture and bioinformatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信