Yuhan Hu, Xuan Dai, Haoyu Wang, Yifan Wei, Yuntao Cai, Chun Yang, Qiang Zhu, Ji Zhang
{"title":"Population substructure affects kinship testing in multi-ethnic areas of China.","authors":"Yuhan Hu, Xuan Dai, Haoyu Wang, Yifan Wei, Yuntao Cai, Chun Yang, Qiang Zhu, Ji Zhang","doi":"10.1007/s00414-025-03572-5","DOIUrl":null,"url":null,"abstract":"<p><p>The likelihood ratio (LR) is a recommended metric for assessing the strength of genetic information in relationship testing, one of the most important tasks in forensic science. LR calculation incorporate population frequencies, which is affected by population substructure. This study utilized population frequency data from 18 short tandem repeat (STR) loci across 13 Chinese populations, encompassing both majority and minority ethnic groups. Six kinship types were constructed for each population. To understand the impact of population substructure on kinship testing, LRs were calculated using various frequency data: population-specific allele frequencies, national allele frequencies, and national allele frequencies adjusted with overall national F<sub>ST</sub> or population-specific F<sub>ST</sub>. LRs were also compared using the cutoff and comparison methods. The study found that LRs calculated using national allele frequencies tend to be the largest, which could overestimate the degree of relatedness compared to population-specific allele frequencies. Fst correction decreased the LR values, resulting in more conservative outcomes and suggested more distant relationships. While the F<sub>ST</sub> correction had a minimal effect on the majority and some minority populations across different kinships, it was insufficiently conservative for more isolated minority populations when the overall national F<sub>ST</sub> was applied. In conclusion, for isolated subpopulations with F<sub>ST</sub> values above the national average, utilizing population-specific allele frequencies and applying higher F<sub>ST</sub> values (e.g. 0.03 or 0.05) leads to more accurate and conservative inferences of relatedness. In contrast, for other groups, national frequencies without F<sub>ST</sub> correction appear sufficient for relationship testing.</p>","PeriodicalId":14071,"journal":{"name":"International Journal of Legal Medicine","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Legal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00414-025-03572-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0
Abstract
The likelihood ratio (LR) is a recommended metric for assessing the strength of genetic information in relationship testing, one of the most important tasks in forensic science. LR calculation incorporate population frequencies, which is affected by population substructure. This study utilized population frequency data from 18 short tandem repeat (STR) loci across 13 Chinese populations, encompassing both majority and minority ethnic groups. Six kinship types were constructed for each population. To understand the impact of population substructure on kinship testing, LRs were calculated using various frequency data: population-specific allele frequencies, national allele frequencies, and national allele frequencies adjusted with overall national FST or population-specific FST. LRs were also compared using the cutoff and comparison methods. The study found that LRs calculated using national allele frequencies tend to be the largest, which could overestimate the degree of relatedness compared to population-specific allele frequencies. Fst correction decreased the LR values, resulting in more conservative outcomes and suggested more distant relationships. While the FST correction had a minimal effect on the majority and some minority populations across different kinships, it was insufficiently conservative for more isolated minority populations when the overall national FST was applied. In conclusion, for isolated subpopulations with FST values above the national average, utilizing population-specific allele frequencies and applying higher FST values (e.g. 0.03 or 0.05) leads to more accurate and conservative inferences of relatedness. In contrast, for other groups, national frequencies without FST correction appear sufficient for relationship testing.
期刊介绍:
The International Journal of Legal Medicine aims to improve the scientific resources used in the elucidation of crime and related forensic applications at a high level of evidential proof. The journal offers review articles tracing development in specific areas, with up-to-date analysis; original articles discussing significant recent research results; case reports describing interesting and exceptional examples; population data; letters to the editors; and technical notes, which appear in a section originally created for rapid publication of data in the dynamic field of DNA analysis.