Shubo Wen, Man Chen, Qianqian Kong, Ao Huang, Ziwei Wang, Qiannan Xu, Xiling Liu
{"title":"Systematically improving the efficiency of complex kinship identification through the addition of genetic markers and reference individuals.","authors":"Shubo Wen, Man Chen, Qianqian Kong, Ao Huang, Ziwei Wang, Qiannan Xu, Xiling Liu","doi":"10.1007/s00414-025-03473-7","DOIUrl":null,"url":null,"abstract":"<p><p>There is a growing demand for complex kinship identification due to the increasing complexity of social structures. However, the system efficiency of complex kinship identification systems, particularly for identifying second-degree and higher kinship relationships, needs improvement. This study aimed to further explore the selection of introduced relatives and genetic markers for second- and third-degree kinship identification by comparing the statistical parameters, such as likelihood ratio (LR), effectiveness, and error rate, of four different detection systems (19 STRs, 39 STRs, 55 STRs, and 55 STRs + 94 SNPs) in 109 common kinship identification scenarios. The study found that increasing the number of genetic markers or introducing more reference individuals generally enhanced the system efficiency for identifying complex genetic relationships by introducing various reference individuals into three-generation family lines to identify second- and third-degree kinships. The results were validated using true pedigree samples. Ranking the system efficiencies of kinship identification systems with different combinations of reference individuals and genetic markers provided insight into reference selection priorities and the optimal number of loci required. For second-degree relatives, adding one relative to the 39 STR marker set resulted in an effectiveness of over 90% with an error rate of less than 0.0005, while adding two known relatives achieved an effectiveness of over 95% with an error rate of less than 0.0001. Certain models could completely discriminate between related and unrelated individuals when using 55 STRs. For third-degree relatives, incorporating two known relatives in the 55 STR marker set resulted in an effectiveness of over 90% with an error rate of less than 0.0001 in most scenarios. In addition, when using both 55 STRs and 94 SNPs, a total of five models used for first-cousin relationships, that incorporated two reference individuals within second-degree relatives of both subjects, could completely distinguish between related and unrelated individuals. This study not only offers theoretical insights for complex kinship identification but also provides a valuable reference that can significantly improve the practice of identifying second- and third-degree kinships.</p>","PeriodicalId":14071,"journal":{"name":"International Journal of Legal Medicine","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Legal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00414-025-03473-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0
Abstract
There is a growing demand for complex kinship identification due to the increasing complexity of social structures. However, the system efficiency of complex kinship identification systems, particularly for identifying second-degree and higher kinship relationships, needs improvement. This study aimed to further explore the selection of introduced relatives and genetic markers for second- and third-degree kinship identification by comparing the statistical parameters, such as likelihood ratio (LR), effectiveness, and error rate, of four different detection systems (19 STRs, 39 STRs, 55 STRs, and 55 STRs + 94 SNPs) in 109 common kinship identification scenarios. The study found that increasing the number of genetic markers or introducing more reference individuals generally enhanced the system efficiency for identifying complex genetic relationships by introducing various reference individuals into three-generation family lines to identify second- and third-degree kinships. The results were validated using true pedigree samples. Ranking the system efficiencies of kinship identification systems with different combinations of reference individuals and genetic markers provided insight into reference selection priorities and the optimal number of loci required. For second-degree relatives, adding one relative to the 39 STR marker set resulted in an effectiveness of over 90% with an error rate of less than 0.0005, while adding two known relatives achieved an effectiveness of over 95% with an error rate of less than 0.0001. Certain models could completely discriminate between related and unrelated individuals when using 55 STRs. For third-degree relatives, incorporating two known relatives in the 55 STR marker set resulted in an effectiveness of over 90% with an error rate of less than 0.0001 in most scenarios. In addition, when using both 55 STRs and 94 SNPs, a total of five models used for first-cousin relationships, that incorporated two reference individuals within second-degree relatives of both subjects, could completely distinguish between related and unrelated individuals. This study not only offers theoretical insights for complex kinship identification but also provides a valuable reference that can significantly improve the practice of identifying second- and third-degree kinships.
期刊介绍:
The International Journal of Legal Medicine aims to improve the scientific resources used in the elucidation of crime and related forensic applications at a high level of evidential proof. The journal offers review articles tracing development in specific areas, with up-to-date analysis; original articles discussing significant recent research results; case reports describing interesting and exceptional examples; population data; letters to the editors; and technical notes, which appear in a section originally created for rapid publication of data in the dynamic field of DNA analysis.