Toward minimal SNP sets for record-matching with CODIS STR profiles.
Authors: Tamara Gjorgjieva, Noah A Rosenberg
European Journal of Human Genetics, published 2025-09-22. DOI: 10.1038/s41431-025-01941-7 (https://doi.org/10.1038/s41431-025-01941-7)
Genetic record-matching is a technique by which profiles with one set of genetic markers can be queried against databases of profiles with a different set of markers to determine whether profiles containing different marker sets trace to the same individual. In forensic genetics, using genetic record-matching to test single-nucleotide polymorphism (SNP) profiles for matches to short-tandem repeat (STR) profiles could enable development of backward-compatible SNP marker systems that ultimately replace existing forensic STR systems. This study aims to identify minimal SNP sets that achieve record-matching accuracies comparable to those previously observed with tens or hundreds of thousands of SNPs. Using phased SNP-STR reference data in a worldwide panel of individuals, we evaluate record-matching accuracy with SNP sets chosen by a variety of SNP selection strategies. When SNPs are selected randomly, ~9000 SNPs are required to achieve record-matching accuracy comparable to that seen with the full SNP set in the "needle-in-haystack" matching scenario, namely a median accuracy of 99% of SNP and STR profiles correctly paired with no false-positive identifications, for test sets of 626 profile pairs. When SNPs are instead selected using thresholds on their minimal minor allele frequency and physical distance to the STR, panels of 1800 SNPs, and as few as 900 SNPs, suffice. These results advance toward a potential minimal size for backward-compatible forensic SNP systems that proceed by genetic record-matching.
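The threshold-based selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Snp` record, the `select_snps` function, the tie-breaking by distance, the optional `panel_size` cap, and all numeric values below are hypothetical, since the abstract does not state the thresholds or the selection procedure in detail.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Snp:
    """Hypothetical SNP record used only for this illustration."""
    snp_id: str
    position: int  # base-pair coordinate on the same chromosome as the STR
    maf: float     # minor allele frequency in the reference panel


def select_snps(snps, str_position, min_maf, max_distance, panel_size=None):
    """Keep SNPs with MAF >= min_maf that lie within max_distance bp of the STR.

    Survivors are ordered by distance to the STR (closest first), and the
    list is optionally truncated to panel_size. The ordering and truncation
    are assumptions of this sketch, not details taken from the abstract.
    """
    kept = [
        s for s in snps
        if s.maf >= min_maf and abs(s.position - str_position) <= max_distance
    ]
    kept.sort(key=lambda s: (abs(s.position - str_position), s.snp_id))
    return kept if panel_size is None else kept[:panel_size]


# Made-up candidates around a made-up STR at position 1,000,000.
candidates = [
    Snp("rsA", 1_000_500, 0.32),  # close, common: kept
    Snp("rsB", 1_250_000, 0.45),  # common but too far: dropped
    Snp("rsC", 1_010_000, 0.02),  # close but too rare: dropped
    Snp("rsD", 995_000, 0.18),    # close, common: kept
]

panel = select_snps(candidates, str_position=1_000_000,
                    min_maf=0.05, max_distance=100_000)
print([s.snp_id for s in panel])  # ['rsA', 'rsD']
```

Raising `min_maf` or lowering `max_distance` shrinks the panel. The abstract reports that selection along these two axes brings the required panel size down from ~9000 randomly chosen SNPs to 1800, and as few as 900.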
About the journal:
The European Journal of Human Genetics is the official journal of the European Society of Human Genetics, publishing high-quality, original research papers, short reports and reviews in the rapidly expanding field of human genetics and genomics. It covers molecular, clinical and cytogenetics, interfacing between advanced biomedical research and the clinician, and bridging the great diversity of facilities, resources and viewpoints in the genetics community.
Key areas include:
-Monogenic and multifactorial disorders
-Development and malformation
-Hereditary cancer
-Medical Genomics
-Gene mapping and functional studies
-Genotype-phenotype correlations
-Genetic variation and genome diversity
-Statistical and computational genetics
-Bioinformatics
-Advances in diagnostics
-Therapy and prevention
-Animal models
-Genetic services
-Community genetics