{"title":"Genomic estimates of Identity-By-Descent relationships in large scale data sets.","authors":"Theo Meuwisen,Xijiang Yu,Peer Berg","doi":"10.1186/s12711-026-01038-9","DOIUrl":null,"url":null,"abstract":"BACKGROUND\r\nGenomic relationship and inbreeding estimates are either based on genetic drift (e.g. the Genomic Relationship Matrix; GRM), homozygosity (e.g. Runs of Homozygosity; ROH), or Identity-By-Descent (IBD). A genomic IBD-based relationship matrix, Gla, is obtained by linkage analysis which uses genomic data to distinguish paternal versus maternal inheritances of chromosomal segments to replace the 50/50 probabilities used to calculate pedigree-based relationships (A matrix). Our aim was to develop a fast approximate algorithm, FGla, to estimate the Gla matrix in large complex pedigrees making use of dense marker genotypes, and to compare Gla to A, GRM and ROH based inbreeding (FROH) in simulated and a large scale Norwegian Red Cattle (NRF) data set.\r\n\r\nRESULTS\r\nGiven pedigree data and ≥ 3 generations of 45 k marker genotypes, marker positions were detected that unambiguously identified maternal/paternal inheritance, and inheritances at intermediate positions were imputed by the Viterbi algorithm from the positions with known inheritance. Any remaining unknown inheritances were randomly sampled (paternal or maternal), and the sampling errors that this introduced were averaged out by the large number of marker loci used (correlation between replicated estimates: 0.9998). Also, calculations were limited to the relationship coefficients that were actually needed, assuming that relationships for a limited set of candidates were needed. The accuracy of estimated Gla coefficients increased from 0.971 to 0.998, when genotyping increased from the actually genotyped NRF cattle towards all pedigreed animals. The accuracy of the GRM was 0.936, but required only genotyping of the animals whose relationships were needed. Gla relationships were approximately unbiased in the Best Linear Unbiased Prediction (BLUP) sense. Hence, if Gla based inbreeding management predicts an increase in relationships then an identical increase in true IBD relationships is expected. Gla uses the same base population as A, namely that of the pedigree.\r\n\r\nCONCLUSIONS\r\nAn approximate computationally efficient multipoint linkage analysis algorithm was developed to estimate unbiased IBD-based relationship and inbreeding coefficients. Its unbiasedness and precise definition of the base population makes it well suited for the genomic management of inbreeding and genomic optimal contribution selection. In addition, Gla based optimal contribution selection is neutral with respect to allele frequency changes.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"9 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2026-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetics Selection Evolution","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12711-026-01038-9","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
BACKGROUND
Genomic relationship and inbreeding estimates are either based on genetic drift (e.g. the Genomic Relationship Matrix; GRM), homozygosity (e.g. Runs of Homozygosity; ROH), or Identity-By-Descent (IBD). A genomic IBD-based relationship matrix, Gla, is obtained by linkage analysis which uses genomic data to distinguish paternal versus maternal inheritances of chromosomal segments to replace the 50/50 probabilities used to calculate pedigree-based relationships (A matrix). Our aim was to develop a fast approximate algorithm, FGla, to estimate the Gla matrix in large complex pedigrees making use of dense marker genotypes, and to compare Gla to A, GRM and ROH based inbreeding (FROH) in simulated and a large scale Norwegian Red Cattle (NRF) data set.
RESULTS
Given pedigree data and ≥ 3 generations of 45 k marker genotypes, marker positions were detected that unambiguously identified maternal/paternal inheritance, and inheritances at intermediate positions were imputed by the Viterbi algorithm from the positions with known inheritance. Any remaining unknown inheritances were randomly sampled (paternal or maternal), and the sampling errors that this introduced were averaged out by the large number of marker loci used (correlation between replicated estimates: 0.9998). Also, calculations were limited to the relationship coefficients that were actually needed, assuming that relationships for a limited set of candidates were needed. The accuracy of estimated Gla coefficients increased from 0.971 to 0.998, when genotyping increased from the actually genotyped NRF cattle towards all pedigreed animals. The accuracy of the GRM was 0.936, but required only genotyping of the animals whose relationships were needed. Gla relationships were approximately unbiased in the Best Linear Unbiased Prediction (BLUP) sense. Hence, if Gla based inbreeding management predicts an increase in relationships then an identical increase in true IBD relationships is expected. Gla uses the same base population as A, namely that of the pedigree.
CONCLUSIONS
An approximate computationally efficient multipoint linkage analysis algorithm was developed to estimate unbiased IBD-based relationship and inbreeding coefficients. Its unbiasedness and precise definition of the base population makes it well suited for the genomic management of inbreeding and genomic optimal contribution selection. In addition, Gla based optimal contribution selection is neutral with respect to allele frequency changes.
期刊介绍:
Genetics Selection Evolution invites basic, applied and methodological content that will aid the current understanding and the utilization of genetic variability in domestic animal species. Although the focus is on domestic animal species, research on other species is invited if it contributes to the understanding of the use of genetic variability in domestic animals. Genetics Selection Evolution publishes results from all levels of study, from the gene to the quantitative trait, from the individual to the population, the breed or the species. Contributions concerning both the biological approach, from molecular genetics to quantitative genetics, as well as the mathematical approach, from population genetics to statistics, are welcome. Specific areas of interest include but are not limited to: gene and QTL identification, mapping and characterization, analysis of new phenotypes, high-throughput SNP data analysis, functional genomics, cytogenetics, genetic diversity of populations and breeds, genetic evaluation, applied and experimental selection, genomic selection, selection efficiency, and statistical methodology for the genetic analysis of phenotypes with quantitative and mixed inheritance.