Tesfaye K Belay, Arne B Gjuvsland, Janez Jenko, Leiv S Eikje, Morten Svendsen, Theo Meuwissen
{"title":"Single-Step Genomic BLUP With Unknown Parent Groups and Metafounders in Norwegian Red Evaluations.","authors":"Tesfaye K Belay, Arne B Gjuvsland, Janez Jenko, Leiv S Eikje, Morten Svendsen, Theo Meuwissen","doi":"10.1111/jbg.12939","DOIUrl":null,"url":null,"abstract":"<p><p>The objective of this study was to examine the effects of different methods for handling missing pedigree data on biases, stability, relative increase in accuracy, and genetic trends using national data from Norwegian Red (NRF) cattle. The dataset comprised 8,402,773 milk yield records from 3,896,116 NRF cows, a pedigree with 4,957,544 animals, and a genomic dataset from 170,293 animals with 121,741 SNPs. Missing parents were modelled using three approaches: unknown parent groups (UPG), metafounders (MF), and \"Q-Q<sup>+</sup>\" methods. The UPG method is routinely used for genetic evaluations of NRF cattle by including 52 fixed UPG in the pedigree. In the MF method, two MF were defined: MF14 and MF52, with MF treated as random effects. The MF14 included 6 MF defined by birth year intervals for NRF breed and 8 MF defined by breed origins for other breeds. The MF52 classification included all the 52 UPG as MF considering relationships among them. The \"Q-Q<sup>+</sup>\" approach corrects for the combined effects of UPG and \"J factor\" in non-genotyped animals while avoiding such corrections in genotyped animals. The three approaches, combined with different G matrices (G<sub>rtn</sub> matrix constructed with a 0.5 allele frequency (AF) and 10% weight (w) on A, G<sub>05</sub> constructed using AF = 0.5 and w = 0.0, and G<sub>cal</sub> constructed with observed AF and w = 0.0), led to eight ssGBLUP models being tested. This included one UPG model (using G<sub>rtn</sub>), four MF models (MF14 and MF52 using G<sub>rtn</sub> or G<sub>05</sub>), and three Q-Q+ models (using G<sub>cal</sub>, G<sub>05</sub>, or G<sub>rtn</sub>). The models were evaluated through cross-validation by masking the phenotypes of 5000 genotyped young cows. Results showed that the Q-Q<sup>+</sup> models using the G<sub>cal</sub> or G<sub>05</sub> matrix had significantly (p < 0.05) lower level biases and higher genetic trends than all other models. MF models with 14 or 52 groups using G<sub>05</sub> were second best for level bias and performed similarly or slightly better than Q-Q+ models regarding inflation bias and stability. Increasing the number of MF from 14 to 52 had minimal effects on biases but significantly improved stability and genetic trend estimates. Models with G<sub>rtn</sub> had slightly higher gain in accuracy from adding phenotypic data (2.01%) than G<sub>05</sub> (1.18%), but pedigree-based models showed the highest improvement in accuracy due to adding phenotypic (26%) or genomic (47%) data to the partial dataset. Overall, all models with G<sub>05</sub> showed the least bias (with a small standard error) and most stable predictions, while models using G<sub>rtn</sub> introduced biases and instability. Thus, the Q-Q<sup>+</sup> and MF models combined with G<sub>05</sub> and Q-Q<sup>+</sup> with G<sub>cal</sub> are recommended for their improved validation results and genetic trends.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/jbg.12939","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
Single-Step Genomic BLUP With Unknown Parent Groups and Metafounders in Norwegian Red Evaluations.
The objective of this study was to examine the effects of different methods for handling missing pedigree data on biases, stability, relative increase in accuracy, and genetic trends using national data from Norwegian Red (NRF) cattle. The dataset comprised 8,402,773 milk yield records from 3,896,116 NRF cows, a pedigree with 4,957,544 animals, and a genomic dataset from 170,293 animals with 121,741 SNPs. Missing parents were modelled using three approaches: unknown parent groups (UPG), metafounders (MF), and "Q-Q+" methods. The UPG method is routinely used for genetic evaluations of NRF cattle by including 52 fixed UPG in the pedigree. In the MF method, two MF were defined: MF14 and MF52, with MF treated as random effects. The MF14 included 6 MF defined by birth year intervals for NRF breed and 8 MF defined by breed origins for other breeds. The MF52 classification included all the 52 UPG as MF considering relationships among them. The "Q-Q+" approach corrects for the combined effects of UPG and "J factor" in non-genotyped animals while avoiding such corrections in genotyped animals. The three approaches, combined with different G matrices (Grtn matrix constructed with a 0.5 allele frequency (AF) and 10% weight (w) on A, G05 constructed using AF = 0.5 and w = 0.0, and Gcal constructed with observed AF and w = 0.0), led to eight ssGBLUP models being tested. This included one UPG model (using Grtn), four MF models (MF14 and MF52 using Grtn or G05), and three Q-Q+ models (using Gcal, G05, or Grtn). The models were evaluated through cross-validation by masking the phenotypes of 5000 genotyped young cows. Results showed that the Q-Q+ models using the Gcal or G05 matrix had significantly (p < 0.05) lower level biases and higher genetic trends than all other models. MF models with 14 or 52 groups using G05 were second best for level bias and performed similarly or slightly better than Q-Q+ models regarding inflation bias and stability. Increasing the number of MF from 14 to 52 had minimal effects on biases but significantly improved stability and genetic trend estimates. Models with Grtn had slightly higher gain in accuracy from adding phenotypic data (2.01%) than G05 (1.18%), but pedigree-based models showed the highest improvement in accuracy due to adding phenotypic (26%) or genomic (47%) data to the partial dataset. Overall, all models with G05 showed the least bias (with a small standard error) and most stable predictions, while models using Grtn introduced biases and instability. Thus, the Q-Q+ and MF models combined with G05 and Q-Q+ with Gcal are recommended for their improved validation results and genetic trends.
期刊介绍:
The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.