IF 1.9 3区 农林科学 Q2 AGRICULTURE, DAIRY & ANIMAL SCIENCE
Tesfaye K Belay, Arne B Gjuvsland, Janez Jenko, Leiv S Eikje, Morten Svendsen, Theo Meuwissen
{"title":"Single-Step Genomic BLUP With Unknown Parent Groups and Metafounders in Norwegian Red Evaluations.","authors":"Tesfaye K Belay, Arne B Gjuvsland, Janez Jenko, Leiv S Eikje, Morten Svendsen, Theo Meuwissen","doi":"10.1111/jbg.12939","DOIUrl":null,"url":null,"abstract":"<p><p>The objective of this study was to examine the effects of different methods for handling missing pedigree data on biases, stability, relative increase in accuracy, and genetic trends using national data from Norwegian Red (NRF) cattle. The dataset comprised 8,402,773 milk yield records from 3,896,116 NRF cows, a pedigree with 4,957,544 animals, and a genomic dataset from 170,293 animals with 121,741 SNPs. Missing parents were modelled using three approaches: unknown parent groups (UPG), metafounders (MF), and \"Q-Q<sup>+</sup>\" methods. The UPG method is routinely used for genetic evaluations of NRF cattle by including 52 fixed UPG in the pedigree. In the MF method, two MF were defined: MF14 and MF52, with MF treated as random effects. The MF14 included 6 MF defined by birth year intervals for NRF breed and 8 MF defined by breed origins for other breeds. The MF52 classification included all the 52 UPG as MF considering relationships among them. The \"Q-Q<sup>+</sup>\" approach corrects for the combined effects of UPG and \"J factor\" in non-genotyped animals while avoiding such corrections in genotyped animals. The three approaches, combined with different G matrices (G<sub>rtn</sub> matrix constructed with a 0.5 allele frequency (AF) and 10% weight (w) on A, G<sub>05</sub> constructed using AF = 0.5 and w = 0.0, and G<sub>cal</sub> constructed with observed AF and w = 0.0), led to eight ssGBLUP models being tested. This included one UPG model (using G<sub>rtn</sub>), four MF models (MF14 and MF52 using G<sub>rtn</sub> or G<sub>05</sub>), and three Q-Q+ models (using G<sub>cal</sub>, G<sub>05</sub>, or G<sub>rtn</sub>). The models were evaluated through cross-validation by masking the phenotypes of 5000 genotyped young cows. Results showed that the Q-Q<sup>+</sup> models using the G<sub>cal</sub> or G<sub>05</sub> matrix had significantly (p < 0.05) lower level biases and higher genetic trends than all other models. MF models with 14 or 52 groups using G<sub>05</sub> were second best for level bias and performed similarly or slightly better than Q-Q+ models regarding inflation bias and stability. Increasing the number of MF from 14 to 52 had minimal effects on biases but significantly improved stability and genetic trend estimates. Models with G<sub>rtn</sub> had slightly higher gain in accuracy from adding phenotypic data (2.01%) than G<sub>05</sub> (1.18%), but pedigree-based models showed the highest improvement in accuracy due to adding phenotypic (26%) or genomic (47%) data to the partial dataset. Overall, all models with G<sub>05</sub> showed the least bias (with a small standard error) and most stable predictions, while models using G<sub>rtn</sub> introduced biases and instability. Thus, the Q-Q<sup>+</sup> and MF models combined with G<sub>05</sub> and Q-Q<sup>+</sup> with G<sub>cal</sub> are recommended for their improved validation results and genetic trends.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/jbg.12939","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

本研究的目的是利用挪威红牛(NRF)的全国数据,检验处理缺失血统数据的不同方法对偏差、稳定性、准确性相对提高和遗传趋势的影响。数据集包括3,896,116头NRF奶牛的8,402,773条产奶量记录、包含4,957,544头动物的血统数据以及包含121,741个SNPs的170,293头动物的基因组数据集。对缺失父母的建模采用了三种方法:未知父母组(UPG)、元创始人(MF)和 "Q-Q+"方法。UPG 方法通常用于 NRF 牛的遗传评估,在血统中包含 52 个固定的 UPG。在 MF 方法中,定义了两个 MF:MF14 和 MF52,MF 被视为随机效应。MF14 包括 6 个按出生年份间隔定义的 NRF 品种 MF 和 8 个按品种起源定义的其他品种 MF。MF52 分类将所有 52 个 UPG 作为 MF,并考虑了它们之间的关系。Q-Q+"方法对非基因分型动物的 UPG 和 "J 因子 "的综合影响进行了校正,而对基因分型动物则避免了此类校正。这三种方法结合不同的 G 矩阵(Grtn 矩阵以 0.5 的等位基因频率(AF)和 10%的 A 权重(w)构建,G05 以 AF = 0.5 和 w = 0.0 构建,Gcal 以观察到的 AF 和 w = 0.0 构建),共测试了 8 个 ssGBLUP 模型。其中包括一个 UPG 模型(使用 Grtn)、四个 MF 模型(使用 Grtn 或 G05 的 MF14 和 MF52)和三个 Q-Q+ 模型(使用 Gcal、G05 或 Grtn)。通过屏蔽 5000 头基因分型青年牛的表型,对这些模型进行了交叉验证评估。结果表明,使用 Gcal 或 G05 矩阵的 Q-Q+ 模型在水平偏差方面明显(p 05)次于 Q-Q+ 模型,在膨胀偏差和稳定性方面的表现与 Q-Q+ 模型相似或略胜一筹。将 MF 的数量从 14 个增加到 52 个对偏差的影响很小,但对稳定性和遗传趋势估计值的影响很大。与 G05(1.18%)相比,使用 Grtn 的模型因增加表型数据而提高的准确率(2.01%)略高,但基于血统的模型因在部分数据集中增加表型数据(26%)或基因组数据(47%)而提高的准确率最高。总体而言,所有使用 G05 的模型都显示出最小的偏差(标准误差较小)和最稳定的预测,而使用 Grtn 的模型则显示出偏差和不稳定性。因此,建议使用 Q-Q+ 和 MF 模型与 G05 结合,以及 Q-Q+ 与 Gcal 结合,以改善验证结果和遗传趋势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Single-Step Genomic BLUP With Unknown Parent Groups and Metafounders in Norwegian Red Evaluations.

The objective of this study was to examine the effects of different methods for handling missing pedigree data on biases, stability, relative increase in accuracy, and genetic trends using national data from Norwegian Red (NRF) cattle. The dataset comprised 8,402,773 milk yield records from 3,896,116 NRF cows, a pedigree with 4,957,544 animals, and a genomic dataset from 170,293 animals with 121,741 SNPs. Missing parents were modelled using three approaches: unknown parent groups (UPG), metafounders (MF), and "Q-Q+" methods. The UPG method is routinely used for genetic evaluations of NRF cattle by including 52 fixed UPG in the pedigree. In the MF method, two MF were defined: MF14 and MF52, with MF treated as random effects. The MF14 included 6 MF defined by birth year intervals for NRF breed and 8 MF defined by breed origins for other breeds. The MF52 classification included all the 52 UPG as MF considering relationships among them. The "Q-Q+" approach corrects for the combined effects of UPG and "J factor" in non-genotyped animals while avoiding such corrections in genotyped animals. The three approaches, combined with different G matrices (Grtn matrix constructed with a 0.5 allele frequency (AF) and 10% weight (w) on A, G05 constructed using AF = 0.5 and w = 0.0, and Gcal constructed with observed AF and w = 0.0), led to eight ssGBLUP models being tested. This included one UPG model (using Grtn), four MF models (MF14 and MF52 using Grtn or G05), and three Q-Q+ models (using Gcal, G05, or Grtn). The models were evaluated through cross-validation by masking the phenotypes of 5000 genotyped young cows. Results showed that the Q-Q+ models using the Gcal or G05 matrix had significantly (p < 0.05) lower level biases and higher genetic trends than all other models. MF models with 14 or 52 groups using G05 were second best for level bias and performed similarly or slightly better than Q-Q+ models regarding inflation bias and stability. Increasing the number of MF from 14 to 52 had minimal effects on biases but significantly improved stability and genetic trend estimates. Models with Grtn had slightly higher gain in accuracy from adding phenotypic data (2.01%) than G05 (1.18%), but pedigree-based models showed the highest improvement in accuracy due to adding phenotypic (26%) or genomic (47%) data to the partial dataset. Overall, all models with G05 showed the least bias (with a small standard error) and most stable predictions, while models using Grtn introduced biases and instability. Thus, the Q-Q+ and MF models combined with G05 and Q-Q+ with Gcal are recommended for their improved validation results and genetic trends.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Animal Breeding and Genetics
Journal of Animal Breeding and Genetics 农林科学-奶制品与动物科学
CiteScore
5.20
自引率
3.80%
发文量
58
审稿时长
12-24 weeks
期刊介绍: The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信