用 R 方法利用基因组信息估算遗传率

IF 1.9 3区 农林科学 Q2 AGRICULTURE, DAIRY & ANIMAL SCIENCE
Mary Kate Hollifield, Daniela Lourenco, Ignacy Misztal
{"title":"用 R 方法利用基因组信息估算遗传率","authors":"Mary Kate Hollifield,&nbsp;Daniela Lourenco,&nbsp;Ignacy Misztal","doi":"10.1111/jbg.12863","DOIUrl":null,"url":null,"abstract":"<p>Estimating heritabilities with large genomic models by established methods such as restricted maximum likelihood (REML) or Bayesian via Gibbs sampling is computationally expensive. Alternatively, heritability can be estimated indirectly by method R and by maximum predictivity, referred to as MaxPred here, at a much lower computing cost. By method R, the heritability used for predictions with whole and partial data is considered the best estimate when the predictions based on partial data are unbiased relative to those with the complete data. By MaxPred, the heritability estimate is the one that maximizes predictivity. This study compared heritability estimation with genomic information using average information REML (AI–REML), method R and MaxPred. A simulated population was generated with ten generations of 5000 animals each and an effective population size of 80. Each animal had one record for a trait with a heritability of 0.3, a phenotypic variance of 10.0 and was genotyped at 50 k SNP. In method R, the heritability estimate is found when the expectation of a regression coefficient is equal to one. The regression is the EBV of selection candidates calculated with the whole dataset regressed on the EBV of candidates calculated from a partial dataset. In this study, we used the GBLUP framework and therefore, GEBV was calculated. The partial dataset was created by removing the last generation of phenotypes. Predictivity was defined as the correlation between the adjusted phenotypes of the selection candidates and their GEBV calculated from the partial data. We estimated the heritability for populations that included between three and 10 generations. In every scenario, predictivity increased as more data was used and was the highest at the simulated heritability. However, the predictivity for all data subsets and all heritabilities compared did not differ more than 0.01, suggesting MaxPred is not the best indication for heritability estimation. For the whole dataset, the heritability was estimated as 0.30 ± 0.01, 0.26 ± 0.01 and 0.30 ± 0.04 for AI–REML without genomics, AI–REML with genomics and method R with genomics, respectively. Heritability estimation with genomics by method R reduced timing by 83%, implying a reduction in computing time from 9.5 to 1.6 h, on average, compared to AI–REML with genomics. Method R has the potential to estimate heritabilities with large genomic information at a low cost when many generations of animals are present; however, the standard error can be high when only a few iterations are used.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":"141 5","pages":"550-558"},"PeriodicalIF":1.9000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jbg.12863","citationCount":"0","resultStr":"{\"title\":\"Estimation of heritability with genomic information by method R\",\"authors\":\"Mary Kate Hollifield,&nbsp;Daniela Lourenco,&nbsp;Ignacy Misztal\",\"doi\":\"10.1111/jbg.12863\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Estimating heritabilities with large genomic models by established methods such as restricted maximum likelihood (REML) or Bayesian via Gibbs sampling is computationally expensive. Alternatively, heritability can be estimated indirectly by method R and by maximum predictivity, referred to as MaxPred here, at a much lower computing cost. By method R, the heritability used for predictions with whole and partial data is considered the best estimate when the predictions based on partial data are unbiased relative to those with the complete data. By MaxPred, the heritability estimate is the one that maximizes predictivity. This study compared heritability estimation with genomic information using average information REML (AI–REML), method R and MaxPred. A simulated population was generated with ten generations of 5000 animals each and an effective population size of 80. Each animal had one record for a trait with a heritability of 0.3, a phenotypic variance of 10.0 and was genotyped at 50 k SNP. In method R, the heritability estimate is found when the expectation of a regression coefficient is equal to one. The regression is the EBV of selection candidates calculated with the whole dataset regressed on the EBV of candidates calculated from a partial dataset. In this study, we used the GBLUP framework and therefore, GEBV was calculated. The partial dataset was created by removing the last generation of phenotypes. Predictivity was defined as the correlation between the adjusted phenotypes of the selection candidates and their GEBV calculated from the partial data. We estimated the heritability for populations that included between three and 10 generations. In every scenario, predictivity increased as more data was used and was the highest at the simulated heritability. However, the predictivity for all data subsets and all heritabilities compared did not differ more than 0.01, suggesting MaxPred is not the best indication for heritability estimation. For the whole dataset, the heritability was estimated as 0.30 ± 0.01, 0.26 ± 0.01 and 0.30 ± 0.04 for AI–REML without genomics, AI–REML with genomics and method R with genomics, respectively. Heritability estimation with genomics by method R reduced timing by 83%, implying a reduction in computing time from 9.5 to 1.6 h, on average, compared to AI–REML with genomics. Method R has the potential to estimate heritabilities with large genomic information at a low cost when many generations of animals are present; however, the standard error can be high when only a few iterations are used.</p>\",\"PeriodicalId\":54885,\"journal\":{\"name\":\"Journal of Animal Breeding and Genetics\",\"volume\":\"141 5\",\"pages\":\"550-558\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jbg.12863\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Animal Breeding and Genetics\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jbg.12863\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jbg.12863","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

用限制性最大似然法(REML)或贝叶斯吉布斯抽样法等成熟方法估计大型基因组模型的遗传率,计算成本很高。另外,遗传率可以通过 R 方法和最大预测率(此处称为 MaxPred)间接估算,计算成本要低得多。根据 R 方法,当基于部分数据的预测相对于基于完整数据的预测无偏时,用于预测完整数据和部分数据的遗传率被认为是最佳估计值。根据 MaxPred 方法,遗传率估计值是预测性最大的估计值。本研究比较了使用平均信息 REML(AI-REML)、R 方法和 MaxPred 对基因组信息进行的遗传率估计。模拟种群共生成 10 代,每代 5000 只动物,有效种群规模为 80。每只动物都有一条性状记录,遗传率为 0.3,表型方差为 10.0,基因分型为 50 k SNP。在 R 方法中,当回归系数的期望值等于 1 时,就能找到遗传率估计值。回归系数是用整个数据集计算的候选基因的 EBV 值与用部分数据集计算的候选基因的 EBV 值的回归系数。在本研究中,我们使用的是 GBLUP 框架,因此计算的是 GEBV。部分数据集是通过删除上一代表型创建的。预测性被定义为候选品种调整后的表型与根据部分数据计算的 GEBV 之间的相关性。我们估算了三代到十代种群的遗传率。在每种情况下,随着使用的数据越多,预测率越高,在模拟遗传率时预测率最高。然而,所有数据子集的预测率和所有遗传率的比较差异都不超过 0.01,这表明 MaxPred 并不是遗传率估计的最佳指标。就整个数据集而言,无基因组学的 AI-REML、有基因组学的 AI-REML 和有基因组学的 R 方法的遗传率分别估计为 0.30 ± 0.01、0.26 ± 0.01 和 0.30 ± 0.04。与使用基因组学的 AI-REML 相比,使用 R 方法进行基因组学遗传率估计的时间减少了 83%,这意味着计算时间从平均 9.5 小时减少到 1.6 小时。当存在多代动物时,R方法有可能以较低的成本利用大量基因组信息估算遗传率;然而,当仅使用少量迭代时,标准误差可能会很高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Estimation of heritability with genomic information by method R

Estimation of heritability with genomic information by method R

Estimating heritabilities with large genomic models by established methods such as restricted maximum likelihood (REML) or Bayesian via Gibbs sampling is computationally expensive. Alternatively, heritability can be estimated indirectly by method R and by maximum predictivity, referred to as MaxPred here, at a much lower computing cost. By method R, the heritability used for predictions with whole and partial data is considered the best estimate when the predictions based on partial data are unbiased relative to those with the complete data. By MaxPred, the heritability estimate is the one that maximizes predictivity. This study compared heritability estimation with genomic information using average information REML (AI–REML), method R and MaxPred. A simulated population was generated with ten generations of 5000 animals each and an effective population size of 80. Each animal had one record for a trait with a heritability of 0.3, a phenotypic variance of 10.0 and was genotyped at 50 k SNP. In method R, the heritability estimate is found when the expectation of a regression coefficient is equal to one. The regression is the EBV of selection candidates calculated with the whole dataset regressed on the EBV of candidates calculated from a partial dataset. In this study, we used the GBLUP framework and therefore, GEBV was calculated. The partial dataset was created by removing the last generation of phenotypes. Predictivity was defined as the correlation between the adjusted phenotypes of the selection candidates and their GEBV calculated from the partial data. We estimated the heritability for populations that included between three and 10 generations. In every scenario, predictivity increased as more data was used and was the highest at the simulated heritability. However, the predictivity for all data subsets and all heritabilities compared did not differ more than 0.01, suggesting MaxPred is not the best indication for heritability estimation. For the whole dataset, the heritability was estimated as 0.30 ± 0.01, 0.26 ± 0.01 and 0.30 ± 0.04 for AI–REML without genomics, AI–REML with genomics and method R with genomics, respectively. Heritability estimation with genomics by method R reduced timing by 83%, implying a reduction in computing time from 9.5 to 1.6 h, on average, compared to AI–REML with genomics. Method R has the potential to estimate heritabilities with large genomic information at a low cost when many generations of animals are present; however, the standard error can be high when only a few iterations are used.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Animal Breeding and Genetics
Journal of Animal Breeding and Genetics 农林科学-奶制品与动物科学
CiteScore
5.20
自引率
3.80%
发文量
58
审稿时长
12-24 weeks
期刊介绍: The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信