整合全基因组关联研究的大规模荟萃分析,提高猪群联合基因组预测的准确性。

IF 1.9 3区 农林科学 Q2 AGRICULTURE, DAIRY & ANIMAL SCIENCE
Xiaodian Cai, Wenjing Zhang, Ning Gao, Chen Wei, Xibo Wu, Jinglei Si, Yahui Gao, Jiaqi Li, Tong Yin, Zhe Zhang
{"title":"整合全基因组关联研究的大规模荟萃分析,提高猪群联合基因组预测的准确性。","authors":"Xiaodian Cai, Wenjing Zhang, Ning Gao, Chen Wei, Xibo Wu, Jinglei Si, Yahui Gao, Jiaqi Li, Tong Yin, Zhe Zhang","doi":"10.1111/jbg.12896","DOIUrl":null,"url":null,"abstract":"<p><p>The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations.\",\"authors\":\"Xiaodian Cai, Wenjing Zhang, Ning Gao, Chen Wei, Xibo Wu, Jinglei Si, Yahui Gao, Jiaqi Li, Tong Yin, Zhe Zhang\",\"doi\":\"10.1111/jbg.12896\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.</p>\",\"PeriodicalId\":54885,\"journal\":{\"name\":\"Journal of Animal Breeding and Genetics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Animal Breeding and Genetics\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1111/jbg.12896\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1111/jbg.12896","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

结合参考群体的策略已被广泛认为是提高基因组预测(GP)准确性的有效方法。本研究调查了利用先验信息和组合参考群体进行基因组预测的效率。从全基因组关联研究荟萃分析(GWAS meta-analysis)中获得的与性状相关的单核苷酸多态性(SNPs)先验信息被纳入三个模型中,以评估使用联合参考人群进行基因组预测的性能。使用两个不同的约克郡群体(P1(1259 个个体)和 P2(1018 个个体))的全基因组序列(WGS)数据(9,741,620 个 SNPs)来预测三个活体胴体性状(包括背膘厚、腰肌面积和腰肌深度)的基因组估计育种值。使用 10 × 5 倍交叉验证来评估从 P2 群体中随机选择的 203 头候选猪的预测准确性,参考群体由 P2 群体中的剩余猪和 P1 群体中逐步添加的猪组成。通过整合从 PigGTEx 项目下载的 GWAS meta-analysis 中不同 p 值阈值的 SNPs,比较了 GBLUP、基因组特征 BLUP(GFBLUP)和 GBLUP 给定遗传结构(BLUP|GA)的预测准确性。此外,我们还探讨了参考种群大小和基因组特征遗传力富集对 GFBLUP 和 BLUP|GA 相对于 GBLUP 预测准确率提高的影响。使用 P1 + P2 参考群体与 P2 参考群体相比,GBLUP 使用所有 WGS 标记的预测准确率平均提高了 4.380%。与使用基于单一参考群体的所有 SNP 的 GBLUP 相比,使用综合参考群体的 GFBLUP 和 BLUP|GA 预测准确率分别高出 6.179% 和 5.525%。根据预测准确率的提高(GFBLUP/BLUP|GA 与 GBLUP 之间)、参考群体的大小以及基因组特征的遗传富集程度估算出了正回归系数。与经典的 GBLUP 模型相比,整合了 GWAS 元分析信息的 GFBLUP 和 BLUP|GA 模型提高了预测准确率,而使用扩大了参考群体规模的合并群体则进一步提高了这两种方法的预测准确率。基因组特征的遗传力富集可以作为反映天气先验信息是否准确呈现的指标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations.

The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Animal Breeding and Genetics
Journal of Animal Breeding and Genetics 农林科学-奶制品与动物科学
CiteScore
5.20
自引率
3.80%
发文量
58
审稿时长
12-24 weeks
期刊介绍: The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信