Maria V. Kjetså, Arne B. Gjuvsland, Eli Grindflek, Theo Meuwissen
{"title":"利用全基因组序列、高密度和组合注释依赖性损耗基因型,参考种群规模和结构对两个猪品系母系性状基因组预测的影响。","authors":"Maria V. Kjetså, Arne B. Gjuvsland, Eli Grindflek, Theo Meuwissen","doi":"10.1111/jbg.12865","DOIUrl":null,"url":null,"abstract":"<p>The aim of this study was to investigate the reference population size required to obtain substantial prediction accuracy within- and across-lines and the effect of using a multi-line reference population for genomic predictions of maternal traits in pigs. The data consisted of two nucleus pig populations, one pure-bred Landrace (L) and one Synthetic (S) Yorkshire/Large White line. All animals were genotyped with up to 30 K animals in each line, and all had records on maternal traits. Prediction accuracy was tested with three different marker data sets: High-density SNP (HD), whole genome sequence (WGS), and markers derived from WGS based on pig combined annotation dependent depletion-score (pCADD). Also, two different genomic prediction methods (GBLUP and Bayes GC) were compared for four maternal traits; total number piglets born (TNB), total number of stillborn piglets (STB), Shoulder Lesion Score and Body Condition Score. The main results from this study showed that a reference population of 3 K–6 K animals for within-line prediction generally was sufficient to achieve high prediction accuracy. However, when the number of animals in the reference population was increased to 30 K, the prediction accuracy significantly increased for the traits TNB and STB. For multi-line prediction accuracy, the accuracy was most dependent on the number of within-line animals in the reference data. The S-line provided a generally higher prediction accuracy compared to the L-line. Using pCADD scores to reduce the number of markers from WGS data in combination with the GBLUP method generally reduced prediction accuracies relative to GBLUP using HD genotypes. The BayesGC method benefited from a large reference population and was less dependent on the different genotype marker datasets to achieve a high prediction accuracy.</p>","PeriodicalId":54885,"journal":{"name":"Journal of Animal Breeding and Genetics","volume":"141 6","pages":"587-601"},"PeriodicalIF":1.9000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jbg.12865","citationCount":"0","resultStr":"{\"title\":\"Effects of reference population size and structure on genomic prediction of maternal traits in two pig lines using whole-genome sequence-, high-density- and combined annotation-dependent depletion genotypes\",\"authors\":\"Maria V. Kjetså, Arne B. Gjuvsland, Eli Grindflek, Theo Meuwissen\",\"doi\":\"10.1111/jbg.12865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The aim of this study was to investigate the reference population size required to obtain substantial prediction accuracy within- and across-lines and the effect of using a multi-line reference population for genomic predictions of maternal traits in pigs. The data consisted of two nucleus pig populations, one pure-bred Landrace (L) and one Synthetic (S) Yorkshire/Large White line. All animals were genotyped with up to 30 K animals in each line, and all had records on maternal traits. Prediction accuracy was tested with three different marker data sets: High-density SNP (HD), whole genome sequence (WGS), and markers derived from WGS based on pig combined annotation dependent depletion-score (pCADD). Also, two different genomic prediction methods (GBLUP and Bayes GC) were compared for four maternal traits; total number piglets born (TNB), total number of stillborn piglets (STB), Shoulder Lesion Score and Body Condition Score. The main results from this study showed that a reference population of 3 K–6 K animals for within-line prediction generally was sufficient to achieve high prediction accuracy. However, when the number of animals in the reference population was increased to 30 K, the prediction accuracy significantly increased for the traits TNB and STB. For multi-line prediction accuracy, the accuracy was most dependent on the number of within-line animals in the reference data. The S-line provided a generally higher prediction accuracy compared to the L-line. Using pCADD scores to reduce the number of markers from WGS data in combination with the GBLUP method generally reduced prediction accuracies relative to GBLUP using HD genotypes. The BayesGC method benefited from a large reference population and was less dependent on the different genotype marker datasets to achieve a high prediction accuracy.</p>\",\"PeriodicalId\":54885,\"journal\":{\"name\":\"Journal of Animal Breeding and Genetics\",\"volume\":\"141 6\",\"pages\":\"587-601\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jbg.12865\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Animal Breeding and Genetics\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jbg.12865\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Animal Breeding and Genetics","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jbg.12865","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}
引用次数: 0
摘要
本研究的目的是调查在猪的母系性状基因组预测中,为获得较高的系内和跨系预测准确性所需的参考种群规模,以及使用多系参考种群的效果。数据由两个核心猪群组成,一个是纯种兰德猪(L),另一个是约克夏/大白猪合成猪(S)。所有动物都进行了基因分型,每个品系有多达 30 K 头动物,所有动物都有母性性状记录。预测准确性用三个不同的标记数据集进行了测试:高密度 SNP(HD)、全基因组序列(WGS)和基于猪联合注释依赖性损耗分数(pCADD)的 WGS 衍生标记。此外,还比较了两种不同的基因组预测方法(GBLUP 和 Bayes GC)对四个母性性状的预测结果:出生仔猪总数(TNB)、死胎仔猪总数(STB)、肩部损伤评分和体况评分。该研究的主要结果表明,一般来说,线内预测的参考群体为 3 K-6 K 头动物就足以达到较高的预测准确率。然而,当参考群体中的动物数量增加到 30 K 时,TNB 和 STB 性状的预测准确率显著提高。在多线预测准确性方面,准确性主要取决于参考数据中的线内动物数量。与 L 线相比,S 线的预测准确率普遍较高。与使用 HD 基因型的 GBLUP 方法相比,使用 pCADD 分数减少 WGS 数据中的标记数量通常会降低预测准确率。BayesGC 方法得益于庞大的参考群体,较少依赖不同的基因型标记数据集来获得较高的预测准确率。
Effects of reference population size and structure on genomic prediction of maternal traits in two pig lines using whole-genome sequence-, high-density- and combined annotation-dependent depletion genotypes
The aim of this study was to investigate the reference population size required to obtain substantial prediction accuracy within- and across-lines and the effect of using a multi-line reference population for genomic predictions of maternal traits in pigs. The data consisted of two nucleus pig populations, one pure-bred Landrace (L) and one Synthetic (S) Yorkshire/Large White line. All animals were genotyped with up to 30 K animals in each line, and all had records on maternal traits. Prediction accuracy was tested with three different marker data sets: High-density SNP (HD), whole genome sequence (WGS), and markers derived from WGS based on pig combined annotation dependent depletion-score (pCADD). Also, two different genomic prediction methods (GBLUP and Bayes GC) were compared for four maternal traits; total number piglets born (TNB), total number of stillborn piglets (STB), Shoulder Lesion Score and Body Condition Score. The main results from this study showed that a reference population of 3 K–6 K animals for within-line prediction generally was sufficient to achieve high prediction accuracy. However, when the number of animals in the reference population was increased to 30 K, the prediction accuracy significantly increased for the traits TNB and STB. For multi-line prediction accuracy, the accuracy was most dependent on the number of within-line animals in the reference data. The S-line provided a generally higher prediction accuracy compared to the L-line. Using pCADD scores to reduce the number of markers from WGS data in combination with the GBLUP method generally reduced prediction accuracies relative to GBLUP using HD genotypes. The BayesGC method benefited from a large reference population and was less dependent on the different genotype marker datasets to achieve a high prediction accuracy.
期刊介绍:
The Journal of Animal Breeding and Genetics publishes original articles by international scientists on genomic selection, and any other topic related to breeding programmes, selection, quantitative genetic, genomics, diversity and evolution of domestic animals. Researchers, teachers, and the animal breeding industry will find the reports of interest. Book reviews appear in many issues.