A meta-analysis on the effects of marker coverage, status number, and size of training set on predictive accuracy and heritability estimates from genomic selection in tree breeding

IF 1.6 3区生物学 Q2 FORESTRY

Tree Genetics & Genomes Pub Date : 2024-06-17 DOI:10.1007/s11295-024-01653-x

Jean Beaulieu, Patrick R.N. Lenz, Jean-Philippe Laverdière, Simon Nadeau, Jean Bousquet

{"title":"A meta-analysis on the effects of marker coverage, status number, and size of training set on predictive accuracy and heritability estimates from genomic selection in tree breeding","authors":"Jean Beaulieu, Patrick R.N. Lenz, Jean-Philippe Laverdière, Simon Nadeau, Jean Bousquet","doi":"10.1007/s11295-024-01653-x","DOIUrl":null,"url":null,"abstract":"<p>Genomic selection (GS) is increasingly used in tree breeding because of the possibility to hasten breeding cycles, increase selection intensity or facilitate multi-trait selection, and to obtain less biased estimates of quantitative genetic parameters such as heritability. However, tree breeders are aiming to obtain accurate estimates of such parameters and breeding values while optimizing sampling and genotyping costs. We conducted a metadata analysis of results from 28 GS studies totalling 115 study-traits. We found that heritability estimates obtained using DNA marker-based information for a variety of traits and species were not significantly related to variation in the total number of markers ranging from about 1500 to 116 000, nor by the marker density, ranging from about 1 to 60 markers/centimorgan, nor by the status number of the breeding populations ranging from about 10 to 620, nor by the size of the training set ranging from 236 to 2458. However, the predictive accuracy of breeding values was generally higher when the status number of the breeding population was smaller, which was expected given the higher level of relatedness in small breeding populations, and the increased ability of a given number of markers to trace the long-range linkage disequilibrium in such conditions. According to expectations, the predictive accuracy also increased with the size of the training set used to build marker-based models. Genotyping arrays with a few to many thousand markers exist for several tree species and with the actual costs, GS could thus be efficiently implemented in many more tree breeding programs, delivering less biased genetic parameters and more accurate estimates of breeding values.</p>","PeriodicalId":23335,"journal":{"name":"Tree Genetics & Genomes","volume":"77 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tree Genetics & Genomes","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s11295-024-01653-x","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FORESTRY","Score":null,"Total":0}

引用次数: 0

Abstract

Genomic selection (GS) is increasingly used in tree breeding because of the possibility to hasten breeding cycles, increase selection intensity or facilitate multi-trait selection, and to obtain less biased estimates of quantitative genetic parameters such as heritability. However, tree breeders are aiming to obtain accurate estimates of such parameters and breeding values while optimizing sampling and genotyping costs. We conducted a metadata analysis of results from 28 GS studies totalling 115 study-traits. We found that heritability estimates obtained using DNA marker-based information for a variety of traits and species were not significantly related to variation in the total number of markers ranging from about 1500 to 116 000, nor by the marker density, ranging from about 1 to 60 markers/centimorgan, nor by the status number of the breeding populations ranging from about 10 to 620, nor by the size of the training set ranging from 236 to 2458. However, the predictive accuracy of breeding values was generally higher when the status number of the breeding population was smaller, which was expected given the higher level of relatedness in small breeding populations, and the increased ability of a given number of markers to trace the long-range linkage disequilibrium in such conditions. According to expectations, the predictive accuracy also increased with the size of the training set used to build marker-based models. Genotyping arrays with a few to many thousand markers exist for several tree species and with the actual costs, GS could thus be efficiently implemented in many more tree breeding programs, delivering less biased genetic parameters and more accurate estimates of breeding values.

Abstract Image

查看原文本刊更多论文

标记覆盖率、状态数和训练集大小对树木育种中基因组选择的预测准确性和遗传率估算的影响的荟萃分析

基因组选择（GS）在林木育种中的应用越来越广泛，因为它可以加快育种周期、提高选择强度或促进多性状选择，并能获得偏差较小的数量遗传参数（如遗传率）估算值。然而，树木育种者的目标是在优化采样和基因分型成本的同时，获得此类参数和育种值的准确估计值。我们对 28 项 GS 研究共 115 个研究性状的结果进行了元数据分析。我们发现，利用基于 DNA 标记的信息获得的各种性状和物种的遗传率估计值与标记总数（从约 1500 个到 116 000 个不等）、标记密度（从约 1 个到 60 个标记/厘米器官不等）、育种群体的数量（从约 10 个到 620 个不等）以及训练集的大小（从 236 个到 2458 个不等）的变化关系不大。然而，当育种群体的数量较少时，育种值的预测准确率普遍较高，这是预料之中的，因为小规模育种群体的亲缘关系水平较高，在这种情况下，一定数量的标记追踪长程连锁不平衡的能力较强。根据预期，预测准确率也会随着用于建立基于标记的模型的训练集的大小而提高。一些树种的基因分型阵列有几千到几万个标记，在实际成本允许的情况下，GS 可以有效地应用于更多的树木育种计划，从而减少遗传参数的偏差，更准确地估计育种价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Tree Genetics & Genomes 生物-林学

CiteScore

4.40

自引率

4.20%

发文量

审稿时长

2 months

期刊介绍： Tree Genetics and Genomes is an international, peer-reviewed journal, which provides for the rapid publication of high quality papers covering the areas of forest and horticultural tree genetics and genomics. Topics covered in this journal include: Structural, functional and comparative genomics Evolutionary, population and quantitative genetics Ecological and physiological genetics Molecular, cellular and developmental genetics Conservation and restoration genetics Breeding and germplasm development Bioinformatics and databases Tree Genetics and Genomes publishes four types of papers: (1) Original Paper (2) Review (3) Opinion Paper (4) Short Communication.