Jean Beaulieu, Patrick R.N. Lenz, Jean-Philippe Laverdière, Simon Nadeau, Jean Bousquet
{"title":"标记覆盖率、状态数和训练集大小对树木育种中基因组选择的预测准确性和遗传率估算的影响的荟萃分析","authors":"Jean Beaulieu, Patrick R.N. Lenz, Jean-Philippe Laverdière, Simon Nadeau, Jean Bousquet","doi":"10.1007/s11295-024-01653-x","DOIUrl":null,"url":null,"abstract":"<p>Genomic selection (GS) is increasingly used in tree breeding because of the possibility to hasten breeding cycles, increase selection intensity or facilitate multi-trait selection, and to obtain less biased estimates of quantitative genetic parameters such as heritability. However, tree breeders are aiming to obtain accurate estimates of such parameters and breeding values while optimizing sampling and genotyping costs. We conducted a metadata analysis of results from 28 GS studies totalling 115 study-traits. We found that heritability estimates obtained using DNA marker-based information for a variety of traits and species were not significantly related to variation in the total number of markers ranging from about 1500 to 116 000, nor by the marker density, ranging from about 1 to 60 markers/centimorgan, nor by the status number of the breeding populations ranging from about 10 to 620, nor by the size of the training set ranging from 236 to 2458. However, the predictive accuracy of breeding values was generally higher when the status number of the breeding population was smaller, which was expected given the higher level of relatedness in small breeding populations, and the increased ability of a given number of markers to trace the long-range linkage disequilibrium in such conditions. According to expectations, the predictive accuracy also increased with the size of the training set used to build marker-based models. Genotyping arrays with a few to many thousand markers exist for several tree species and with the actual costs, GS could thus be efficiently implemented in many more tree breeding programs, delivering less biased genetic parameters and more accurate estimates of breeding values.</p>","PeriodicalId":23335,"journal":{"name":"Tree Genetics & Genomes","volume":"77 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A meta-analysis on the effects of marker coverage, status number, and size of training set on predictive accuracy and heritability estimates from genomic selection in tree breeding\",\"authors\":\"Jean Beaulieu, Patrick R.N. Lenz, Jean-Philippe Laverdière, Simon Nadeau, Jean Bousquet\",\"doi\":\"10.1007/s11295-024-01653-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Genomic selection (GS) is increasingly used in tree breeding because of the possibility to hasten breeding cycles, increase selection intensity or facilitate multi-trait selection, and to obtain less biased estimates of quantitative genetic parameters such as heritability. However, tree breeders are aiming to obtain accurate estimates of such parameters and breeding values while optimizing sampling and genotyping costs. We conducted a metadata analysis of results from 28 GS studies totalling 115 study-traits. We found that heritability estimates obtained using DNA marker-based information for a variety of traits and species were not significantly related to variation in the total number of markers ranging from about 1500 to 116 000, nor by the marker density, ranging from about 1 to 60 markers/centimorgan, nor by the status number of the breeding populations ranging from about 10 to 620, nor by the size of the training set ranging from 236 to 2458. However, the predictive accuracy of breeding values was generally higher when the status number of the breeding population was smaller, which was expected given the higher level of relatedness in small breeding populations, and the increased ability of a given number of markers to trace the long-range linkage disequilibrium in such conditions. According to expectations, the predictive accuracy also increased with the size of the training set used to build marker-based models. Genotyping arrays with a few to many thousand markers exist for several tree species and with the actual costs, GS could thus be efficiently implemented in many more tree breeding programs, delivering less biased genetic parameters and more accurate estimates of breeding values.</p>\",\"PeriodicalId\":23335,\"journal\":{\"name\":\"Tree Genetics & Genomes\",\"volume\":\"77 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tree Genetics & Genomes\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s11295-024-01653-x\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"FORESTRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tree Genetics & Genomes","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s11295-024-01653-x","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FORESTRY","Score":null,"Total":0}
A meta-analysis on the effects of marker coverage, status number, and size of training set on predictive accuracy and heritability estimates from genomic selection in tree breeding
Genomic selection (GS) is increasingly used in tree breeding because of the possibility to hasten breeding cycles, increase selection intensity or facilitate multi-trait selection, and to obtain less biased estimates of quantitative genetic parameters such as heritability. However, tree breeders are aiming to obtain accurate estimates of such parameters and breeding values while optimizing sampling and genotyping costs. We conducted a metadata analysis of results from 28 GS studies totalling 115 study-traits. We found that heritability estimates obtained using DNA marker-based information for a variety of traits and species were not significantly related to variation in the total number of markers ranging from about 1500 to 116 000, nor by the marker density, ranging from about 1 to 60 markers/centimorgan, nor by the status number of the breeding populations ranging from about 10 to 620, nor by the size of the training set ranging from 236 to 2458. However, the predictive accuracy of breeding values was generally higher when the status number of the breeding population was smaller, which was expected given the higher level of relatedness in small breeding populations, and the increased ability of a given number of markers to trace the long-range linkage disequilibrium in such conditions. According to expectations, the predictive accuracy also increased with the size of the training set used to build marker-based models. Genotyping arrays with a few to many thousand markers exist for several tree species and with the actual costs, GS could thus be efficiently implemented in many more tree breeding programs, delivering less biased genetic parameters and more accurate estimates of breeding values.
期刊介绍:
Tree Genetics and Genomes is an international, peer-reviewed journal, which provides for the rapid publication of high quality papers covering the areas of forest and horticultural tree genetics and genomics.
Topics covered in this journal include:
Structural, functional and comparative genomics
Evolutionary, population and quantitative genetics
Ecological and physiological genetics
Molecular, cellular and developmental genetics
Conservation and restoration genetics
Breeding and germplasm development
Bioinformatics and databases
Tree Genetics and Genomes publishes four types of papers:
(1) Original Paper
(2) Review
(3) Opinion Paper
(4) Short Communication.