{"title":"利用SFSI R-package进行多性状/环境稀疏基因组预测。","authors":"Marco Lopez-Cruz, Gustavo de Los Campos","doi":"10.1002/tpg2.70050","DOIUrl":null,"url":null,"abstract":"<p><p>Sparse selection indices (SSIs) can be used to predict the genetic merit of selection candidates using high-dimensional phenotypes (e.g., crop imaging) measured on each of the candidates of selection. Unlike traditional selection indices, SSIs can perform variable selection, thus enabling borrowing of information from a subset of the measured phenotypes. Likewise, sparse genomic prediction (SGP) can be used to predict genetic merit by borrowing information from a subset of the training dataset. In this study, we introduce a framework for multi-trait/environment SGP (MT-SGP) that combines the features of SSI and SGP into a single model. For candidates of selection, an MT-SGP produces prediction equations that use subsets of the training data, borrowing information from correlated traits expressed in training genotypes that are genetically close to the candidates of selection. Along with the methodology, we present an R-package (sparse family and selection index) that provides functions to solve SSIs, SGP, and MT-SGP problems. After presenting simplified examples that illustrate the use of the functions included in the package, we provide extensive benchmarks (using three data sets covering three crops and 30 traits/environments). Our results suggest that MT-SGP either outperforms (with up to 15% gains in prediction accuracy) or performs similarly to MT-genomic best linear unbiased prediction. The benchmarks provide insight regarding the conditions (sample size, genetic correlation among traits, and trait heritability) under which the use of MT-SGP can lead to gains in prediction accuracy.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":"18 2","pages":"e70050"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12166114/pdf/","citationCount":"0","resultStr":"{\"title\":\"Multi-trait/environment sparse genomic prediction using the SFSI R-package.\",\"authors\":\"Marco Lopez-Cruz, Gustavo de Los Campos\",\"doi\":\"10.1002/tpg2.70050\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Sparse selection indices (SSIs) can be used to predict the genetic merit of selection candidates using high-dimensional phenotypes (e.g., crop imaging) measured on each of the candidates of selection. Unlike traditional selection indices, SSIs can perform variable selection, thus enabling borrowing of information from a subset of the measured phenotypes. Likewise, sparse genomic prediction (SGP) can be used to predict genetic merit by borrowing information from a subset of the training dataset. In this study, we introduce a framework for multi-trait/environment SGP (MT-SGP) that combines the features of SSI and SGP into a single model. For candidates of selection, an MT-SGP produces prediction equations that use subsets of the training data, borrowing information from correlated traits expressed in training genotypes that are genetically close to the candidates of selection. Along with the methodology, we present an R-package (sparse family and selection index) that provides functions to solve SSIs, SGP, and MT-SGP problems. After presenting simplified examples that illustrate the use of the functions included in the package, we provide extensive benchmarks (using three data sets covering three crops and 30 traits/environments). Our results suggest that MT-SGP either outperforms (with up to 15% gains in prediction accuracy) or performs similarly to MT-genomic best linear unbiased prediction. The benchmarks provide insight regarding the conditions (sample size, genetic correlation among traits, and trait heritability) under which the use of MT-SGP can lead to gains in prediction accuracy.</p>\",\"PeriodicalId\":49002,\"journal\":{\"name\":\"Plant Genome\",\"volume\":\"18 2\",\"pages\":\"e70050\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12166114/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Genome\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/tpg2.70050\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/tpg2.70050","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
Multi-trait/environment sparse genomic prediction using the SFSI R-package.
Sparse selection indices (SSIs) can be used to predict the genetic merit of selection candidates using high-dimensional phenotypes (e.g., crop imaging) measured on each of the candidates of selection. Unlike traditional selection indices, SSIs can perform variable selection, thus enabling borrowing of information from a subset of the measured phenotypes. Likewise, sparse genomic prediction (SGP) can be used to predict genetic merit by borrowing information from a subset of the training dataset. In this study, we introduce a framework for multi-trait/environment SGP (MT-SGP) that combines the features of SSI and SGP into a single model. For candidates of selection, an MT-SGP produces prediction equations that use subsets of the training data, borrowing information from correlated traits expressed in training genotypes that are genetically close to the candidates of selection. Along with the methodology, we present an R-package (sparse family and selection index) that provides functions to solve SSIs, SGP, and MT-SGP problems. After presenting simplified examples that illustrate the use of the functions included in the package, we provide extensive benchmarks (using three data sets covering three crops and 30 traits/environments). Our results suggest that MT-SGP either outperforms (with up to 15% gains in prediction accuracy) or performs similarly to MT-genomic best linear unbiased prediction. The benchmarks provide insight regarding the conditions (sample size, genetic correlation among traits, and trait heritability) under which the use of MT-SGP can lead to gains in prediction accuracy.
期刊介绍:
The Plant Genome publishes original research investigating all aspects of plant genomics. Technical breakthroughs reporting improvements in the efficiency and speed of acquiring and interpreting plant genomics data are welcome. The editorial board gives preference to novel reports that use innovative genomic applications that advance our understanding of plant biology that may have applications to crop improvement. The journal also publishes invited review articles and perspectives that offer insight and commentary on recent advances in genomics and their potential for agronomic improvement.