Enhancing prediction accuracy of grain yield in wheat lines adapted to the southeastern United States through multivariate and multi-environment genomic prediction models incorporating spectral and thermal information.
Jordan McBreen, Md Ali Babar, Diego Jarquin, Naeem Khan, Steve Harrison, Noah DeWitt, Mohamed Mergoum, Ben Lopez, Richard Boyles, Jeanette Lyerly, J Paul Murphy, Ehsan Shakiba, Russel Sutton, Amir Ibrahim, Kimberly Howell, Jared H Smith, Gina Brown-Guedira, Vijay Tiwari, Nicholas Santantonio, David A Van Sanford
{"title":"Enhancing prediction accuracy of grain yield in wheat lines adapted to the southeastern United States through multivariate and multi-environment genomic prediction models incorporating spectral and thermal information.","authors":"Jordan McBreen, Md Ali Babar, Diego Jarquin, Naeem Khan, Steve Harrison, Noah DeWitt, Mohamed Mergoum, Ben Lopez, Richard Boyles, Jeanette Lyerly, J Paul Murphy, Ehsan Shakiba, Russel Sutton, Amir Ibrahim, Kimberly Howell, Jared H Smith, Gina Brown-Guedira, Vijay Tiwari, Nicholas Santantonio, David A Van Sanford","doi":"10.1002/tpg2.20532","DOIUrl":null,"url":null,"abstract":"<p><p>Enhancing predictive modeling accuracy in wheat (Triticum aestivum) breeding through the integration of high-throughput phenotyping (HTP) data with genomic information is crucial for maximizing genetic gain. In this study, spanning four locations in the southeastern United States over 3 years, models to predict grain yield (GY) were investigated through different cross-validation approaches. The results demonstrate the superiority of multivariate comprehensive models that incorporate both genomic and HTP data, particularly in accurately predicting GY across diverse locations and years. These HTP-incorporating models achieve prediction accuracies ranging from 0.59 to 0.68, compared to 0.40-0.54 for genomic-only models when tested under different prediction scenarios both across years and locations. The comprehensive models exhibit superior generalization to new environments and achieve the highest accuracy when trained on diverse datasets. Predictive accuracy improves as models incorporate data from multiple years, highlighting the importance of considering temporal dynamics in modeling approaches. The study reveals that multivariate prediction outperformed genomic prediction methods in predicting lines across years and locations. The percentage of top 25% lines selected based on multivariate prediction was higher compared to genomic-only models, indicated by higher specificity, which is the proportion of correctly identified top-yielding lines that matched the observed top 25% performance across different sites and years. Additionally, the study addresses the prediction of untested locations based on other locations within the same year and in new years at previously tested locations. Findings show the comprehensive models effectively extrapolate to new environments, highlighting their potential for guiding breeding strategies.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20532"},"PeriodicalIF":3.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/tpg2.20532","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Enhancing predictive modeling accuracy in wheat (Triticum aestivum) breeding through the integration of high-throughput phenotyping (HTP) data with genomic information is crucial for maximizing genetic gain. In this study, spanning four locations in the southeastern United States over 3 years, models to predict grain yield (GY) were investigated through different cross-validation approaches. The results demonstrate the superiority of multivariate comprehensive models that incorporate both genomic and HTP data, particularly in accurately predicting GY across diverse locations and years. These HTP-incorporating models achieve prediction accuracies ranging from 0.59 to 0.68, compared to 0.40-0.54 for genomic-only models when tested under different prediction scenarios both across years and locations. The comprehensive models exhibit superior generalization to new environments and achieve the highest accuracy when trained on diverse datasets. Predictive accuracy improves as models incorporate data from multiple years, highlighting the importance of considering temporal dynamics in modeling approaches. The study reveals that multivariate prediction outperformed genomic prediction methods in predicting lines across years and locations. The percentage of top 25% lines selected based on multivariate prediction was higher compared to genomic-only models, indicated by higher specificity, which is the proportion of correctly identified top-yielding lines that matched the observed top 25% performance across different sites and years. Additionally, the study addresses the prediction of untested locations based on other locations within the same year and in new years at previously tested locations. Findings show the comprehensive models effectively extrapolate to new environments, highlighting their potential for guiding breeding strategies.
期刊介绍:
The Plant Genome publishes original research investigating all aspects of plant genomics. Technical breakthroughs reporting improvements in the efficiency and speed of acquiring and interpreting plant genomics data are welcome. The editorial board gives preference to novel reports that use innovative genomic applications that advance our understanding of plant biology that may have applications to crop improvement. The journal also publishes invited review articles and perspectives that offer insight and commentary on recent advances in genomics and their potential for agronomic improvement.