Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates.

IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY
Genetics Pub Date : 2024-11-22 DOI:10.1093/genetics/iyae195
Jacob D Washburn, José Ignacio Varela, Alencar Xavier, Qiuyue Chen, David Ertl, Joseph L Gage, James B Holland, Dayane Cristina Lima, Maria Cinta Romay, Marco Lopez-Cruz, Gustavo de Los Campos, Wesley Barber, Cristiano Zimmer, Ignacio Trucillo Silva, Fabiani Rocha, Renaud Rincent, Baber Ali, Haixiao Hu, Daniel E Runcie, Kirill Gusev, Andrei Slabodkin, Phillip Bax, Julie Aubert, Hugo Gangloff, Tristan Mary-Huard, Theodore Vanrenterghem, Carles Quesada-Traver, Steven Yates, Daniel Ariza-Suárez, Argeo Ulrich, Michele Wyler, Daniel R Kick, Emily S Bellis, Jason L Causey, Emilio Soriano Chavez, Yixing Wang, Ved Piyush, Gayara D Fernando, Robert K Hu, Rachit Kumar, Annan J Timon, Rasika Venkatesh, Kenia Segura Abá, Huan Chen, Thilanka Ranaweera, Shin-Han Shiu, Peiran Wang, Max J Gordon, B K Amos, Sebastiano Busato, Daniel Perondi, Abhishek Gogna, Dennis Psaroudakis, C P James Chen, Hawlader A Al-Mamun, Monica F Danilevicz, Shriprabha R Upadhyaya, David Edwards, Natalia de Leon
{"title":"Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates.","authors":"Jacob D Washburn, José Ignacio Varela, Alencar Xavier, Qiuyue Chen, David Ertl, Joseph L Gage, James B Holland, Dayane Cristina Lima, Maria Cinta Romay, Marco Lopez-Cruz, Gustavo de Los Campos, Wesley Barber, Cristiano Zimmer, Ignacio Trucillo Silva, Fabiani Rocha, Renaud Rincent, Baber Ali, Haixiao Hu, Daniel E Runcie, Kirill Gusev, Andrei Slabodkin, Phillip Bax, Julie Aubert, Hugo Gangloff, Tristan Mary-Huard, Theodore Vanrenterghem, Carles Quesada-Traver, Steven Yates, Daniel Ariza-Suárez, Argeo Ulrich, Michele Wyler, Daniel R Kick, Emily S Bellis, Jason L Causey, Emilio Soriano Chavez, Yixing Wang, Ved Piyush, Gayara D Fernando, Robert K Hu, Rachit Kumar, Annan J Timon, Rasika Venkatesh, Kenia Segura Abá, Huan Chen, Thilanka Ranaweera, Shin-Han Shiu, Peiran Wang, Max J Gordon, B K Amos, Sebastiano Busato, Daniel Perondi, Abhishek Gogna, Dennis Psaroudakis, C P James Chen, Hawlader A Al-Mamun, Monica F Danilevicz, Shriprabha R Upadhyaya, David Edwards, Natalia de Leon","doi":"10.1093/genetics/iyae195","DOIUrl":null,"url":null,"abstract":"<p><p>Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements and field management notes, gathered by the project over nine years. The competition attracted registrants from around the world with representation from academic, government, industry, and non-profit institutions as well as unaffiliated. These participants came from diverse disciplines include plant science, animal science, breeding, statistics, computational biology and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner's strategy involved two models combining machine learning and traditional breeding tools: one model emphasized environment using features extracted by Random Forest, Ridge Regression and Least-squares, and one focused on genetics. Other high-performing teams' methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics; weather; and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/genetics/iyae195","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023 the first open-to-the-public Genomes to Fields (G2F) initiative Genotype by Environment (GxE) prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements and field management notes, gathered by the project over nine years. The competition attracted registrants from around the world with representation from academic, government, industry, and non-profit institutions as well as unaffiliated. These participants came from diverse disciplines include plant science, animal science, breeding, statistics, computational biology and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner's strategy involved two models combining machine learning and traditional breeding tools: one model emphasized environment using features extracted by Random Forest, Ridge Regression and Least-squares, and one focused on genetics. Other high-performing teams' methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics; weather; and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.

全球基因型与环境预测竞赛表明,多样化的建模策略可以提供令人满意的玉米产量估计。
从遗传和环境因素的结合中预测表型是现代生物学面临的巨大挑战。在这一领域稍有改进,就有可能拯救生命、提高粮食和燃料安全、更好地爱护地球,并创造其他积极成果。2022 年和 2023 年,首次面向公众的 "从基因组到田野(G2F)"计划 "环境基因型(GxE)"预测竞赛举行,使用的是该项目历时九年收集的大量数据集,包括基因组变异、表型和天气测量数据以及田间管理记录。比赛吸引了来自世界各地的参赛者,包括学术界、政府、工业界、非营利机构以及非相关机构的代表。这些参赛者来自植物科学、动物科学、育种学、统计学、计算生物学等不同学科。有些参与者没有接受过正规的遗传学或植物相关培训,有些则刚刚开始研究生教育。各队采用了不同的方法和策略,在共同数据集的基础上提供了丰富的建模知识。获胜者的策略包括将机器学习和传统育种工具相结合的两个模型:一个模型利用随机森林、岭回归和最小二乘法提取的特征来强调环境,另一个模型则侧重于遗传学。其他优秀团队的方法包括定量遗传学、机器学习/深度学习、机理模型和模型组合。使用的数据集因素(如遗传学、天气和管理数据)也多种多样,这表明在本次竞赛中,没有任何一种模型或策略能远远优于所有其他模型或策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genetics
Genetics GENETICS & HEREDITY-
CiteScore
6.90
自引率
6.10%
发文量
177
审稿时长
1.5 months
期刊介绍: GENETICS is published by the Genetics Society of America, a scholarly society that seeks to deepen our understanding of the living world by advancing our understanding of genetics. Since 1916, GENETICS has published high-quality, original research presenting novel findings bearing on genetics and genomics. The journal publishes empirical studies of organisms ranging from microbes to humans, as well as theoretical work. While it has an illustrious history, GENETICS has changed along with the communities it serves: it is not your mentor''s journal. The editors make decisions quickly – in around 30 days – without sacrificing the excellence and scholarship for which the journal has long been known. GENETICS is a peer reviewed, peer-edited journal, with an international reach and increasing visibility and impact. All editorial decisions are made through collaboration of at least two editors who are practicing scientists. GENETICS is constantly innovating: expanded types of content include Reviews, Commentary (current issues of interest to geneticists), Perspectives (historical), Primers (to introduce primary literature into the classroom), Toolbox Reviews, plus YeastBook, FlyBook, and WormBook (coming spring 2016). For particularly time-sensitive results, we publish Communications. As part of our mission to serve our communities, we''ve published thematic collections, including Genomic Selection, Multiparental Populations, Mouse Collaborative Cross, and the Genetics of Sex.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信