Zhou Yao, Mengting Yao, Chuang Wang, Ke Li, Junhao Guo, Yingjie Xiao, Jianbing Yan, Jianxiao Liu
{"title":"GEFormer: a Genomic Prediction Method of Genotype-Environment Interaction in Maize by Integrating Gating Mechanism MLP and Linear Attention Mechanism.","authors":"Zhou Yao, Mengting Yao, Chuang Wang, Ke Li, Junhao Guo, Yingjie Xiao, Jianbing Yan, Jianxiao Liu","doi":"10.1016/j.molp.2025.01.020","DOIUrl":null,"url":null,"abstract":"<p><p>The integration of genotypic and environmental data can enhance the prediction accuracy of field traits of crops. The existing genomic prediction methods fail to consider the environmental factors and do not consider the real growing environment of crops, resulting in low genomic prediction accuracy. In this work, we propose a genotype-environment interaction genomic prediction method in maize, called GEFormer, based on integrating the gating mechanism MLP and linear attention mechanism. Firstly, it uses gated multilayer perceptron (gMLP) to extract the local and global features among SNPs. Then, the Omni-dimensional Dynamic Convolution is used to extract the dynamic and comprehensive features of multiple environmental factors within each day in the consideration of the real growth pattern of crops. The linear attention mechanism is used to capture the temporal features of environmental changes. Finally, it uses the gating mechanism to fuse the genomic and environmental features effectively. We validate the accuracy of GEFormer in predicting important agronomic traits of maize, rice and wheat in three experimental scenarios: untested genotypes in tested environments, tested genotypes in untested environments, untested genotypes in untested environments. Experimental results show that GEFormer outperforms six cutting-edge statistical learning methods and four machine learning methods. Furthermore, it shows great advantages in the experimental scenario of untested genotypes in untested environments. In addition, we used GEFormer into three real-world breeding applications: phenotype prediction in unknown environments, hybrid phenotype prediction using inbred population, and cross-population phenotype prediction. The results illustrate that GEFormer exhibiting better prediction performance in actual breeding scenarios, and it can be utilized to assist crop breeding.</p>","PeriodicalId":19012,"journal":{"name":"Molecular Plant","volume":" ","pages":""},"PeriodicalIF":17.1000,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Plant","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.molp.2025.01.020","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The integration of genotypic and environmental data can enhance the prediction accuracy of field traits of crops. The existing genomic prediction methods fail to consider the environmental factors and do not consider the real growing environment of crops, resulting in low genomic prediction accuracy. In this work, we propose a genotype-environment interaction genomic prediction method in maize, called GEFormer, based on integrating the gating mechanism MLP and linear attention mechanism. Firstly, it uses gated multilayer perceptron (gMLP) to extract the local and global features among SNPs. Then, the Omni-dimensional Dynamic Convolution is used to extract the dynamic and comprehensive features of multiple environmental factors within each day in the consideration of the real growth pattern of crops. The linear attention mechanism is used to capture the temporal features of environmental changes. Finally, it uses the gating mechanism to fuse the genomic and environmental features effectively. We validate the accuracy of GEFormer in predicting important agronomic traits of maize, rice and wheat in three experimental scenarios: untested genotypes in tested environments, tested genotypes in untested environments, untested genotypes in untested environments. Experimental results show that GEFormer outperforms six cutting-edge statistical learning methods and four machine learning methods. Furthermore, it shows great advantages in the experimental scenario of untested genotypes in untested environments. In addition, we used GEFormer into three real-world breeding applications: phenotype prediction in unknown environments, hybrid phenotype prediction using inbred population, and cross-population phenotype prediction. The results illustrate that GEFormer exhibiting better prediction performance in actual breeding scenarios, and it can be utilized to assist crop breeding.
期刊介绍:
Molecular Plant is dedicated to serving the plant science community by publishing novel and exciting findings with high significance in plant biology. The journal focuses broadly on cellular biology, physiology, biochemistry, molecular biology, genetics, development, plant-microbe interaction, genomics, bioinformatics, and molecular evolution.
Molecular Plant publishes original research articles, reviews, Correspondence, and Spotlights on the most important developments in plant biology.