Chaokun Yan , Jiabao Li , Qi Feng , Junwei Luo , Huimin Luo
{"title":"ResDeepGS: A deep learning-based method for crop phenotype prediction","authors":"Chaokun Yan , Jiabao Li , Qi Feng , Junwei Luo , Huimin Luo","doi":"10.1016/j.ymeth.2025.07.013","DOIUrl":null,"url":null,"abstract":"<div><div>Genomic selection (GS) is a breeding technique that utilizes genomic markers to predict the genetic potential of crops and animals. This approach holds significant promise for accelerating the improvement of agronomic traits and addressing food security challenges. While traditional breeding methods based on statistical or machine learning techniques have been useful in predicting traits for some crops, they often fail to capture the complex interactions between genotypes and phenotypes. Additionally, these methods struggle to handle large-scale data, limiting their predictive performance. Recent advancements in deep learning offer a promising solution by better capturing nonlinear relationships and gene interactions.</div><div>In this study, we propose a novel crop phenotype prediction method, ResDeepGS, which leverages deep learning techniques. The model consists of two main components: the feature selection module and the phenotype prediction module. The feature selection module employs an incremental recursive feature elimination method, combining the strengths of recursive feature elimination and incremental learning to improve both the efficiency and reliability of feature selection. The phenotype prediction module integrates an enhanced multi-layer convolutional neural network with residual structures and dropout strategies to better capture complex relationships in gene data, accelerate convergence, and reduce overfitting. Through extensive experimentation, we demonstrate that ResDeepGS outperforms current state-of-the-art methods on three datasets: wheat, maize, and soybean. Notably, on the wheat dataset, ResDeepGS improved prediction accuracy by 5% to 9%, highlighting its superior performance in genomic selection tasks. These results underscore the robustness and adaptability of ResDeepGS, offering a promising solution for enhancing crop breeding efficiency and addressing future food security challenges.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"244 ","pages":"Pages 65-74"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202325001872","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Genomic selection (GS) is a breeding technique that utilizes genomic markers to predict the genetic potential of crops and animals. This approach holds significant promise for accelerating the improvement of agronomic traits and addressing food security challenges. While traditional breeding methods based on statistical or machine learning techniques have been useful in predicting traits for some crops, they often fail to capture the complex interactions between genotypes and phenotypes. Additionally, these methods struggle to handle large-scale data, limiting their predictive performance. Recent advancements in deep learning offer a promising solution by better capturing nonlinear relationships and gene interactions.
In this study, we propose a novel crop phenotype prediction method, ResDeepGS, which leverages deep learning techniques. The model consists of two main components: the feature selection module and the phenotype prediction module. The feature selection module employs an incremental recursive feature elimination method, combining the strengths of recursive feature elimination and incremental learning to improve both the efficiency and reliability of feature selection. The phenotype prediction module integrates an enhanced multi-layer convolutional neural network with residual structures and dropout strategies to better capture complex relationships in gene data, accelerate convergence, and reduce overfitting. Through extensive experimentation, we demonstrate that ResDeepGS outperforms current state-of-the-art methods on three datasets: wheat, maize, and soybean. Notably, on the wheat dataset, ResDeepGS improved prediction accuracy by 5% to 9%, highlighting its superior performance in genomic selection tasks. These results underscore the robustness and adaptability of ResDeepGS, offering a promising solution for enhancing crop breeding efficiency and addressing future food security challenges.
期刊介绍:
Methods focuses on rapidly developing techniques in the experimental biological and medical sciences.
Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.