In silico prediction of variant effects: promises and limitations for precision plant breeding.

IF 4.2 1区 农林科学 Q1 AGRONOMY
Janek Sendrowski, Thomas Bataillon, Guillaume P Ramstein
{"title":"In silico prediction of variant effects: promises and limitations for precision plant breeding.","authors":"Janek Sendrowski, Thomas Bataillon, Guillaume P Ramstein","doi":"10.1007/s00122-025-04973-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Key message: </strong>Sequence-based AI models show great potential for prediction of variant effects at high resolution, but their practical value in plant breeding remains to be confirmed through rigorous validation studies. Plant breeding has traditionally relied on phenotyping to select individuals with desirable traits-a process that is both costly and time-consuming. Increasingly, breeding strategies are shifting toward precision breeding, where causal variants are directly targeted based on their effects. To predict the effects of causal variants, in silico methods are emerging as efficient alternatives or complements to mutagenesis screens. Here, we review state-of-the-art machine learning methods for predicting variant effects in plants across both coding and noncoding regions, contrasting supervised approaches in functional genomics with unsupervised methods in comparative genomics. We discuss challenges in validating predictions, and compare these methods with traditional association and comparative genomics techniques. We argue that modern sequence models extend traditional methods by generalizing across genomic contexts, fitting a unified model across loci rather than a separate model for each locus. In doing so, they address inherent limitations of traditional quantitative and evolutionary comparative genetics techniques. However, the accuracy and generalizability of sequence models heavily depend on the training data, highlighting the need for validation experiments. We point to successful applications of sequence models, especially with protein sequences, and identify areas for further improvement, especially in modeling regulatory sequences. While not yet mature for in silico-driven precision breeding, sequence models show strong potential to become an integral part of the breeder's toolbox.</p>","PeriodicalId":22955,"journal":{"name":"Theoretical and Applied Genetics","volume":"138 8","pages":"193"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12304032/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Applied Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s00122-025-04973-1","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Key message: Sequence-based AI models show great potential for prediction of variant effects at high resolution, but their practical value in plant breeding remains to be confirmed through rigorous validation studies. Plant breeding has traditionally relied on phenotyping to select individuals with desirable traits-a process that is both costly and time-consuming. Increasingly, breeding strategies are shifting toward precision breeding, where causal variants are directly targeted based on their effects. To predict the effects of causal variants, in silico methods are emerging as efficient alternatives or complements to mutagenesis screens. Here, we review state-of-the-art machine learning methods for predicting variant effects in plants across both coding and noncoding regions, contrasting supervised approaches in functional genomics with unsupervised methods in comparative genomics. We discuss challenges in validating predictions, and compare these methods with traditional association and comparative genomics techniques. We argue that modern sequence models extend traditional methods by generalizing across genomic contexts, fitting a unified model across loci rather than a separate model for each locus. In doing so, they address inherent limitations of traditional quantitative and evolutionary comparative genetics techniques. However, the accuracy and generalizability of sequence models heavily depend on the training data, highlighting the need for validation experiments. We point to successful applications of sequence models, especially with protein sequences, and identify areas for further improvement, especially in modeling regulatory sequences. While not yet mature for in silico-driven precision breeding, sequence models show strong potential to become an integral part of the breeder's toolbox.

变异效应的计算机预测:精确植物育种的希望与局限。
关键信息:基于序列的人工智能模型在高分辨率预测变异效应方面显示出巨大的潜力,但其在植物育种中的实用价值仍需通过严格的验证研究来证实。传统上,植物育种依靠表型来选择具有理想性状的个体,这一过程既昂贵又耗时。越来越多的育种策略转向精确育种,根据因果变异的影响直接针对它们。为了预测因果变异的影响,计算机方法正在成为突变筛选的有效替代或补充。在这里,我们回顾了用于预测植物编码区和非编码区变异效应的最先进的机器学习方法,比较了功能基因组学中的监督方法和比较基因组学中的无监督方法。我们讨论验证预测的挑战,并将这些方法与传统的关联和比较基因组学技术进行比较。我们认为,现代序列模型扩展了传统的方法,通过推广整个基因组背景,拟合一个统一的基因座模型,而不是每个基因座单独的模型。在这样做的过程中,他们解决了传统的定量和进化比较遗传学技术的固有局限性。然而,序列模型的准确性和泛化性在很大程度上依赖于训练数据,这突出了验证实验的必要性。我们指出了序列模型的成功应用,特别是蛋白质序列,并确定了进一步改进的领域,特别是在调节序列建模方面。虽然在硅驱动的精确育种尚未成熟,序列模型显示出强大的潜力,成为育种工具箱的一个组成部分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.60
自引率
7.40%
发文量
241
审稿时长
2.3 months
期刊介绍: Theoretical and Applied Genetics publishes original research and review articles in all key areas of modern plant genetics, plant genomics and plant biotechnology. All work needs to have a clear genetic component and significant impact on plant breeding. Theoretical considerations are only accepted in combination with new experimental data and/or if they indicate a relevant application in plant genetics or breeding. Emphasizing the practical, the journal focuses on research into leading crop plants and articles presenting innovative approaches.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信