Graph neural network and diffusion model for modeling RNA interatomic interactions.

IF 5.4
Marek Justyna, Craig Zirbel, Maciej Antczak, Marta Szachniuk
{"title":"Graph neural network and diffusion model for modeling RNA interatomic interactions.","authors":"Marek Justyna, Craig Zirbel, Maciej Antczak, Marta Szachniuk","doi":"10.1093/bioinformatics/btaf515","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Ribonucleic acid (RNA) function is inherently linked to its 3D structure, traditionally determined by X-ray crystallography, Nuclear Magnetic Resonance, and Cryo-EM. However, these techniques often lack atomic-level resolution, highlighting the need for accurate in silico RNA structure prediction tools. Current state-of-the-art methods, such as AlphaFold3, Boltz1, RhoFold, or trRosettaRNA, rely on deep learning models that represent residues as frames and use transformers to learn relative positions. While effective for known RNA families, their performance drops for synthetic or novel families.</p><p><strong>Results: </strong>In this work, we explore the potential of graph neural networks and denoising diffusion probabilistic models for learning interatomic interactions. We model RNA as a graph in a coarse-grained, five-atom representation and evaluate our approach on a dataset of small RNA substructures, known as local RNA descriptors, which recur even in non-homologous structures. Generalization is assessed using a dataset partitioned by RNA family: the training set consists of rRNA and tRNA structures, while the test set includes descriptors from all other families. Our results demonstrate that the proposed method reliably predicts the structures of unseen descriptors and effectively adheres to user-defined constraints, such as Watson-Crick-Franklin interactions.</p><p><strong>Availability and implementation: </strong>The GraphaRNA source code is available on GitHub (github.com/mjustynaPhD/GraphaRNA); training/test datasets and pre-trained model weights are provided on Zenodo (zenodo.org/records/13750967).</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12472125/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Ribonucleic acid (RNA) function is inherently linked to its 3D structure, traditionally determined by X-ray crystallography, Nuclear Magnetic Resonance, and Cryo-EM. However, these techniques often lack atomic-level resolution, highlighting the need for accurate in silico RNA structure prediction tools. Current state-of-the-art methods, such as AlphaFold3, Boltz1, RhoFold, or trRosettaRNA, rely on deep learning models that represent residues as frames and use transformers to learn relative positions. While effective for known RNA families, their performance drops for synthetic or novel families.

Results: In this work, we explore the potential of graph neural networks and denoising diffusion probabilistic models for learning interatomic interactions. We model RNA as a graph in a coarse-grained, five-atom representation and evaluate our approach on a dataset of small RNA substructures, known as local RNA descriptors, which recur even in non-homologous structures. Generalization is assessed using a dataset partitioned by RNA family: the training set consists of rRNA and tRNA structures, while the test set includes descriptors from all other families. Our results demonstrate that the proposed method reliably predicts the structures of unseen descriptors and effectively adheres to user-defined constraints, such as Watson-Crick-Franklin interactions.

Availability and implementation: The GraphaRNA source code is available on GitHub (github.com/mjustynaPhD/GraphaRNA); training/test datasets and pre-trained model weights are provided on Zenodo (zenodo.org/records/13750967).

基于图神经网络和扩散模型的RNA原子间相互作用建模。
动机:RNA的功能与它的三维结构有着内在的联系,传统上是通过x射线晶体学、核磁共振和低温电镜来确定的。然而,这些技术往往缺乏原子水平的分辨率,突出了对精确的硅RNA结构预测工具的需求。目前最先进的方法,如AlphaFold3、Boltz1、RhoFold或trRosettaRNA,依赖于将残基表示为框架的深度学习模型,并使用变压器来学习相对位置。虽然对已知的RNA家族有效,但对合成的或新的RNA家族的性能下降。结果:在这项工作中,我们探索了图神经网络和去噪扩散概率模型在学习原子间相互作用方面的潜力。我们将RNA建模为一个粗粒度的,五原子表示的图,并在小RNA亚结构的数据集上评估我们的方法,称为局部RNA描述符,即使在非同源结构中也会重复出现。使用按RNA家族划分的数据集来评估泛化:训练集由rRNA和tRNA结构组成,而测试集包括来自所有其他家族的描述符。我们的结果表明,所提出的方法可靠地预测了看不见的描述符的结构,并有效地遵守了用户定义的约束,如沃森-克里克-富兰克林相互作用。可用性:graphana源代码可在GitHub (github.com/mjustynaPhD/GraphaRNA)上获得;Zenodo提供了训练/测试数据集和预训练的模型权重(zenodo.org/records/13750967).Supplementary信息:补充数据可在Bioinformatics在线获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信