Graph neural network and diffusion model for modeling RNA interatomic interactions.

IF 5.4

Bioinformatics (Oxford, England) Pub Date : 2025-09-01 DOI:10.1093/bioinformatics/btaf515

Marek Justyna, Craig Zirbel, Maciej Antczak, Marta Szachniuk

{"title":"Graph neural network and diffusion model for modeling RNA interatomic interactions.","authors":"Marek Justyna, Craig Zirbel, Maciej Antczak, Marta Szachniuk","doi":"10.1093/bioinformatics/btaf515","DOIUrl":null,"url":null,"abstract":"Motivation: Ribonucleic acid (RNA) function is inherently linked to its 3D structure, traditionally determined by X-ray crystallography, Nuclear Magnetic Resonance, and Cryo-EM. However, these techniques often lack atomic-level resolution, highlighting the need for accurate in silico RNA structure prediction tools. Current state-of-the-art methods, such as AlphaFold3, Boltz1, RhoFold, or trRosettaRNA, rely on deep learning models that represent residues as frames and use transformers to learn relative positions. While effective for known RNA families, their performance drops for synthetic or novel families.Results: In this work, we explore the potential of graph neural networks and denoising diffusion probabilistic models for learning interatomic interactions. We model RNA as a graph in a coarse-grained, five-atom representation and evaluate our approach on a dataset of small RNA substructures, known as local RNA descriptors, which recur even in non-homologous structures. Generalization is assessed using a dataset partitioned by RNA family: the training set consists of rRNA and tRNA structures, while the test set includes descriptors from all other families. Our results demonstrate that the proposed method reliably predicts the structures of unseen descriptors and effectively adheres to user-defined constraints, such as Watson-Crick-Franklin interactions.Availability and implementation: The GraphaRNA source code is available on GitHub (github.com/mjustynaPhD/GraphaRNA); training/test datasets and pre-trained model weights are provided on Zenodo (zenodo.org/records/13750967).","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12472125/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Motivation: Ribonucleic acid (RNA) function is inherently linked to its 3D structure, traditionally determined by X-ray crystallography, Nuclear Magnetic Resonance, and Cryo-EM. However, these techniques often lack atomic-level resolution, highlighting the need for accurate in silico RNA structure prediction tools. Current state-of-the-art methods, such as AlphaFold3, Boltz1, RhoFold, or trRosettaRNA, rely on deep learning models that represent residues as frames and use transformers to learn relative positions. While effective for known RNA families, their performance drops for synthetic or novel families.

Results: In this work, we explore the potential of graph neural networks and denoising diffusion probabilistic models for learning interatomic interactions. We model RNA as a graph in a coarse-grained, five-atom representation and evaluate our approach on a dataset of small RNA substructures, known as local RNA descriptors, which recur even in non-homologous structures. Generalization is assessed using a dataset partitioned by RNA family: the training set consists of rRNA and tRNA structures, while the test set includes descriptors from all other families. Our results demonstrate that the proposed method reliably predicts the structures of unseen descriptors and effectively adheres to user-defined constraints, such as Watson-Crick-Franklin interactions.

Availability and implementation: The GraphaRNA source code is available on GitHub (github.com/mjustynaPhD/GraphaRNA); training/test datasets and pre-trained model weights are provided on Zenodo (zenodo.org/records/13750967).

查看原文本刊更多论文

基于图神经网络和扩散模型的RNA原子间相互作用建模。

动机：RNA的功能与它的三维结构有着内在的联系，传统上是通过x射线晶体学、核磁共振和低温电镜来确定的。然而，这些技术往往缺乏原子水平的分辨率，突出了对精确的硅RNA结构预测工具的需求。目前最先进的方法，如AlphaFold3、Boltz1、RhoFold或trRosettaRNA，依赖于将残基表示为框架的深度学习模型，并使用变压器来学习相对位置。虽然对已知的RNA家族有效，但对合成的或新的RNA家族的性能下降。结果：在这项工作中，我们探索了图神经网络和去噪扩散概率模型在学习原子间相互作用方面的潜力。我们将RNA建模为一个粗粒度的，五原子表示的图，并在小RNA亚结构的数据集上评估我们的方法，称为局部RNA描述符，即使在非同源结构中也会重复出现。使用按RNA家族划分的数据集来评估泛化：训练集由rRNA和tRNA结构组成，而测试集包括来自所有其他家族的描述符。我们的结果表明，所提出的方法可靠地预测了看不见的描述符的结构，并有效地遵守了用户定义的约束，如沃森-克里克-富兰克林相互作用。可用性：graphana源代码可在GitHub （github.com/mjustynaPhD/GraphaRNA）上获得；Zenodo提供了训练/测试数据集和预训练的模型权重（zenodo.org/records/13750967）.Supplementary信息：补充数据可在Bioinformatics在线获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bioinformatics (Oxford, England)

自引率

0.00%

发文量