使用 RNAstructure 软件包预测保守的 RNA 结构。

Abhinav Mittal, Sara E. Ali, David H. Mathews
{"title":"使用 RNAstructure 软件包预测保守的 RNA 结构。","authors":"Abhinav Mittal,&nbsp;Sara E. Ali,&nbsp;David H. Mathews","doi":"10.1002/cpz1.70054","DOIUrl":null,"url":null,"abstract":"<p>The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Predicting a structure conserved in three or more sequences with the RNAstructure web server</p><p><b>Basic Protocol 2</b>: Predicting a structure conserved in two sequences with the RNAstructure web server</p><p><b>Alternative Protocol 1</b>: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 2</b>: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 3</b>: Running TurboFold on the command line</p>","PeriodicalId":93970,"journal":{"name":"Current protocols","volume":"4 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using the RNAstructure Software Package to Predict Conserved RNA Structures\",\"authors\":\"Abhinav Mittal,&nbsp;Sara E. Ali,&nbsp;David H. Mathews\",\"doi\":\"10.1002/cpz1.70054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Predicting a structure conserved in three or more sequences with the RNAstructure web server</p><p><b>Basic Protocol 2</b>: Predicting a structure conserved in two sequences with the RNAstructure web server</p><p><b>Alternative Protocol 1</b>: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 2</b>: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 3</b>: Running TurboFold on the command line</p>\",\"PeriodicalId\":93970,\"journal\":{\"name\":\"Current protocols\",\"volume\":\"4 11\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current protocols\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpz1.70054\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current protocols","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpz1.70054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

许多非编码 RNA(ncRNA)的结构在进化过程中的保守程度要高于其序列。通过预测两个或更多同源序列的保守结构,二级结构预测的准确性可以比单一序列的结构预测有所提高。在此,我们提供了使用 RNAstructure 套件中的四个程序预测保守结构的协议:Multilign、TurboFold、Dynalign 和 PARTS。TurboFold 对多个同源序列进行迭代对齐,并估计保守结构的配对概率。Dynalign、PARTS 和 Multilign 是动态编程算法,可同时对齐序列并识别共同的二级结构。Dynalign 使用一对同源物,找出自由能最低的共同结构。PARTS 使用一对同源物,并根据每个序列的碱基配对概率估算配对概率。Multilign 使用两个或更多同源物,并通过与 Dynalign 的多次配对计算找到自由能最低的共同结构。它与序列的数量成线性比例。我们概述了每个程序的优势。这些程序可通过网络服务器、命令行或图形用户界面运行。© 2024 Wiley Periodicals LLC.基本协议 1:使用 RNAstructure 网络服务器预测三个或更多序列中的保守结构 基本协议 2:使用 RNAstructure 网络服务器预测两个序列中的保守结构 替代协议 1:在 RNAstructure 图形用户界面中预测多个序列中的保守结构 替代协议 2:在 RNAstructure 图形用户界面中使用 Dynalign 预测两个序列中的保守结构 替代协议 3:在命令行上运行 TurboFold。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using the RNAstructure Software Package to Predict Conserved RNA Structures

The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.

Basic Protocol 1: Predicting a structure conserved in three or more sequences with the RNAstructure web server

Basic Protocol 2: Predicting a structure conserved in two sequences with the RNAstructure web server

Alternative Protocol 1: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface

Alternative Protocol 2: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface

Alternative Protocol 3: Running TurboFold on the command line

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信