Abhinav Mittal, Sara E. Ali, David H. Mathews
{"title":"使用 RNAstructure 软件包预测保守的 RNA 结构。","authors":"Abhinav Mittal, Sara E. Ali, David H. Mathews","doi":"10.1002/cpz1.70054","DOIUrl":null,"url":null,"abstract":"<p>The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Predicting a structure conserved in three or more sequences with the RNAstructure web server</p><p><b>Basic Protocol 2</b>: Predicting a structure conserved in two sequences with the RNAstructure web server</p><p><b>Alternative Protocol 1</b>: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 2</b>: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 3</b>: Running TurboFold on the command line</p>","PeriodicalId":93970,"journal":{"name":"Current protocols","volume":"4 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using the RNAstructure Software Package to Predict Conserved RNA Structures\",\"authors\":\"Abhinav Mittal, Sara E. Ali, David H. Mathews\",\"doi\":\"10.1002/cpz1.70054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Predicting a structure conserved in three or more sequences with the RNAstructure web server</p><p><b>Basic Protocol 2</b>: Predicting a structure conserved in two sequences with the RNAstructure web server</p><p><b>Alternative Protocol 1</b>: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 2</b>: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 3</b>: Running TurboFold on the command line</p>\",\"PeriodicalId\":93970,\"journal\":{\"name\":\"Current protocols\",\"volume\":\"4 11\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current protocols\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpz1.70054\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current protocols","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpz1.70054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Using the RNAstructure Software Package to Predict Conserved RNA Structures
The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.
Basic Protocol 1: Predicting a structure conserved in three or more sequences with the RNAstructure web server
Basic Protocol 2: Predicting a structure conserved in two sequences with the RNAstructure web server
Alternative Protocol 1: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface
Alternative Protocol 2: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface
Alternative Protocol 3: Running TurboFold on the command line