A new distance measure of RNA ensembles and its application to phylogenetic tree construction

Sven Siebert, R. Backofen
{"title":"A new distance measure of RNA ensembles and its application to phylogenetic tree construction","authors":"Sven Siebert, R. Backofen","doi":"10.1109/CIBCB.2005.1594911","DOIUrl":null,"url":null,"abstract":"A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.
一种新的RNA集合距离测量方法及其在系统发育树构建中的应用
RNA结构分析的一个主要挑战是基于一级序列和二级结构推断共同的催化或调节功能。一些程序已经开发出来,将rna与这些给定的结构进行比较。然而,最重要的问题是很难确定rna所采用的结构,这是许多应用的必要先决条件;一旦一个结构被分配给一个序列(例如最小自由能结构),它就会影响程序的输出,从而影响科学结果,特别是在处理一组多个rna时。在本文中,我们进一步分析了RNA结构集成之间的距离。它们反映了基本在碱基配对概率矩阵上计算的结构关系。我们提出了两个碱基配对概率矩阵之间的距离测量显示相似或不相似的结构折叠行为。这包括共享最优、次优和局部二级结构的检测。因此,我们的距离测量避免落入固定特定结构的陷阱。在一组多个rna中的两两比较策略使我们使用邻居连接方法构建结构关系网络。试图预测系统发育树的讨论和证明,通过病毒rna的手段。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信