A new distance measure of RNA ensembles and its application to phylogenetic tree construction

2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology Pub Date : 1900-01-01 DOI:10.1109/CIBCB.2005.1594911

Sven Siebert, R. Backofen

{"title":"A new distance measure of RNA ensembles and its application to phylogenetic tree construction","authors":"Sven Siebert, R. Backofen","doi":"10.1109/CIBCB.2005.1594911","DOIUrl":null,"url":null,"abstract":"A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.

查看原文本刊更多论文

一种新的RNA集合距离测量方法及其在系统发育树构建中的应用

RNA结构分析的一个主要挑战是基于一级序列和二级结构推断共同的催化或调节功能。一些程序已经开发出来，将rna与这些给定的结构进行比较。然而，最重要的问题是很难确定rna所采用的结构，这是许多应用的必要先决条件;一旦一个结构被分配给一个序列(例如最小自由能结构)，它就会影响程序的输出，从而影响科学结果，特别是在处理一组多个rna时。在本文中，我们进一步分析了RNA结构集成之间的距离。它们反映了基本在碱基配对概率矩阵上计算的结构关系。我们提出了两个碱基配对概率矩阵之间的距离测量显示相似或不相似的结构折叠行为。这包括共享最优、次优和局部二级结构的检测。因此，我们的距离测量避免落入固定特定结构的陷阱。在一组多个rna中的两两比较策略使我们使用邻居连接方法构建结构关系网络。试图预测系统发育树的讨论和证明，通过病毒rna的手段。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology

自引率

0.00%

发文量