{"title":"一种新的RNA集合距离测量方法及其在系统发育树构建中的应用","authors":"Sven Siebert, R. Backofen","doi":"10.1109/CIBCB.2005.1594911","DOIUrl":null,"url":null,"abstract":"A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A new distance measure of RNA ensembles and its application to phylogenetic tree construction\",\"authors\":\"Sven Siebert, R. Backofen\",\"doi\":\"10.1109/CIBCB.2005.1594911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.\",\"PeriodicalId\":330810,\"journal\":{\"name\":\"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2005.1594911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new distance measure of RNA ensembles and its application to phylogenetic tree construction
A major challenge in RNA structure analysis is to infer common catalytic or regulatory functions based on primary sequences and secondary structures. Some programs have been developed that compare RNAs with such given structures. Nevertheless, the most important problem is that it is hard to determine the adopted structures of RNAs which are a necessary prerequisite to numerous applications; once a structure has been assigned to a sequence (e.g. the minimum free energy structure), it influences the output of the programs and thus affects the scientific result, especially when dealing with a set of multiple RNAs. In this paper, we go one step further and analyze distances between RNA structure ensembles. They reflect structural relationships computed basically on base-pairing probability matrices. We propose a distance measure between two base-pairing probability matrices showing similar or non-similar structural folding behaviour. This includes the detection of shared optimal, suboptimal and local secondary structures. Consequently, our distance measure avoids falling into the trap of fixing specific structures. A pairwise comparison strategy in a set of multiple RNAs leads us to construct a network of structural relationships using the neighbour joining method. Attempts to predict phylogenetic trees are discussed and demonstrated by means of viral RNAs.