统一提升对准的新用途

Mathematical Support for Molecular Biology Pub Date : 1900-01-01 DOI:10.1090/dimacs/047/02

D. Gusfield, Lusheng Wang

{"title":"统一提升对准的新用途","authors":"D. Gusfield, Lusheng Wang","doi":"10.1090/dimacs/047/02","DOIUrl":null,"url":null,"abstract":"The phylogenetic alignment problem a k a the tree alignment problem arises in e orts to deduce histories of molecular evolution and in certain methods to multiply align more than two sequences The problem is known to be NP hard but several bounded error approximation methods and polynomial time approximation schemes have been developed for the problem The rst of these approximationmethods is based on what are called lifted alignments and the second method is based on simpler uniform lifted alignments The simplicity of uniform lifted alignments compared to lifted alignments allows a deeper study of their properties and yet also gives a way to derive or compute results about lifted and optimal phylogenetic alignments In this paper we rst prove the factor of two error bound on the optimal uniform lifted alignment di erently than was previously done in Next we use uniform lifted alignments to establish error bounds on random lifted alignments Finally we use results about uniform lifted alignments to create an e cient algorithm to compute a non trivial lower bound on the cost of the optimal solution to the phylogenetic alignment problem given any problem instance We use that lower bound to gauge the accuracy of a phylogenetic alignment computed by Sanko et al AMS Subject Classi cation Primary Q Secondary C R C C B D Phylogenetic tree Alignment Evolutionary history is frequently represented by an evolutionary tree where known extant organ isms are represented at the leaves of the tree and their unknown but perhaps deduced ancestors are represented at internal nodes of the tree It is common now to deduce such evolutionary trees from molecular sequence data obtained from the organisms under study However the opposite direction of study is also possible When the evolutionary tree is already known from previous data and deductions it can be used to deduce possible ancestral molecular sequences that gave rise to the extant sequences through a series of mutational events This general problem has been called the phylogenetic alignment problem or the tree alignment problem and has been formalized as the problem of deducing sequences at the internal nodes to minimize the cost given by an objective function de ned below Partially supported by Dept of Energy grant DE FG ER In the above description the tree with its deduced internal node labels is the desired output of the problem However once the labeled tree is in hand one can also use it to nd a multiple alignment of the extant sequences which is in uenced by the hypothesized evolutionary history see or The details are a bit involved and we only mention this application as additional motivation for the phylogenetic alignment problem We will not discuss it further in this paper","PeriodicalId":175691,"journal":{"name":"Mathematical Support for Molecular Biology","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"New uses for uniform lifted alignments\",\"authors\":\"D. Gusfield, Lusheng Wang\",\"doi\":\"10.1090/dimacs/047/02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The phylogenetic alignment problem a k a the tree alignment problem arises in e orts to deduce histories of molecular evolution and in certain methods to multiply align more than two sequences The problem is known to be NP hard but several bounded error approximation methods and polynomial time approximation schemes have been developed for the problem The rst of these approximationmethods is based on what are called lifted alignments and the second method is based on simpler uniform lifted alignments The simplicity of uniform lifted alignments compared to lifted alignments allows a deeper study of their properties and yet also gives a way to derive or compute results about lifted and optimal phylogenetic alignments In this paper we rst prove the factor of two error bound on the optimal uniform lifted alignment di erently than was previously done in Next we use uniform lifted alignments to establish error bounds on random lifted alignments Finally we use results about uniform lifted alignments to create an e cient algorithm to compute a non trivial lower bound on the cost of the optimal solution to the phylogenetic alignment problem given any problem instance We use that lower bound to gauge the accuracy of a phylogenetic alignment computed by Sanko et al AMS Subject Classi cation Primary Q Secondary C R C C B D Phylogenetic tree Alignment Evolutionary history is frequently represented by an evolutionary tree where known extant organ isms are represented at the leaves of the tree and their unknown but perhaps deduced ancestors are represented at internal nodes of the tree It is common now to deduce such evolutionary trees from molecular sequence data obtained from the organisms under study However the opposite direction of study is also possible When the evolutionary tree is already known from previous data and deductions it can be used to deduce possible ancestral molecular sequences that gave rise to the extant sequences through a series of mutational events This general problem has been called the phylogenetic alignment problem or the tree alignment problem and has been formalized as the problem of deducing sequences at the internal nodes to minimize the cost given by an objective function de ned below Partially supported by Dept of Energy grant DE FG ER In the above description the tree with its deduced internal node labels is the desired output of the problem However once the labeled tree is in hand one can also use it to nd a multiple alignment of the extant sequences which is in uenced by the hypothesized evolutionary history see or The details are a bit involved and we only mention this application as additional motivation for the phylogenetic alignment problem We will not discuss it further in this paper\",\"PeriodicalId\":175691,\"journal\":{\"name\":\"Mathematical Support for Molecular Biology\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Support for Molecular Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1090/dimacs/047/02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Support for Molecular Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1090/dimacs/047/02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

系统校准的问题一个k树对齐问题在e运动推断分子进化的历史和在某些方法把两个多序列对齐问题是NP困难但有界的一些误差近似方法和多项式时间近似方案开发问题的rst approximationmethods是基于所谓的解除比对,第二种方法是基于简单的制服了均匀提升比对提升比对的简单性使得我们可以更深入地研究它们的性质，同时也提供了一种推导或计算提升比对和最优系统发育比对结果的方法。在本文中，我们首先证明了最优均匀提升比对的两个误差界的因子，这与之前在文章中所做的不同解除联盟创建一个e字母系数算法来计算一个非平凡的上下界的成本最优解系统一致性问题给出任何问题我们使用实例,下界来衡量系统对准的精度计算Sanko et al AMS主题抚慰阳离子主要问二级C B R C C D系统树对齐进化历史通常是由一个已知现存的器官的进化树主义派代表出席树的叶子和它们未知的但可能推断出的祖先在树的内部节点上表示。现在从所研究的生物体的分子序列数据中推断出这样的进化树是很常见的。然而，相反的研究方向也是可能的。当进化树已经从以前的数据和推断中知道时，它可以用来推断出可能的祖先分子序列，通过一系列的遗传序列产生了现存的序列突变事件这一普遍问题被称为系统一致性问题或树对齐问题,推导出序列的形式化问题内部节点的成本最小化目标函数de ned下面部分支持的能源部门授予de FG ER在上面描述的树推导出问题的内部节点标签所需的输出但是一旦标签树的手还可以使用它来nd a现存序列的多重比对受到假设的进化史的影响，具体细节有点复杂，我们只提到这种应用作为系统发育比对问题的额外动机，本文将不再进一步讨论它

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

New uses for uniform lifted alignments

The phylogenetic alignment problem a k a the tree alignment problem arises in e orts to deduce histories of molecular evolution and in certain methods to multiply align more than two sequences The problem is known to be NP hard but several bounded error approximation methods and polynomial time approximation schemes have been developed for the problem The rst of these approximationmethods is based on what are called lifted alignments and the second method is based on simpler uniform lifted alignments The simplicity of uniform lifted alignments compared to lifted alignments allows a deeper study of their properties and yet also gives a way to derive or compute results about lifted and optimal phylogenetic alignments In this paper we rst prove the factor of two error bound on the optimal uniform lifted alignment di erently than was previously done in Next we use uniform lifted alignments to establish error bounds on random lifted alignments Finally we use results about uniform lifted alignments to create an e cient algorithm to compute a non trivial lower bound on the cost of the optimal solution to the phylogenetic alignment problem given any problem instance We use that lower bound to gauge the accuracy of a phylogenetic alignment computed by Sanko et al AMS Subject Classi cation Primary Q Secondary C R C C B D Phylogenetic tree Alignment Evolutionary history is frequently represented by an evolutionary tree where known extant organ isms are represented at the leaves of the tree and their unknown but perhaps deduced ancestors are represented at internal nodes of the tree It is common now to deduce such evolutionary trees from molecular sequence data obtained from the organisms under study However the opposite direction of study is also possible When the evolutionary tree is already known from previous data and deductions it can be used to deduce possible ancestral molecular sequences that gave rise to the extant sequences through a series of mutational events This general problem has been called the phylogenetic alignment problem or the tree alignment problem and has been formalized as the problem of deducing sequences at the internal nodes to minimize the cost given by an objective function de ned below Partially supported by Dept of Energy grant DE FG ER In the above description the tree with its deduced internal node labels is the desired output of the problem However once the labeled tree is in hand one can also use it to nd a multiple alignment of the extant sequences which is in uenced by the hypothesized evolutionary history see or The details are a bit involved and we only mention this application as additional motivation for the phylogenetic alignment problem We will not discuss it further in this paper

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Mathematical Support for Molecular Biology

自引率

0.00%

发文量