{"title":"Phylogenetic Consensus for Exact Median Trees","authors":"Pawel Tabaszewski, P. Górecki, O. Eulenstein","doi":"10.1145/3233547.3233560","DOIUrl":null,"url":null,"abstract":"Solving median tree problems is a classic approach for inferring species trees from a collection of discordant gene trees. Such problems are typically NP-hard and dealt with by local search heuristics. Unfortunately, such heuristics generally lack any provable correctness and precision. Algorithmic advances addressing this uncertainty, have led to exact dynamic programming formulations suitable to solve a well-studied group of median tree problems for smaller phylogenetic analyzes. However, these formulations allow to compute only very few optimal species trees out of possibly many such trees, and phylogenetic studies often require the analysis of all optimal solutions through their consensus tree. Here, we describe a significant algorithmic modification of the dynamic programming formulations that compute the cluster counts of all optimal species trees from which various types of consensus trees can be efficiently computed. Through experimental studies, we demonstrate that our parallel implementation of the modified programming formulation is more efficient than a previous implementation of the original formulation, and can greatly benefit phylogenetic analyses.","PeriodicalId":131906,"journal":{"name":"Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3233547.3233560","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Solving median tree problems is a classic approach for inferring species trees from a collection of discordant gene trees. Such problems are typically NP-hard and dealt with by local search heuristics. Unfortunately, such heuristics generally lack any provable correctness and precision. Algorithmic advances addressing this uncertainty, have led to exact dynamic programming formulations suitable to solve a well-studied group of median tree problems for smaller phylogenetic analyzes. However, these formulations allow to compute only very few optimal species trees out of possibly many such trees, and phylogenetic studies often require the analysis of all optimal solutions through their consensus tree. Here, we describe a significant algorithmic modification of the dynamic programming formulations that compute the cluster counts of all optimal species trees from which various types of consensus trees can be efficiently computed. Through experimental studies, we demonstrate that our parallel implementation of the modified programming formulation is more efficient than a previous implementation of the original formulation, and can greatly benefit phylogenetic analyses.