{"title":"Infer Species Phylogenies Using Self-Organizing Maps","authors":"Xiaoxu Han","doi":"10.4018/jkdb.2010040103","DOIUrl":null,"url":null,"abstract":"With rapid advances in genomics, phylogenetics has turned to phylogenomics due to the availability of large amounts of sequence and genome data. However, incongruence between species trees and gene trees remains a challenge in molecular phylogenetics for its biological and algorithmic complexities. A state-of-the-art gene concatenation approach was proposed to resolve this problem by inferring the species phylogeny using a random combination of widely distributed orthologous genes screened from genomes. However, such an approach may not be a robust solution to this problem because it ignores the fact that some genes are more informative than others in species inference. This paper presents a self-organizing map (SOM) based phylogeny inference method to overcome its weakness. The author’s proposed algorithm not only demonstrates its superiority to the original gene concatenation method by using same datasets, but also shows its advantages in generalization. This paper illustrates that data missing may not play a negative role in phylogeny inferring. This study presents a method to cluster multispecies genes, estimate multispecies gene entropy and visualize the species patterns through the self-organizing map mining.","PeriodicalId":160270,"journal":{"name":"Int. J. Knowl. Discov. Bioinform.","volume":"155 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Knowl. Discov. Bioinform.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/jkdb.2010040103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
With rapid advances in genomics, phylogenetics has turned to phylogenomics due to the availability of large amounts of sequence and genome data. However, incongruence between species trees and gene trees remains a challenge in molecular phylogenetics for its biological and algorithmic complexities. A state-of-the-art gene concatenation approach was proposed to resolve this problem by inferring the species phylogeny using a random combination of widely distributed orthologous genes screened from genomes. However, such an approach may not be a robust solution to this problem because it ignores the fact that some genes are more informative than others in species inference. This paper presents a self-organizing map (SOM) based phylogeny inference method to overcome its weakness. The author’s proposed algorithm not only demonstrates its superiority to the original gene concatenation method by using same datasets, but also shows its advantages in generalization. This paper illustrates that data missing may not play a negative role in phylogeny inferring. This study presents a method to cluster multispecies genes, estimate multispecies gene entropy and visualize the species patterns through the self-organizing map mining.