{"title":"A Machine Learning Approach to Resolving Incongruence in Molecular Phylogenies and Visualization Analysis","authors":"Xiaoxu Han","doi":"10.1109/CIBCB.2005.1594939","DOIUrl":null,"url":null,"abstract":"The incongruence between gene trees and species trees is one of the most pervasive challenges in molecular phylogenetics. In this work, a machine learning approach is proposed to overcome this problem. In the machine learning approach, the gene data set is clustered by a self-organizing map (SOM). Then a phylogenetically informative core gene set is created by combining the maximum entropy gene from each cluster to conduct phylogenetic analysis. Using the same data set, this approach performs better than the previous random gene concatenation method. The SOM based information visualization is also employed to compare the species patterns in the phylogenetic tree constructions.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594939","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The incongruence between gene trees and species trees is one of the most pervasive challenges in molecular phylogenetics. In this work, a machine learning approach is proposed to overcome this problem. In the machine learning approach, the gene data set is clustered by a self-organizing map (SOM). Then a phylogenetically informative core gene set is created by combining the maximum entropy gene from each cluster to conduct phylogenetic analysis. Using the same data set, this approach performs better than the previous random gene concatenation method. The SOM based information visualization is also employed to compare the species patterns in the phylogenetic tree constructions.