{"title":"fastTIGER: A rapid method for estimating evolutionary rates of sites from large datasets","authors":"Thu Kim Le, L. Vinh","doi":"10.1109/KSE53942.2021.9648748","DOIUrl":null,"url":null,"abstract":"The evolutionary processes vary among sites of an alignment, called rate heterogeneity, that must be properly handled when analyzing the evolutionary relationships among species based on their genomic data. To this end, methods have been proposed to estimate the relative evolutionary rates between sites. Tree Independent Generation of Evolutionary Rates (TIGER) is a popular method to estimate the evolutionary rates among sites. However, the TIGER method is computationally expensive to calculate the evolutionary rates for large datasets, especially for whole genome datasets. In this paper, we present a simplified, fast, and accurate method, called fastTIGER, to estimate evolutionary rates for large datasets. Experiments on several large real datasets show that the evolutionary rates from the fastTIGER method have a reasonable correlation with ones estimated from the TIGER method while the fastTIGER method is several orders of magnitudes faster than the TIGER method. Moreover, the site rates estimated by fastTIGER method are as good as the ones estimated from the TIGER method in partitioning alignments to build maximum likelihood trees. The fastTIGER method enhances us to study the evolutionary relationships among species using their genomic data.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648748","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The evolutionary processes vary among sites of an alignment, called rate heterogeneity, that must be properly handled when analyzing the evolutionary relationships among species based on their genomic data. To this end, methods have been proposed to estimate the relative evolutionary rates between sites. Tree Independent Generation of Evolutionary Rates (TIGER) is a popular method to estimate the evolutionary rates among sites. However, the TIGER method is computationally expensive to calculate the evolutionary rates for large datasets, especially for whole genome datasets. In this paper, we present a simplified, fast, and accurate method, called fastTIGER, to estimate evolutionary rates for large datasets. Experiments on several large real datasets show that the evolutionary rates from the fastTIGER method have a reasonable correlation with ones estimated from the TIGER method while the fastTIGER method is several orders of magnitudes faster than the TIGER method. Moreover, the site rates estimated by fastTIGER method are as good as the ones estimated from the TIGER method in partitioning alignments to build maximum likelihood trees. The fastTIGER method enhances us to study the evolutionary relationships among species using their genomic data.