fastTIGER: A rapid method for estimating evolutionary rates of sites from large datasets

Thu Kim Le, L. Vinh
Published in: 2021 13th International Conference on Knowledge and Systems Engineering (KSE)
Publication date: 2021-11-10
DOI: 10.1109/KSE53942.2021.9648748 (https://doi.org/10.1109/KSE53942.2021.9648748)
Citations: 1

Abstract

Evolutionary processes vary among the sites of an alignment, a phenomenon called rate heterogeneity, which must be properly handled when analyzing the evolutionary relationships among species based on their genomic data. To this end, methods have been proposed to estimate the relative evolutionary rates of sites. Tree Independent Generation of Evolutionary Rates (TIGER) is a popular method for this task. However, TIGER is computationally expensive on large datasets, especially whole-genome datasets. In this paper, we present a simplified, fast, and accurate method, called fastTIGER, to estimate evolutionary rates for large datasets. Experiments on several large real datasets show that the rates from fastTIGER correlate reasonably well with those estimated by TIGER, while fastTIGER is several orders of magnitude faster. Moreover, the site rates estimated by fastTIGER are as good as those estimated by TIGER when used to partition alignments for building maximum likelihood trees. The fastTIGER method enables us to study the evolutionary relationships among species using their genomic data.
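To make the idea of a tree-independent site score concrete, the following is a minimal Python sketch of a TIGER-style partition-agreement score as it is commonly described: each site splits the taxa into groups by character state, and a site's score is its average agreement with every other site. This is not the fastTIGER algorithm, whose simplification the abstract does not describe. The function names (`site_partition`, `partition_agreement`, `tiger_like_scores`) are hypothetical, and treating every character, including gaps, as an ordinary state is a simplifying assumption.

```python
from collections import defaultdict


def site_partition(column):
    """Group taxon indices by the character state they show at one site."""
    groups = defaultdict(set)
    for taxon, state in enumerate(column):
        groups[state].add(taxon)
    return list(groups.values())


def partition_agreement(p_i, p_j):
    """Fraction of groups in p_j that lie entirely inside some group of p_i."""
    hits = sum(1 for g in p_j if any(g <= h for h in p_i))
    return hits / len(p_j)


def tiger_like_scores(alignment):
    """Average agreement of each site with every other site.

    `alignment` is a list of equal-length sequences (one per taxon) with
    at least two sites. Assumption: high scores mark slowly evolving
    sites, low scores mark fast ones.
    """
    columns = list(zip(*alignment))          # one tuple of states per site
    parts = [site_partition(c) for c in columns]
    n = len(parts)
    scores = []
    for i in range(n):
        total = sum(
            partition_agreement(parts[i], parts[j])
            for j in range(n)
            if j != i
        )
        scores.append(total / (n - 1))
    return scores


if __name__ == "__main__":
    # Toy alignment: 4 taxa, 3 sites; the third site differs in every taxon.
    print(tiger_like_scores(["AAC", "AAG", "CCT", "CCA"]))
```

In this sketch every site is compared against every other site, so the work grows quadratically with the number of sites. That is consistent with the abstract's remark that TIGER becomes expensive on whole-genome alignments.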