LifePrint:一种新的k元组距离方法,用于构建系统发育树。

Q2 Biochemistry, Genetics and Molecular Biology
Fabián Reyes-Prieto, Adda J García-Chéquer, Hueman Jaimes-Díaz, Janet Casique-Almazán, Juana M Espinosa-Lara, Rosaura Palma-Orozco, Alfonso Méndez-Tenorio, Rogelio Maldonado-Rodríguez, Kenneth L Beattie
{"title":"LifePrint:一种新的k元组距离方法,用于构建系统发育树。","authors":"Fabián Reyes-Prieto,&nbsp;Adda J García-Chéquer,&nbsp;Hueman Jaimes-Díaz,&nbsp;Janet Casique-Almazán,&nbsp;Juana M Espinosa-Lara,&nbsp;Rosaura Palma-Orozco,&nbsp;Alfonso Méndez-Tenorio,&nbsp;Rogelio Maldonado-Rodríguez,&nbsp;Kenneth L Beattie","doi":"10.2147/AABC.S15021","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Here we describe LifePrint, a sequence alignment-independent k-tuple distance method to estimate relatedness between complete genomes.</p><p><strong>Methods: </strong>We designed a representative sample of all possible DNA tuples of length 9 (9-tuples). The final sample comprises 1878 tuples (called the LifePrint set of 9-tuples; LPS9) that are distinct from each other by at least two internal and noncontiguous nucleotide differences. For validation of our k-tuple distance method, we analyzed several real and simulated viroid genomes. Using different distance metrics, we scrutinized diverse viroid genomes to estimate the k-tuple distances between these genomic sequences. Then we used the estimated genomic k-tuple distances to construct phylogenetic trees using the neighbor-joining algorithm. A comparison of the accuracy of LPS9 and the previously reported 5-tuple method was made using symmetric differences between the trees estimated from each method and a simulated \"true\" phylogenetic tree.</p><p><strong>Results: </strong>The identified optimal search scheme for LPS9 allows only up to two nucleotide differences between each 9-tuple and the scrutinized genome. Similarity search results of simulated viroid genomes indicate that, in most cases, LPS9 is able to detect single-base substitutions between genomes efficiently. Analysis of simulated genomic variants with a high proportion of base substitutions indicates that LPS9 is able to discern relationships between genomic variants with up to 40% of nucleotide substitution.</p><p><strong>Conclusion: </strong>Our LPS9 method generates more accurate phylogenetic reconstructions than the previously proposed 5-tuples strategy. LPS9-reconstructed trees show higher bootstrap proportion values than distance trees derived from the 5-tuple method.</p>","PeriodicalId":53584,"journal":{"name":"Advances and Applications in Bioinformatics and Chemistry","volume":"4 ","pages":"13-27"},"PeriodicalIF":0.0000,"publicationDate":"2011-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2147/AABC.S15021","citationCount":"8","resultStr":"{\"title\":\"LifePrint: a novel k-tuple distance method for construction of phylogenetic trees.\",\"authors\":\"Fabián Reyes-Prieto,&nbsp;Adda J García-Chéquer,&nbsp;Hueman Jaimes-Díaz,&nbsp;Janet Casique-Almazán,&nbsp;Juana M Espinosa-Lara,&nbsp;Rosaura Palma-Orozco,&nbsp;Alfonso Méndez-Tenorio,&nbsp;Rogelio Maldonado-Rodríguez,&nbsp;Kenneth L Beattie\",\"doi\":\"10.2147/AABC.S15021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Here we describe LifePrint, a sequence alignment-independent k-tuple distance method to estimate relatedness between complete genomes.</p><p><strong>Methods: </strong>We designed a representative sample of all possible DNA tuples of length 9 (9-tuples). The final sample comprises 1878 tuples (called the LifePrint set of 9-tuples; LPS9) that are distinct from each other by at least two internal and noncontiguous nucleotide differences. For validation of our k-tuple distance method, we analyzed several real and simulated viroid genomes. Using different distance metrics, we scrutinized diverse viroid genomes to estimate the k-tuple distances between these genomic sequences. Then we used the estimated genomic k-tuple distances to construct phylogenetic trees using the neighbor-joining algorithm. A comparison of the accuracy of LPS9 and the previously reported 5-tuple method was made using symmetric differences between the trees estimated from each method and a simulated \\\"true\\\" phylogenetic tree.</p><p><strong>Results: </strong>The identified optimal search scheme for LPS9 allows only up to two nucleotide differences between each 9-tuple and the scrutinized genome. Similarity search results of simulated viroid genomes indicate that, in most cases, LPS9 is able to detect single-base substitutions between genomes efficiently. Analysis of simulated genomic variants with a high proportion of base substitutions indicates that LPS9 is able to discern relationships between genomic variants with up to 40% of nucleotide substitution.</p><p><strong>Conclusion: </strong>Our LPS9 method generates more accurate phylogenetic reconstructions than the previously proposed 5-tuples strategy. LPS9-reconstructed trees show higher bootstrap proportion values than distance trees derived from the 5-tuple method.</p>\",\"PeriodicalId\":53584,\"journal\":{\"name\":\"Advances and Applications in Bioinformatics and Chemistry\",\"volume\":\"4 \",\"pages\":\"13-27\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.2147/AABC.S15021\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances and Applications in Bioinformatics and Chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2147/AABC.S15021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2011/1/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances and Applications in Bioinformatics and Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2147/AABC.S15021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2011/1/20 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 8

摘要

目的:本文描述了LifePrint,这是一种独立于序列比对的k元组距离方法,用于估计完整基因组之间的相关性。方法:我们设计了一个具有代表性的样本,所有可能的DNA元组的长度为9(9-元组)。最后的样本包含1878个元组(称为9元组的LifePrint集合;LPS9)通过至少两个内部和不连续的核苷酸差异而彼此不同。为了验证我们的k元组距离方法,我们分析了几个真实和模拟的类病毒基因组。使用不同的距离度量,我们仔细检查了不同的类病毒基因组,以估计这些基因组序列之间的k元组距离。然后利用估计的基因组k元组距离,利用邻居连接算法构建系统发育树。比较LPS9和先前报道的5元组方法的准确性,使用从每种方法估计的树和模拟的“真实”系统发育树之间的对称差异。结果:确定的LPS9的最佳搜索方案只允许每个9元组和仔细检查的基因组之间最多两个核苷酸差异。模拟类病毒基因组的相似性搜索结果表明,在大多数情况下,LPS9能够有效地检测基因组之间的单碱基替换。对具有高比例碱基替换的模拟基因组变异的分析表明,LPS9能够识别具有高达40%核苷酸替换的基因组变异之间的关系。结论:我们的LPS9方法比之前提出的5元组策略产生更准确的系统发育重建。lps9重建的树显示出比5元组方法获得的距离树更高的自举比例值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

LifePrint: a novel k-tuple distance method for construction of phylogenetic trees.

LifePrint: a novel k-tuple distance method for construction of phylogenetic trees.

LifePrint: a novel k-tuple distance method for construction of phylogenetic trees.

LifePrint: a novel k-tuple distance method for construction of phylogenetic trees.

Purpose: Here we describe LifePrint, a sequence alignment-independent k-tuple distance method to estimate relatedness between complete genomes.

Methods: We designed a representative sample of all possible DNA tuples of length 9 (9-tuples). The final sample comprises 1878 tuples (called the LifePrint set of 9-tuples; LPS9) that are distinct from each other by at least two internal and noncontiguous nucleotide differences. For validation of our k-tuple distance method, we analyzed several real and simulated viroid genomes. Using different distance metrics, we scrutinized diverse viroid genomes to estimate the k-tuple distances between these genomic sequences. Then we used the estimated genomic k-tuple distances to construct phylogenetic trees using the neighbor-joining algorithm. A comparison of the accuracy of LPS9 and the previously reported 5-tuple method was made using symmetric differences between the trees estimated from each method and a simulated "true" phylogenetic tree.

Results: The identified optimal search scheme for LPS9 allows only up to two nucleotide differences between each 9-tuple and the scrutinized genome. Similarity search results of simulated viroid genomes indicate that, in most cases, LPS9 is able to detect single-base substitutions between genomes efficiently. Analysis of simulated genomic variants with a high proportion of base substitutions indicates that LPS9 is able to discern relationships between genomic variants with up to 40% of nucleotide substitution.

Conclusion: Our LPS9 method generates more accurate phylogenetic reconstructions than the previously proposed 5-tuples strategy. LPS9-reconstructed trees show higher bootstrap proportion values than distance trees derived from the 5-tuple method.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advances and Applications in Bioinformatics and Chemistry
Advances and Applications in Bioinformatics and Chemistry Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)
CiteScore
6.50
自引率
0.00%
发文量
7
审稿时长
16 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信