一种新的序列不相似性测量方法及其在系统发育中的应用

Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li
{"title":"一种新的序列不相似性测量方法及其在系统发育中的应用","authors":"Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li","doi":"10.1109/ICNC.2008.299","DOIUrl":null,"url":null,"abstract":"We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.","PeriodicalId":6404,"journal":{"name":"2008 Fourth International Conference on Natural Computation","volume":"37 1","pages":"231-234"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny\",\"authors\":\"Xiao-hui Niu, Nana Li, Feng Shi, Xue-yan Li\",\"doi\":\"10.1109/ICNC.2008.299\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.\",\"PeriodicalId\":6404,\"journal\":{\"name\":\"2008 Fourth International Conference on Natural Computation\",\"volume\":\"37 1\",\"pages\":\"231-234\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Fourth International Conference on Natural Computation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNC.2008.299\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Fourth International Conference on Natural Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNC.2008.299","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们提出了一种新的计算方法来测量两个生物序列之间的距离。一个生物序列可以量化为一个有20个状态的马尔可夫链。计算随机状态转移矩阵作为生物序列的定量指标。利用Kullback-Leibler判别信息作为多样性指标,衡量两状态转移矩阵中每对行之间的不相似性。两个序列之间的距离定义为每个氨基酸出现可能性的加权平均值。我们说明了它的应用在重建真兽目系统发育使用连接的h链氨基酸序列。这种系统发育与人们普遍接受的真瑟利亚人的系统发育是一致的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Novel Measurement of Sequence Dissimilarity and Its Application to Phylogeny
We present a new computational approach to measure the distance between two biological sequences. A biological sequence quantifies as a Markov Chain with 20 states. Stochastic state transition matrix is computed as the quantitative index of the biological sequence. The Kullback-Leibler discrimination information is used as a diversity indicator to measure the dissimilarity of each pair of the rows in the two state transition matrix. Distance between the two sequences is defined as the average value with the weight of the occurrence possibility of each amino acid. We illustrate its application in reconstructing a phylogeny of the Eutherian orders using concatenated H-stranded amino acid sequences. This phylogeny is consistent with the commonly accepted one for the Eutherians.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信