利用独立成分评估基因组相似性

Q4 Medicine
T. Sáfadi, L. M. Ferreira
{"title":"利用独立成分评估基因组相似性","authors":"T. Sáfadi, L. M. Ferreira","doi":"10.28951/rbb.v38i1.439","DOIUrl":null,"url":null,"abstract":"We propose the use of independent component analysis to find similarities of genomes. Considering different numbers of independent components, the complete linkage method was used to identify groups based on the estimated coefficients of the mixing matrix. The sequences analyzed correspond to the strains of the Mycobacterium tuberculosis genome, ten sequences were analyzed, obtained from the National Center for Biotechnology Information (NCBI, 2017). The GC-content of each sequence was evaluated using a sliding window of 10,000 bases. The clustering analysis using the independent components of the analyzed sequences was essential to verify the dissimilarity of the sequences.","PeriodicalId":36293,"journal":{"name":"Revista Brasileira de Biometria","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EVALUATION OF GENOME SIMILARITIES USING INDEPENDENT COMPONENTS\",\"authors\":\"T. Sáfadi, L. M. Ferreira\",\"doi\":\"10.28951/rbb.v38i1.439\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose the use of independent component analysis to find similarities of genomes. Considering different numbers of independent components, the complete linkage method was used to identify groups based on the estimated coefficients of the mixing matrix. The sequences analyzed correspond to the strains of the Mycobacterium tuberculosis genome, ten sequences were analyzed, obtained from the National Center for Biotechnology Information (NCBI, 2017). The GC-content of each sequence was evaluated using a sliding window of 10,000 bases. The clustering analysis using the independent components of the analyzed sequences was essential to verify the dissimilarity of the sequences.\",\"PeriodicalId\":36293,\"journal\":{\"name\":\"Revista Brasileira de Biometria\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Revista Brasileira de Biometria\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.28951/rbb.v38i1.439\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Brasileira de Biometria","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28951/rbb.v38i1.439","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

我们建议使用独立成分分析来发现基因组的相似性。考虑不同数量的独立成分,采用完全链接法根据混合矩阵的估计系数进行群体识别。分析的序列与结核分枝杆菌基因组菌株相对应,分析了10个序列,这些序列来自国家生物技术信息中心(NCBI, 2017)。每个序列的gc含量使用10,000个碱基的滑动窗口进行评估。利用所分析序列的独立分量进行聚类分析是验证序列相似性的必要手段。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
EVALUATION OF GENOME SIMILARITIES USING INDEPENDENT COMPONENTS
We propose the use of independent component analysis to find similarities of genomes. Considering different numbers of independent components, the complete linkage method was used to identify groups based on the estimated coefficients of the mixing matrix. The sequences analyzed correspond to the strains of the Mycobacterium tuberculosis genome, ten sequences were analyzed, obtained from the National Center for Biotechnology Information (NCBI, 2017). The GC-content of each sequence was evaluated using a sliding window of 10,000 bases. The clustering analysis using the independent components of the analyzed sequences was essential to verify the dissimilarity of the sequences.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Revista Brasileira de Biometria
Revista Brasileira de Biometria Agricultural and Biological Sciences-Agricultural and Biological Sciences (all)
自引率
0.00%
发文量
0
审稿时长
53 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信