A comparison between hierarchical clustering and community detection method in the collection of gene targets for molecular identification of pathogenic fungi

I. Thapa, S. Bhowmick, D. Bastola
Published in: 2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops
Publication date: 2012-10-04
DOI: 10.1109/BIBMW.2012.6470234 (https://doi.org/10.1109/BIBMW.2012.6470234)

Abstract

The ribosomal RNA sequence is a popular primary molecular target in the diagnosis of many fungal and bacterial infections. More recently, a number of other molecular targets, such as 'cytochrome b', 'rpoB', and 'actin', have become available in public databases such as GenBank. These sequences could be better alternatives to the popular ribosomal RNA as molecular targets. However, existing computational approaches do not provide a convenient way to collect these sequences and make them available for the development of new, alternative sequence-based diagnostics, which are critical for the early detection of infectious agents such as fungi. The long-term goal of this study is to develop a computational tool for the rapid identification of infectious agents in biological samples. In the present study, we focus on pre-processing sequence data from public databases and compare a number of clustering approaches for classifying currently available DNA sequences into different target genes. We evaluate the correctness of these methods based on the target classification of seven different species of Zygomycetes. A clustering comparison metric shows that the community detection and hierarchical clustering methods perform on par, both with high accuracy.
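
The abstract does not name the clustering comparison metric it uses. As an illustration only, the sketch below assumes the Rand index, one common choice, which scores two clusterings by the fraction of item pairs they treat the same way (both grouped together, or both kept apart). The function name `rand_index`, the six toy sequences, and their cluster labels are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Return the fraction of item pairs on which two clusterings agree.

    A pair counts as an agreement when both clusterings put the two items
    in the same cluster, or both put them in different clusters.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        raise ValueError("need at least two items")
    agree = 0
    for i, j in pairs:
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a == same_b:
            agree += 1
    return agree / len(pairs)


# Hypothetical toy data: cluster assignments for six sequences.
# Label values are arbitrary; only the grouping matters.
hierarchical = ["rRNA", "rRNA", "cytb", "cytb", "actin", "actin"]
community = [0, 0, 1, 1, 2, 1]  # last sequence lands in a different group

score = rand_index(hierarchical, community)
print(f"Rand index: {score:.2f}")
```

A score of 1.0 means the two methods produce identical groupings. Because the metric depends only on which items are grouped together, the two methods do not need to share a label vocabulary.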