Longest Common Subsequences in Bacteria Taxonomic Classification

M. Can, Osman Gursoy
{"title":"Longest Common Subsequences in Bacteria Taxonomic Classification","authors":"M. Can, Osman Gursoy","doi":"10.21533/SCJOURNAL.V7I2.166","DOIUrl":null,"url":null,"abstract":"In 1980s, Carl Woese made a ground breaking contribution to microbiology using rRNA-genes for phylogenetic classifications. He used it not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks obtained high accuracies using available datasets of their time. By the time, the number of bacteria increased enormously. In this article we used Longest Common Subsequence similarity measure to classify bacterial 16S rRNA gene sequences of 1.820.414 bacteria in SILVA, 3.196.038 bacteria in RDP, and 198.509 bacteria in Greengenes. The last two taxonomy have six taxonomical levels, phylum, class, order, family, genus, and species, while SILVA has two more levels subclass and suborder, but lacks species level. The majority of classifications (98%) were of high accuracy (98%).","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Southeast Europe Journal of Soft Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21533/SCJOURNAL.V7I2.166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In 1980s, Carl Woese made a ground breaking contribution to microbiology using rRNA-genes for phylogenetic classifications. He used it not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks obtained high accuracies using available datasets of their time. By the time, the number of bacteria increased enormously. In this article we used Longest Common Subsequence similarity measure to classify bacterial 16S rRNA gene sequences of 1.820.414 bacteria in SILVA, 3.196.038 bacteria in RDP, and 198.509 bacteria in Greengenes. The last two taxonomy have six taxonomical levels, phylum, class, order, family, genus, and species, while SILVA has two more levels subclass and suborder, but lacks species level. The majority of classifications (98%) were of high accuracy (98%).
细菌分类中的最长公共子序列
20世纪80年代,卡尔·沃斯(Carl Woese)利用rrna基因进行系统发育分类,对微生物学做出了开创性的贡献。他不仅用它来探索微生物的多样性,而且作为细菌注释的一种方法。今天,基于rrna的分析仍然是微生物学的核心方法。许多研究人员沿着这条轨道,使用了几代新一代的人工神经网络,利用当时可用的数据集获得了很高的精度。到那时,细菌的数量急剧增加。本文采用最长公共子序列相似性度量对SILVA的1.820.414株细菌、RDP的3.196.038株细菌和Greengenes的198.509株细菌的16S rRNA基因序列进行了分类。后两种分类法有门、纲、目、科、属和种6个级别,而SILVA有亚纲和亚目两个级别,但缺乏种级别。大多数分类(98%)准确率高(98%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信