Hypervariable Regions in 16S rRNA Genes for the Taxonomic Classification

Osman Gursoy, M. Can
Southeast Europe Journal of Soft Computing, published 2019-04-27
DOI: 10.21533/SCJOURNAL.V8I1.171
Citation count: 3

Abstract

16S ribosomal RNA (rRNA) gene sequences are reliable markers for the taxonomic classification of microbes and are widely used in environmental microbiology. Producing large numbers of 16S rRNA gene amplicons that cover the full length of the gene is not yet feasible because of the limitations of current sequencing techniques, which mostly yield short reads of fewer than 300 base pairs. Hence, selecting the most efficient hypervariable regions for phylogenetic analysis and taxonomic classification is an active research area. Bacterial 16S rRNA genes contain nine hypervariable regions (V1–V9). Family-, genus-, and species-specific sequences within a given hypervariable region constitute useful targets for diagnostic assays and other scientific investigations. This study systematically compares the relative advantages of hypervariable regions grouped as V1–V2–V3, V4–V5–V6, and V7–V8–V9 for specific diagnostic goals. The built-in function LongestCommonSubsequence of the computer algebra package Mathematica is used to create an in silico pipeline that evaluates the taxonomic classification sensitivity of the hypervariable regions compared with the corresponding full-length sequences. Conclusions: Our results suggest that the V4–V5–V6 region might be an optimal sub-region for the design of universal primers with superior phylogenetic resolution for bacterial phyla.
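
The abstract does not describe the pipeline in detail, so the following is only a minimal Python sketch of the general idea: score a query region against reference sequences by the length of their longest shared contiguous run, then assign the best-scoring label. Mathematica's LongestCommonSubsequence is documented as returning the longest contiguous common run, which is what `longest_common_substring` imitates here. The function names, the normalisation in `similarity`, the nearest-reference rule in `classify`, and the toy sequences are all assumptions made for illustration and are not taken from the paper.

```python
def longest_common_substring(a: str, b: str) -> str:
    """Return the longest contiguous run of characters shared by a and b."""
    best_len = 0
    best_end = 0  # exclusive end index of the best run within a
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at a[i-2], b[j-2].
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len = curr[j]
                    best_end = i
        prev = curr
    return a[best_end - best_len:best_end]


def similarity(a: str, b: str) -> float:
    """Hypothetical score: shared-run length over the shorter sequence length."""
    if not a or not b:
        return 0.0
    return len(longest_common_substring(a, b)) / min(len(a), len(b))


def classify(query: str, references: dict) -> str:
    """Return the label of the reference most similar to the query."""
    return max(references, key=lambda label: similarity(query, references[label]))


# Toy, made-up sequences (not real 16S data), used only to exercise the code.
REFERENCES = {
    "taxon_A": "ACGTACGTGGCC",
    "taxon_B": "TTGACCAGTTAA",
}
QUERY_REGION = "CGTACGTG"

if __name__ == "__main__":
    print(classify(QUERY_REGION, REFERENCES))
```

Under these assumptions, a sensitivity comparison would run `classify` once with full-length queries and once with each region group (V1–V2–V3, V4–V5–V6, V7–V8–V9) and count how often the assigned labels agree. The quadratic-time dynamic programme is adequate for sequences of 16S length (roughly 1,500 bases) but would need a suffix-based method for much larger inputs.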