Devising Isolation Forest-Based Method to Investigate the sRNAome of Mycobacterium tuberculosis Using sRNA-seq Data.

IF 2.3 Q3 BIOCHEMICAL RESEARCH METHODS
Bioinformatics and Biology Insights Pub Date : 2024-07-30 eCollection Date: 2024-01-01 DOI:10.1177/11779322241263674
Upasana Maity, Ritika Aggarwal, Rami Balasubramanian, Divya Lakshmi Venkatraman, Shubhada R Hegde
{"title":"Devising Isolation Forest-Based Method to Investigate the sRNAome of <i>Mycobacterium tuberculosis</i> Using sRNA-seq Data.","authors":"Upasana Maity, Ritika Aggarwal, Rami Balasubramanian, Divya Lakshmi Venkatraman, Shubhada R Hegde","doi":"10.1177/11779322241263674","DOIUrl":null,"url":null,"abstract":"<p><p>Small non-coding RNAs (sRNAs) regulate the synthesis of virulence factors and other pathogenic traits, which enables the bacteria to survive and proliferate after host infection. While high-throughput sequencing data have proved useful in identifying sRNAs from the intergenic regions (IGRs) of the genome, it remains a challenge to present a complete genome-wide map of the expression of the sRNAs. Moreover, existing methodologies necessitate multiple dependencies for executing their algorithm and also lack a targeted approach for the <i>de novo</i> sRNA identification. We developed an Isolation Forest algorithm-based method and the tool Prediction Of sRNAs using Isolation Forest for the <i>de novo</i> identification of sRNAs from available bacterial sRNA-seq data (http://posif.ibab.ac.in/). Using this framework, we predicted 1120 sRNAs and 46 small proteins in <i>Mycobacterium tuberculosis</i>. Besides, we highlight the context-dependent expression of novel sRNAs, their probable synthesis, and their potential relevance in stress response mechanisms manifested by <i>M. tuberculosis.</i></p>","PeriodicalId":9065,"journal":{"name":"Bioinformatics and Biology Insights","volume":"18 ","pages":"11779322241263674"},"PeriodicalIF":2.3000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11292719/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics and Biology Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11779322241263674","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Small non-coding RNAs (sRNAs) regulate the synthesis of virulence factors and other pathogenic traits, which enables the bacteria to survive and proliferate after host infection. While high-throughput sequencing data have proved useful in identifying sRNAs from the intergenic regions (IGRs) of the genome, it remains a challenge to present a complete genome-wide map of the expression of the sRNAs. Moreover, existing methodologies necessitate multiple dependencies for executing their algorithm and also lack a targeted approach for the de novo sRNA identification. We developed an Isolation Forest algorithm-based method and the tool Prediction Of sRNAs using Isolation Forest for the de novo identification of sRNAs from available bacterial sRNA-seq data (http://posif.ibab.ac.in/). Using this framework, we predicted 1120 sRNAs and 46 small proteins in Mycobacterium tuberculosis. Besides, we highlight the context-dependent expression of novel sRNAs, their probable synthesis, and their potential relevance in stress response mechanisms manifested by M. tuberculosis.

利用 sRNA-seq 数据设计基于森林隔离的方法来研究结核分枝杆菌的 sRNA 组
小非编码 RNA(sRNA)调控毒力因子和其他致病性特征的合成,从而使细菌在宿主感染后能够存活和增殖。虽然高通量测序数据已被证明有助于从基因组的基因间区(IGRs)识别 sRNAs,但要提供完整的全基因组 sRNAs 表达图谱仍是一项挑战。此外,现有的方法在执行算法时需要多重依赖,也缺乏一种有针对性的方法来从头开始鉴定 sRNA。我们开发了一种基于 Isolation Forest 算法的方法和工具 Prediction Of sRNAs using Isolation Forest,用于从现有的细菌 sRNA-seq 数据中从头鉴定 sRNA (http://posif.ibab.ac.in/)。利用这一框架,我们预测了结核分枝杆菌中的 1120 个 sRNA 和 46 个小蛋白。此外,我们还强调了新型 sRNA 的表达、可能的合成及其在结核分枝杆菌应激反应机制中的潜在相关性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Bioinformatics and Biology Insights
Bioinformatics and Biology Insights BIOCHEMICAL RESEARCH METHODS-
CiteScore
6.80
自引率
1.70%
发文量
36
审稿时长
8 weeks
期刊介绍: Bioinformatics and Biology Insights is an open access, peer-reviewed journal that considers articles on bioinformatics methods and their applications which must pertain to biological insights. All papers should be easily amenable to biologists and as such help bridge the gap between theories and applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信