TSS-Captur: a user-friendly pipeline for characterizing unclassified RNA transcripts.

IF 4 Q1 GENETICS & HEREDITY
NAR Genomics and Bioinformatics Pub Date : 2024-12-18 eCollection Date: 2024-12-01 DOI:10.1093/nargab/lqae168
Mathias Witte Paz, Thomas Vogel, Kay Nieselt
{"title":"TSS-Captur: a user-friendly pipeline for characterizing unclassified RNA transcripts.","authors":"Mathias Witte Paz, Thomas Vogel, Kay Nieselt","doi":"10.1093/nargab/lqae168","DOIUrl":null,"url":null,"abstract":"<p><p>RNA-seq and its 5'-enrichment methods for prokaryotes have enabled the precise identification of transcription start sites (TSSs), improving gene expression analysis. Computational methods are applied to these data to identify TSSs and classify them based on proximal annotated genes. While some TSSs cannot be classified at all (orphan TSSs), other TSSs are found on the reverse strand of known genes (antisense TSSs) but are not associated with the direct transcription of any known gene. Here, we introduce TSS-Captur, a novel pipeline, which uses computational approaches to characterize genomic regions starting from experimentally confirmed but unclassified TSSs. By analyzing TSS data, TSS-Captur characterizes unclassified signals, complementing prokaryotic genome annotation tools. TSS-Captur categorizes extracted transcripts as either messenger RNA for genes with coding potential or non-coding RNA (ncRNA) for non-translated genes. Additionally, it predicts the transcription termination site for each putative transcript. For ncRNA genes, the secondary structure is computed. Moreover, all putative promoter regions are analyzed to identify enriched motifs. An interactive report allows seamless data exploration. We validated TSS-Captur with a <i>Campylobacter jejuni</i> dataset and characterized unlabeled ncRNAs in <i>Streptomyces coelicolor</i>. TSS-Captur is available both as a web-application and as a command-line tool.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"6 4","pages":"lqae168"},"PeriodicalIF":4.0000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11655288/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAR Genomics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/nargab/lqae168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

RNA-seq and its 5'-enrichment methods for prokaryotes have enabled the precise identification of transcription start sites (TSSs), improving gene expression analysis. Computational methods are applied to these data to identify TSSs and classify them based on proximal annotated genes. While some TSSs cannot be classified at all (orphan TSSs), other TSSs are found on the reverse strand of known genes (antisense TSSs) but are not associated with the direct transcription of any known gene. Here, we introduce TSS-Captur, a novel pipeline, which uses computational approaches to characterize genomic regions starting from experimentally confirmed but unclassified TSSs. By analyzing TSS data, TSS-Captur characterizes unclassified signals, complementing prokaryotic genome annotation tools. TSS-Captur categorizes extracted transcripts as either messenger RNA for genes with coding potential or non-coding RNA (ncRNA) for non-translated genes. Additionally, it predicts the transcription termination site for each putative transcript. For ncRNA genes, the secondary structure is computed. Moreover, all putative promoter regions are analyzed to identify enriched motifs. An interactive report allows seamless data exploration. We validated TSS-Captur with a Campylobacter jejuni dataset and characterized unlabeled ncRNAs in Streptomyces coelicolor. TSS-Captur is available both as a web-application and as a command-line tool.

tss - capture:一个用户友好的管道,用于表征未分类的RNA转录物。
RNA-seq及其用于原核生物的5'富集方法能够精确鉴定转录起始位点(tss),提高基因表达分析水平。对这些数据应用计算方法来识别tss,并基于近端注释基因对其进行分类。虽然有些TSSs根本无法分类(孤儿TSSs),但其他TSSs位于已知基因的反向链上(反义TSSs),但与任何已知基因的直接转录无关。在这里,我们介绍了TSS-Captur,一个新的管道,它使用计算方法来表征基因组区域,从实验证实但未分类的tss开始。通过分析TSS数据,TSS- capture对未分类信号进行表征,补充了原核基因组注释工具。tss - capture将提取的转录本分为具有编码潜力基因的信使RNA和非翻译基因的非编码RNA (ncRNA)。此外,它还预测了每个假定转录物的转录终止位点。对于ncRNA基因,二级结构被计算。此外,还分析了所有假定的启动子区域,以确定富集的基序。交互式报告允许无缝的数据探索。我们用空肠弯曲杆菌数据集验证了tss - capture,并对冷色链霉菌中未标记的ncrna进行了表征。tss - capture既可以作为web应用程序,也可以作为命令行工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
2.20%
发文量
95
审稿时长
15 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信