Shusruto Rishik,Leidy Alejandra G Molano,Syed Mohammed Khalid,Tiniko Babalashvili,Fadlilah Nur Hasanah,Md Mobashir Rahman,Pascal Hirsch,Friederike Grandke,Emma S Hoffmann,Tobias W Wolff,Georges P Schmartz,Andreas Keller
{"title":"SmallBARNA 2026: a kingdom-wide bacterial sRNA resource.","authors":"Shusruto Rishik,Leidy Alejandra G Molano,Syed Mohammed Khalid,Tiniko Babalashvili,Fadlilah Nur Hasanah,Md Mobashir Rahman,Pascal Hirsch,Friederike Grandke,Emma S Hoffmann,Tobias W Wolff,Georges P Schmartz,Andreas Keller","doi":"10.1093/nar/gkaf999","DOIUrl":null,"url":null,"abstract":"Bacterial small RNA are important context-sensitive regulators of gene expression, especially in highly pathogenic bacteria, often controlling virulence. The number of predicted small RNA (sRNA) entries in public repositories has grown exponentially, contrasting with the linear growth of functionally validated sRNAs. While there are databases maintaining sRNA records from single bacterial species or taxonomic groups, a central repository of bona fide sRNAs for all bacteria with evidence, alignment, and RNA expression information is missing. Such a repository is a critical starting point for both wet lab biologists validating sRNA function as well as bioinformaticians creating new models for sRNA prediction. In this paper, we hand-curate 1117 articles from the literature to find 746 sRNAs that have been confirmed by northern blotting, quantitative polymerase chain reaction (qPCR), mutagenesis, or other functional validation methods. We map these sRNA sequences to QC-filtered bacterial genomic assemblies from NCBI, obtaining 3.8 million hits from 44 789 chromosomes and 10 884 plasmids. Finally, we also quantify these sRNAs in a filtered subset of 5292 isolates with available RNA-seq data from the Sequence Read Archive. The bona fide set, alignment, and expression information is available for download and interactive exploration at https://web.ccb.uni-saarland.de/smallbarna/.","PeriodicalId":19471,"journal":{"name":"Nucleic Acids Research","volume":"52 1","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nucleic Acids Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/nar/gkaf999","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Bacterial small RNA are important context-sensitive regulators of gene expression, especially in highly pathogenic bacteria, often controlling virulence. The number of predicted small RNA (sRNA) entries in public repositories has grown exponentially, contrasting with the linear growth of functionally validated sRNAs. While there are databases maintaining sRNA records from single bacterial species or taxonomic groups, a central repository of bona fide sRNAs for all bacteria with evidence, alignment, and RNA expression information is missing. Such a repository is a critical starting point for both wet lab biologists validating sRNA function as well as bioinformaticians creating new models for sRNA prediction. In this paper, we hand-curate 1117 articles from the literature to find 746 sRNAs that have been confirmed by northern blotting, quantitative polymerase chain reaction (qPCR), mutagenesis, or other functional validation methods. We map these sRNA sequences to QC-filtered bacterial genomic assemblies from NCBI, obtaining 3.8 million hits from 44 789 chromosomes and 10 884 plasmids. Finally, we also quantify these sRNAs in a filtered subset of 5292 isolates with available RNA-seq data from the Sequence Read Archive. The bona fide set, alignment, and expression information is available for download and interactive exploration at https://web.ccb.uni-saarland.de/smallbarna/.
期刊介绍:
Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.