Oleg N Murashko, Connor Morgan-Lang, Chen-Hsin Albert Yu, Hsin-Nan Lin, Anna Chao Kaberdina, Shin-Yu Kung, Vladimir R Kaberdin, Sue Lin-Chao
{"title":"Comprehensive analysis of repetitive extragenic palindrome sequences identified in bacteria and archaea using a new web-based tool, RepRanger.","authors":"Oleg N Murashko, Connor Morgan-Lang, Chen-Hsin Albert Yu, Hsin-Nan Lin, Anna Chao Kaberdina, Shin-Yu Kung, Vladimir R Kaberdin, Sue Lin-Chao","doi":"10.1128/msphere.00124-25","DOIUrl":null,"url":null,"abstract":"<p><p>Repetitive extragenic palindromic (REP) sequences were first discovered in the intergenic regions of <i>Escherichia coli</i> and later found in phylogenetically distinct bacterial species. However, their biological roles and sequence conservation remain unclear. Considering the growing number of microbial genome sequencing and annotation projects, along with the lack of online tools for REP identification, we developed a new web-based platform, RepRanger, to rapidly identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified >4,000 REPs in the <i>E. coli</i> MG1655 genome. Moreover, 81 (~52%) of its 157 small noncoding RNAs (sRNAs; known to regulate gene expression) contain REPs. Further analysis using RepRanger, alongside other bioinformatics tools, yielded 10 REP consensus motifs. Notably, REPs potentially involved in translational control (i.e., located within 15 nucleotides downstream of an open reading frame) primarily host motif 4, whereas REPs present in sRNAs predominantly encoded motif 9. Putative targets of REP-containing sRNAs include genes that rewire metabolic pathways in response to environmental changes, indicating that REPs likely contribute to bacterial adaptation. We also reveal that REP consensus motifs are similar across pathogenic and environmental <i>E. coli</i> strains but not commensal and laboratory ones. We employed RepRanger to search for REPs and confirm that they are present across bacteria and archaea. Furthermore, we uncovered similarities in REP sequences between bacterial and archaeal genomes. Thus, we demonstrate that RepRanger represents a versatile tool for discovering REPs in bacteria and archaea, providing new insights into their functions and sequence diversity.IMPORTANCERepetitive extragenic palindromic (REP) sequences were first discovered in <i>Escherichia coli</i>, but their biological roles, diversity, and sequence conservation remain unclear. We have developed a web-based tool, RepRanger, to identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified approximately 4,000 REPs in the <i>E. coli</i> MG1655 genome. We show that >50% of small noncoding RNAs (sRNAs) contain REPs. The predicted functions of REP-containing sRNAs indicate that REPs likely contribute to bacterial environmental adaptability. In addition, we have discovered REPs in pathogenic, environmental, and commensal <i>E. coli,</i> allowing us to assess their sequence similarity. We show that REPs are widely present in bacterial and archaeal genomes and share some sequence similarities. Our comparison of REPs in annotated genomes broadens the current understanding of REP sequence diversity, conservation, and function.</p>","PeriodicalId":19052,"journal":{"name":"mSphere","volume":" ","pages":"e0012425"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"mSphere","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/msphere.00124-25","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Repetitive extragenic palindromic (REP) sequences were first discovered in the intergenic regions of Escherichia coli and later found in phylogenetically distinct bacterial species. However, their biological roles and sequence conservation remain unclear. Considering the growing number of microbial genome sequencing and annotation projects, along with the lack of online tools for REP identification, we developed a new web-based platform, RepRanger, to rapidly identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified >4,000 REPs in the E. coli MG1655 genome. Moreover, 81 (~52%) of its 157 small noncoding RNAs (sRNAs; known to regulate gene expression) contain REPs. Further analysis using RepRanger, alongside other bioinformatics tools, yielded 10 REP consensus motifs. Notably, REPs potentially involved in translational control (i.e., located within 15 nucleotides downstream of an open reading frame) primarily host motif 4, whereas REPs present in sRNAs predominantly encoded motif 9. Putative targets of REP-containing sRNAs include genes that rewire metabolic pathways in response to environmental changes, indicating that REPs likely contribute to bacterial adaptation. We also reveal that REP consensus motifs are similar across pathogenic and environmental E. coli strains but not commensal and laboratory ones. We employed RepRanger to search for REPs and confirm that they are present across bacteria and archaea. Furthermore, we uncovered similarities in REP sequences between bacterial and archaeal genomes. Thus, we demonstrate that RepRanger represents a versatile tool for discovering REPs in bacteria and archaea, providing new insights into their functions and sequence diversity.IMPORTANCERepetitive extragenic palindromic (REP) sequences were first discovered in Escherichia coli, but their biological roles, diversity, and sequence conservation remain unclear. We have developed a web-based tool, RepRanger, to identify and annotate putative palindromic elements, including REPs. Using RepRanger, we identified approximately 4,000 REPs in the E. coli MG1655 genome. We show that >50% of small noncoding RNAs (sRNAs) contain REPs. The predicted functions of REP-containing sRNAs indicate that REPs likely contribute to bacterial environmental adaptability. In addition, we have discovered REPs in pathogenic, environmental, and commensal E. coli, allowing us to assess their sequence similarity. We show that REPs are widely present in bacterial and archaeal genomes and share some sequence similarities. Our comparison of REPs in annotated genomes broadens the current understanding of REP sequence diversity, conservation, and function.
期刊介绍:
mSphere™ is a multi-disciplinary open-access journal that will focus on rapid publication of fundamental contributions to our understanding of microbiology. Its scope will reflect the immense range of fields within the microbial sciences, creating new opportunities for researchers to share findings that are transforming our understanding of human health and disease, ecosystems, neuroscience, agriculture, energy production, climate change, evolution, biogeochemical cycling, and food and drug production. Submissions will be encouraged of all high-quality work that makes fundamental contributions to our understanding of microbiology. mSphere™ will provide streamlined decisions, while carrying on ASM''s tradition for rigorous peer review.