Samuel C Epstein, Louise K Charkoudian, Marnix H Medema
{"title":"A standardized workflow for submitting data to the Minimum Information about a Biosynthetic Gene cluster (MIBiG) repository: prospects for research-based educational experiences.","authors":"Samuel C Epstein, Louise K Charkoudian, Marnix H Medema","doi":"10.1186/s40793-018-0318-y","DOIUrl":"10.1186/s40793-018-0318-y","url":null,"abstract":"<p><p>Microorganisms utilize complex enzymatic pathways to biosynthesize structurally complex and pharmacologically relevant molecules. These pathways are encoded by gene clusters and are found in a diverse set of organisms. The Minimum Information about a Biosynthetic Gene cluster repository facilitates standardized and centralized storage of experimental data on these gene clusters and their molecular products, by utilizing user-submitted data to translate scientific discoveries into a format that can be analyzed computationally. This accelerates the processes of connecting genes to chemical structures, understanding biosynthetic gene clusters in the context of environmental diversity, and performing computer-assisted design of synthetic gene clusters. Here, we present a Standard Operating Procedure, Excel templates, a tutorial video, and a collection of relevant review literature to support scientists in their efforts to submit data into MiBIG. Further, we provide tools to integrate gene cluster annotation projects into the classroom environment, including workflows and assessment materials.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"16"},"PeriodicalIF":0.0,"publicationDate":"2018-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0318-y","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36312291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Short genome report of cellulose-producing commensal <i>Escherichia coli</i> 1094.","authors":"Joaquin Bernal-Bayard, Laura Gomez-Valero, Aimee Wessel, Varun Khanna, Christiane Bouchier, Jean-Marc Ghigo","doi":"10.1186/s40793-018-0316-0","DOIUrl":"https://doi.org/10.1186/s40793-018-0316-0","url":null,"abstract":"<p><p>Bacterial surface colonization and biofilm formation often rely on the production of an extracellular polymeric matrix that mediates cell-cell and cell-surface contacts. In <i>Escherichia coli</i> and many <i>Betaproteobacteria</i> and <i>Gammaproteobacteria</i> cellulose is often the main component of the extracellular matrix. Here we report the complete genome sequence of the cellulose producing strain <i>E. coli</i> 1094 and compare it with five other closely related genomes within <i>E. coli</i> phylogenetic group A. We present a comparative analysis of the regions encoding genes responsible for cellulose biosynthesis and discuss the changes that could have led to the loss of this important adaptive advantage in several <i>E. coli</i> strains. Data deposition: The annotated genome sequence has been deposited at the European Nucleotide Archive under the accession number PRJEB21000.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"13"},"PeriodicalIF":0.0,"publicationDate":"2018-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0316-0","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36098897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Samuel M Luedin, Joël F Pothier, Francesco Danza, Nicola Storelli, Niels-Ulrik Frigaard, Matthias Wittwer, Mauro Tonolla
{"title":"Complete genome sequence of \"<i>Thiodictyon syntrophicum</i>\" sp. nov. strain Cad16<sup>T</sup>, a photolithoautotrophic purple sulfur bacterium isolated from the alpine meromictic Lake Cadagno.","authors":"Samuel M Luedin, Joël F Pothier, Francesco Danza, Nicola Storelli, Niels-Ulrik Frigaard, Matthias Wittwer, Mauro Tonolla","doi":"10.1186/s40793-018-0317-z","DOIUrl":"10.1186/s40793-018-0317-z","url":null,"abstract":"<p><p>\"<i>Thiodictyon syntrophicum\"</i> sp. nov. strain Cad16<sup>T</sup> is a photoautotrophic purple sulfur bacterium belonging to the family of <i>Chromatiaceae</i> in the class of <i>Gammaproteobacteria</i>. The type strain Cad16<sup>T</sup> was isolated from the chemocline of the alpine meromictic Lake Cadagno in Switzerland. Strain Cad16<sup>T</sup> represents a key species within this sulfur-driven bacterial ecosystem with respect to carbon fixation. The 7.74-Mbp genome of strain Cad16<sup>T</sup> has been sequenced and annotated. It encodes 6237 predicted protein sequences and 59 RNA sequences. Phylogenetic comparison based on 16S rRNA revealed that <i>Thiodictyon elegans</i> strain DSM 232<sup>T</sup> the most closely related species. Genes involved in sulfur oxidation, central carbon metabolism and transmembrane transport were found. Noteworthy, clusters of genes encoding the photosynthetic machinery and pigment biosynthesis are found on the 0.48 Mb plasmid pTs485. We provide a detailed insight into the Cad16<sup>T</sup> genome and analyze it in the context of the microbial ecosystem of Lake Cadagno.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"14"},"PeriodicalIF":0.0,"publicationDate":"2018-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5944118/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36109272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
David Alvarez-Ponce, Chava L Weitzman, Richard L Tillett, Franziska C Sandmeier, C Richard Tracy
{"title":"High quality draft genome sequences of <i>Mycoplasma agassizii</i> strains PS6<sup>T</sup> and 723 isolated from <i>Gopherus</i> tortoises with upper respiratory tract disease.","authors":"David Alvarez-Ponce, Chava L Weitzman, Richard L Tillett, Franziska C Sandmeier, C Richard Tracy","doi":"10.1186/s40793-018-0315-1","DOIUrl":"https://doi.org/10.1186/s40793-018-0315-1","url":null,"abstract":"<p><p><i>Mycoplasma agassizii</i> is one of the known causative agents of upper respiratory tract disease (URTD) in Mojave desert tortoises (<i>Gopherus agassizii</i>) and in gopher tortoises (<i>Gopherus polyphemus</i>). We sequenced the genomes of <i>M. agassizii</i> strains PS6<sup>T</sup> (ATCC 700616) and 723 (ATCC 700617) isolated from the upper respiratory tract of a Mojave desert tortoise and a gopher tortoise, respectively, both with signs of URTD. The PS6<sup>T</sup> genome assembly was organized in eight scaffolds, had a total length of 1,274,972 bp, a G + C content of 28.43%, and contained 979 protein-coding genes, 13 pseudogenes and 35 RNA genes. The 723 genome assembly was organized in 40 scaffolds, had a total length of 1,211,209 bp, a G + C content of 28.34%, and contained 955 protein-coding genes, seven pseudogenes, and 35 RNA genes. Both genomes exhibit a very similar organization and very similar numbers of genes in each functional category. Pairs of orthologous genes encode proteins that are 93.57% identical on average. Homology searches identified a putative cytadhesin. These genomes will enable studies that will help understand the molecular bases of pathogenicity of this and other <i>Mycoplasma</i> species.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"12"},"PeriodicalIF":0.0,"publicationDate":"2018-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0315-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36068416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Strategies for high-altitude adaptation revealed from high-quality draft genome of non-violacein producing <i>Janthinobacterium lividum</i> ERGS5:01.","authors":"Rakshak Kumar, Vishal Acharya, Dharam Singh, Sanjay Kumar","doi":"10.1186/s40793-018-0313-3","DOIUrl":"10.1186/s40793-018-0313-3","url":null,"abstract":"<p><p>A light pink coloured bacterial strain ERGS5:01 isolated from glacial stream water of Sikkim Himalaya was affiliated to <i>Janthinobacterium lividum</i> based on 16S rRNA gene sequence identity and phylogenetic clustering. Whole genome sequencing was performed for the strain to confirm its taxonomy as it lacked the typical violet pigmentation of the genus and also to decipher its survival strategy at the aquatic ecosystem of high elevation. The PacBio RSII sequencing generated genome of 5,168,928 bp with 4575 protein-coding genes and 118 RNA genes. Whole genome-based multilocus sequence analysis clustering, in silico DDH similarity value of 95.1% and, the ANI value of 99.25% established the identity of the strain ERGS5:01 (MCC 2953) as a non-violacein producing <i>J. lividum</i>. The genome comparisons across genus <i>Janthinobacterium</i> revealed an open pan-genome with the scope of the addition of new orthologous cluster to complete the genomic inventory. The genomic insight provided the genetic basis of freezing and frequent freeze-thaw cycle tolerance and, for industrially important enzymes. Extended insight into the genome provided clues of crucial genes associated with adaptation in the harsh aquatic ecosystem of high altitude.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2018-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5909252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36064467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alejandro Mateos-Rivera, Tajul Islam, Ian P G Marshall, Lars Schreiber, Lise Øvreås
{"title":"High-quality draft genome of the methanotroph <i>Methylovulum psychrotolerans</i> Str. HV10-M2 isolated from plant material at a high-altitude environment.","authors":"Alejandro Mateos-Rivera, Tajul Islam, Ian P G Marshall, Lars Schreiber, Lise Øvreås","doi":"10.1186/s40793-018-0314-2","DOIUrl":"https://doi.org/10.1186/s40793-018-0314-2","url":null,"abstract":"<p><p>Here we present the genome of <i>Methylovulum psychrotolerans</i> strain HV10-M2, a methanotroph isolated from Hardangervidda national park (Norway). This strain represents the second of the two validly published species genus with a sequenced genome. The other is <i>M. miyakonense</i> HT12, which is the type strain of the species and the type species of the genus <i>Methylovulum.</i> We present the genome of <i>M. psychrotolerants</i> str. HV10-M2 and discuss the differences between <i>M. psychrotolerans</i> and <i>M. miyakonense</i>. The genome size of <i>M. psychrotolerans</i> str. HV10-M2 is 4,923,400 bp and contains 4415 protein-coding genes, 50 RNA genes and an average GC content of 50.88%.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2018-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0314-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36036182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chava L Weitzman, Richard L Tillett, Franziska C Sandmeier, C Richard Tracy, David Alvarez-Ponce
{"title":"High quality draft genome sequence of <i>Mycoplasma testudineum</i> strain BH29<sup>T</sup>, isolated from the respiratory tract of a desert tortoise.","authors":"Chava L Weitzman, Richard L Tillett, Franziska C Sandmeier, C Richard Tracy, David Alvarez-Ponce","doi":"10.1186/s40793-018-0309-z","DOIUrl":"https://doi.org/10.1186/s40793-018-0309-z","url":null,"abstract":"<p><p><i>Mycoplasma testudineum</i> is one of the pathogens that can cause upper respiratory tract disease in desert tortoises, <i>Gopherus agassizii</i>. We sequenced the genome of <i>M. testudineum</i> BH29<sup>T</sup> (ATCC 700618<sup>T</sup> = MCCM 03231<sup>T</sup>), isolated from the upper respiratory tract of a Mojave desert tortoise with upper respiratory tract disease. The sequenced draft genome, organized in 25 scaffolds, has a length of 960,895 bp and a G + C content of 27.54%. A total of 788 protein-coding sequences, six pseudogenes and 35 RNA genes were identified. The potential presence of cytadhesin-encoding genes is investigated. This genome will enable comparative genomic studies to help understand the molecular bases of the pathogenicity of this and other <i>Mycoplasma</i> species.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"9"},"PeriodicalIF":0.0,"publicationDate":"2018-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0309-z","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36034277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Genome sequence of <i>Planktotalea frisia</i> type strain (SH6-1<sup>T</sup>), a representative of the <i>Roseobacter</i> group isolated from the North Sea during a phytoplankton bloom.","authors":"Insa Bakenhus, Sonja Voget, Anja Poehlein, Thorsten Brinkhoff, Rolf Daniel, Meinhard Simon","doi":"10.1186/s40793-018-0311-5","DOIUrl":"https://doi.org/10.1186/s40793-018-0311-5","url":null,"abstract":"<p><p><i>Planktotalea frisia</i> SH6-1<sup>T</sup> Hahnke et al. (Int J Syst Evol Microbiol 62:1619-24, 2012) is a planktonic marine bacterium isolated during a phytoplankton bloom from the southern North Sea. It belongs to the <i>Roseobacter</i> group within the alphaproteobacterial family <i>Rhodobacteraceae</i>. Here we describe the draft genome sequence and annotation of the type strain SH6-1<sup>T</sup>. The genome comprises 4,106,736 bp and contains 4128 protein-coding and 38 RNA genes. The draft genome sequence provides evidence for at least three extrachromosomal elements, encodes genes for DMSP utilization, quorum sensing, photoheterotrophy and a type IV secretion system. This indicates not only adaptation to a free-living lifestyle of <i>P. frisia</i> but points also to interactions with prokaryotic or eukaryotic organisms.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2018-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40793-018-0311-5","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36031445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Draft genome sequence of <i>Fermentimonas caenicola</i> strain SIT8, isolated from the human gut.","authors":"Mamadou Beye, Sofiane Bakour, Sory Ibrahima Traore, Jaishriram Rathored, Noémie Labas, Didier Raoult, Pierre-Edouard Fournier","doi":"10.1186/s40793-018-0310-6","DOIUrl":"10.1186/s40793-018-0310-6","url":null,"abstract":"<p><p>We report the properties of a draft genome sequence of the bacterium <i>Fermentimonas caenicola</i> strain SIT8 (= CSUR P1560). This strain, whose genome is described here, was isolated from the fecal flora of a healthy 28-month-old Senegalese boy. Strain SIT8 is a facultatively anaerobic Gram-negative bacillus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,824,451-bp long (1 chromosome but no plasmid) contains 2354 protein-coding and 46 RNA genes, including four rRNA genes.</p>","PeriodicalId":21965,"journal":{"name":"Standards in Genomic Sciences","volume":"13 ","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2018-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896035/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36034276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}