Y. Kawarabayasi, Y. Kawarabayasi, Y. Hino, H. Horikawa, Koji Jin-no, Mikio Takahashi, M. Sekine, S. Baba, Akiho Ankai, H. Kosugi, A. Hosoyama, Shigehiro Fukui, Y. Nagai, Keiko Nishijima, R. Otsuka, H. Nakazawa, M. Takamiya, Y. Kato, Takio Yoshizawa, Toshihiro Tanaka, Y. Kudoh, J. Yamazaki, N. Kushida, A. Oguchi, Ken-ichi Aoki, S. Masuda, Masao Yanagii, Masami Nishimura, A. Yamagishi, Tairo Oshima, H. Kikuchi
{"title":"嗜热嗜酸好氧绿古菌tokodaisulfolobus菌株7的全基因组序列。","authors":"Y. Kawarabayasi, Y. Kawarabayasi, Y. Hino, H. Horikawa, Koji Jin-no, Mikio Takahashi, M. Sekine, S. Baba, Akiho Ankai, H. Kosugi, A. Hosoyama, Shigehiro Fukui, Y. Nagai, Keiko Nishijima, R. Otsuka, H. Nakazawa, M. Takamiya, Y. Kato, Takio Yoshizawa, Toshihiro Tanaka, Y. Kudoh, J. Yamazaki, N. Kushida, A. Oguchi, Ken-ichi Aoki, S. Masuda, Masao Yanagii, Masami Nishimura, A. Yamagishi, Tairo Oshima, H. Kikuchi","doi":"10.1093/DNARES/8.4.123","DOIUrl":null,"url":null,"abstract":"The complete genomic sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7 which optimally grows at 80 degrees C, at low pH, and under aerobic conditions, has been determined by the whole genome shotgun method with slight modifications. The genomic size was 2,694,756 bp long and the G + C content was 32.8%. The following RNA-coding genes were identified: a single 16S-23S rRNA cluster, one 5S rRNA gene and 46 tRNA genes (including 24 intron-containing tRNA genes). The repetitive sequences identified were SR-type repetitive sequences, long dispersed-type repetitive sequences and Tn-like repetitive elements. The genome contained 2826 potential protein-coding regions (open reading frames, ORFs). By similarity search against public databases, 911 (32.2%) ORFs were related to functional assigned genes, 921 (32.6%) were related to conserved ORFs of unknown function, 145 (5.1%) contained some motifs, and remaining 849 (30.0%) did not show any significant similarity to the registered sequences. The ORFs with functional assignments included the candidate genes involved in sulfide metabolism, the TCA cycle and the respiratory chain. Sequence comparison provided evidence suggesting the integration of plasmid, rearrangement of genomic structure, and duplication of genomic regions that may be responsible for the larger genomic size of the S. tokodaii strain7 genome. The genome contained eukaryote-type genes which were not identified in other archaea and lacked the CCA sequence in the tRNA genes. The result suggests that this strain is closer to eukaryotes among the archaea strains so far sequenced. The data presented in this paper are also available on the internet homepage (http://www.bio.nite.go.jp/E-home/genome_list-e.html/).","PeriodicalId":11212,"journal":{"name":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","volume":"83 1","pages":"123-40"},"PeriodicalIF":0.0000,"publicationDate":"2001-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"312","resultStr":"{\"title\":\"Complete genome sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7.\",\"authors\":\"Y. Kawarabayasi, Y. Kawarabayasi, Y. Hino, H. Horikawa, Koji Jin-no, Mikio Takahashi, M. Sekine, S. Baba, Akiho Ankai, H. Kosugi, A. Hosoyama, Shigehiro Fukui, Y. Nagai, Keiko Nishijima, R. Otsuka, H. Nakazawa, M. Takamiya, Y. Kato, Takio Yoshizawa, Toshihiro Tanaka, Y. Kudoh, J. Yamazaki, N. Kushida, A. Oguchi, Ken-ichi Aoki, S. Masuda, Masao Yanagii, Masami Nishimura, A. Yamagishi, Tairo Oshima, H. Kikuchi\",\"doi\":\"10.1093/DNARES/8.4.123\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The complete genomic sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7 which optimally grows at 80 degrees C, at low pH, and under aerobic conditions, has been determined by the whole genome shotgun method with slight modifications. The genomic size was 2,694,756 bp long and the G + C content was 32.8%. The following RNA-coding genes were identified: a single 16S-23S rRNA cluster, one 5S rRNA gene and 46 tRNA genes (including 24 intron-containing tRNA genes). The repetitive sequences identified were SR-type repetitive sequences, long dispersed-type repetitive sequences and Tn-like repetitive elements. The genome contained 2826 potential protein-coding regions (open reading frames, ORFs). By similarity search against public databases, 911 (32.2%) ORFs were related to functional assigned genes, 921 (32.6%) were related to conserved ORFs of unknown function, 145 (5.1%) contained some motifs, and remaining 849 (30.0%) did not show any significant similarity to the registered sequences. The ORFs with functional assignments included the candidate genes involved in sulfide metabolism, the TCA cycle and the respiratory chain. Sequence comparison provided evidence suggesting the integration of plasmid, rearrangement of genomic structure, and duplication of genomic regions that may be responsible for the larger genomic size of the S. tokodaii strain7 genome. The genome contained eukaryote-type genes which were not identified in other archaea and lacked the CCA sequence in the tRNA genes. The result suggests that this strain is closer to eukaryotes among the archaea strains so far sequenced. The data presented in this paper are also available on the internet homepage (http://www.bio.nite.go.jp/E-home/genome_list-e.html/).\",\"PeriodicalId\":11212,\"journal\":{\"name\":\"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes\",\"volume\":\"83 1\",\"pages\":\"123-40\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"312\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/DNARES/8.4.123\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/DNARES/8.4.123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 312
摘要
在80℃、低pH、好氧条件下最适宜生长的嗜热嗜酸嗜绿古菌Sulfolobus tokodaii菌株7用全基因组霰弹枪法测定了其完整的基因组序列,并进行了轻微的修改。基因组长2,694,756 bp, G + C含量为32.8%。鉴定出以下rna编码基因:1个16S-23S rRNA簇、1个5S rRNA基因和46个tRNA基因(其中含内含子tRNA基因24个)。鉴定出的重复序列有sr型重复序列、长分散型重复序列和n型重复元件。基因组包含2826个潜在的蛋白质编码区(开放阅读框,orf)。通过对公开数据库的相似性检索,发现与功能指定基因相关的orf有911个(32.2%),与功能未知的保守orf相关的orf有921个(32.6%),145个(5.1%)含有一定的基序,其余849个(30.0%)与已登记序列无显著的相似性。具有功能分配的orf包括参与硫化物代谢、TCA循环和呼吸链的候选基因。序列比较表明,质粒的整合、基因组结构的重排和基因组区域的重复可能是tokodais菌株7基因组较大的原因。该基因组含有真核生物型基因,这些基因在其他古细菌中未被发现,并且缺乏tRNA基因的CCA序列。结果表明,在迄今测序的古生菌菌株中,该菌株更接近真核生物。本文中提供的数据也可在互联网主页(http://www.bio.nite.go.jp/E-home/genome_list-e.html/)上获得。
Complete genome sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7.
The complete genomic sequence of an aerobic thermoacidophilic crenarchaeon, Sulfolobus tokodaii strain7 which optimally grows at 80 degrees C, at low pH, and under aerobic conditions, has been determined by the whole genome shotgun method with slight modifications. The genomic size was 2,694,756 bp long and the G + C content was 32.8%. The following RNA-coding genes were identified: a single 16S-23S rRNA cluster, one 5S rRNA gene and 46 tRNA genes (including 24 intron-containing tRNA genes). The repetitive sequences identified were SR-type repetitive sequences, long dispersed-type repetitive sequences and Tn-like repetitive elements. The genome contained 2826 potential protein-coding regions (open reading frames, ORFs). By similarity search against public databases, 911 (32.2%) ORFs were related to functional assigned genes, 921 (32.6%) were related to conserved ORFs of unknown function, 145 (5.1%) contained some motifs, and remaining 849 (30.0%) did not show any significant similarity to the registered sequences. The ORFs with functional assignments included the candidate genes involved in sulfide metabolism, the TCA cycle and the respiratory chain. Sequence comparison provided evidence suggesting the integration of plasmid, rearrangement of genomic structure, and duplication of genomic regions that may be responsible for the larger genomic size of the S. tokodaii strain7 genome. The genome contained eukaryote-type genes which were not identified in other archaea and lacked the CCA sequence in the tRNA genes. The result suggests that this strain is closer to eukaryotes among the archaea strains so far sequenced. The data presented in this paper are also available on the internet homepage (http://www.bio.nite.go.jp/E-home/genome_list-e.html/).