COG database update 2024.

IF 16.6 | Zone 2 (Biology) | JCR Q1, Biochemistry & Molecular Biology
Michael Y Galperin, Roberto Vera Alvarez, Svetlana Karamycheva, Kira S Makarova, Yuri I Wolf, David Landsman, Eugene V Koonin
Nucleic Acids Research. Journal Article. Published 2024-11-04. DOI: 10.1093/nar/gkae983 (https://doi.org/10.1093/nar/gkae983)

Abstract

The Clusters of Orthologous Genes (COG) database, originally created in 1997, has been updated to reflect the constantly growing collection of completely sequenced prokaryotic genomes. This update increased the genome coverage from 1309 to 2296 species, including 2103 bacteria and 193 archaea, in most cases with a single representative genome per genus. This set covers all genera of bacteria and archaea that included organisms with 'complete genomes' as per NCBI databases in November 2023. The number of COGs has been expanded from 4877 to 4981, primarily by including protein families involved in bacterial protein secretion. Accordingly, COG pathways and functional groups now include secretion systems of types II through X, as well as Flp/Tad and type IV pili. These groupings allow straightforward identification and examination of the prokaryotic lineages that encompass, or lack, a particular secretion system. Other developments include improved annotations for the rRNA and tRNA modification proteins, multi-domain signal transduction proteins, and some previously uncharacterized protein families. The new version of COGs is available at https://www.ncbi.nlm.nih.gov/research/COG, as well as on the NCBI FTP site https://ftp.ncbi.nlm.nih.gov/pub/COG/, which also provides archived data from previous COG releases.
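The figures quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the variable and function names are hypothetical, and the only inputs are the numbers stated in the abstract itself.

```python
# Figures taken directly from the abstract; names are illustrative only.
GENOMES_BEFORE = 1309
GENOMES_AFTER = 2296
BACTERIA = 2103
ARCHAEA = 193
COGS_BEFORE = 4877
COGS_AFTER = 4981


def summarize_update():
    """Derive simple totals and differences from the quoted figures."""
    return {
        # Bacteria plus archaea should equal the stated total coverage.
        "total_matches": BACTERIA + ARCHAEA == GENOMES_AFTER,
        # Species added to the genome set by this update.
        "genomes_added": GENOMES_AFTER - GENOMES_BEFORE,
        # COGs added by this update.
        "cogs_added": COGS_AFTER - COGS_BEFORE,
        # Share of archaea in the new genome set, in percent.
        "archaea_share_pct": round(100 * ARCHAEA / GENOMES_AFTER, 1),
    }


if __name__ == "__main__":
    for key, value in summarize_update().items():
        print(f"{key}: {value}")
```

By these numbers, the update adds 987 species and 104 COGs, and the bacterial and archaeal counts sum to the stated total of 2296.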

Source journal
Nucleic Acids Research (Biology: Biochemistry and Molecular Biology)
CiteScore: 27.10
Self-citation rate: 4.70%
Articles published: 1057
Review time: 2 months
Journal description: Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.