EPSD 2.0: An Updated Database of Protein Phosphorylation Sites Across Eukaryotic Species.

Miaomiao Chen, Yujie Gou, Ming Lei, Leming Xiao, Miaoying Zhao, Xinhe Huang, Dan Liu, Zihao Feng, Di Peng, Yu Xue
{"title":"EPSD 2.0: An Updated Database of Protein Phosphorylation Sites Across Eukaryotic Species.","authors":"Miaomiao Chen, Yujie Gou, Ming Lei, Leming Xiao, Miaoying Zhao, Xinhe Huang, Dan Liu, Zihao Feng, Di Peng, Yu Xue","doi":"10.1093/gpbjnl/qzaf057","DOIUrl":null,"url":null,"abstract":"<p><p>As one of the most crucial post-translational modifications (PTMs), protein phosphorylation regulates a broad range of biological processes in eukaryotes. Biocuration, integration, and annotation of reported phosphorylation events will deliver a valuable resource for the community. Here, we present an updated database, the eukaryotic phosphorylation site database 2.0 (EPSD 2.0), which includes 2,769,163 experimentally identified phosphorylation sites (p-sites) in 362,707 phosphoproteins from 223 eukaryotes. From the literature, 873,718 new p-sites identified through high-throughput phosphoproteomic research were first collected, and 1,078,888 original phosphopeptides together with primary references were reserved. Then, this dataset was merged into EPSD 1.0, comprising 1,616,804 p-sites within 209,326 proteins across 68 eukaryotic organisms [1]. We also integrated 362,190 additional known p-sites from 10 public databases. After redundancy clearance, we manually re-checked each p-site and annotated 88,074 functional events for 32,762 p-sites, covering 58 types of downstream effects on phosphoproteins, and regulatory impacts on 107 biological processes. In addition, phosphoproteins and p-sites in 8 model organisms were meticulously annotated utilizing information supplied by 100 external platforms encompassing 15 areas. These areas included kinase/phosphatase, transcription regulators, three-dimensional structures, physicochemical characteristics, genomic variations, functional descriptions, protein domains, molecular interactions, drug-target associations, disease-related data, orthologs, transcript expression levels, proteomics, subcellular localization, and regulatory pathways. We expect that EPSD 2.0 will become a useful database supporting comprehensive studies on phosphorylation in eukaryotes. The EPSD 2.0 database is freely accessible online at https://epsd.biocuckoo.cn/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics, proteomics & bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gpbjnl/qzaf057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As one of the most crucial post-translational modifications (PTMs), protein phosphorylation regulates a broad range of biological processes in eukaryotes. Biocuration, integration, and annotation of reported phosphorylation events will deliver a valuable resource for the community. Here, we present an updated database, the eukaryotic phosphorylation site database 2.0 (EPSD 2.0), which includes 2,769,163 experimentally identified phosphorylation sites (p-sites) in 362,707 phosphoproteins from 223 eukaryotes. From the literature, 873,718 new p-sites identified through high-throughput phosphoproteomic research were first collected, and 1,078,888 original phosphopeptides together with primary references were reserved. Then, this dataset was merged into EPSD 1.0, comprising 1,616,804 p-sites within 209,326 proteins across 68 eukaryotic organisms [1]. We also integrated 362,190 additional known p-sites from 10 public databases. After redundancy clearance, we manually re-checked each p-site and annotated 88,074 functional events for 32,762 p-sites, covering 58 types of downstream effects on phosphoproteins, and regulatory impacts on 107 biological processes. In addition, phosphoproteins and p-sites in 8 model organisms were meticulously annotated utilizing information supplied by 100 external platforms encompassing 15 areas. These areas included kinase/phosphatase, transcription regulators, three-dimensional structures, physicochemical characteristics, genomic variations, functional descriptions, protein domains, molecular interactions, drug-target associations, disease-related data, orthologs, transcript expression levels, proteomics, subcellular localization, and regulatory pathways. We expect that EPSD 2.0 will become a useful database supporting comprehensive studies on phosphorylation in eukaryotes. The EPSD 2.0 database is freely accessible online at https://epsd.biocuckoo.cn/.

EPSD 2.0:一个更新的真核生物物种蛋白质磷酸化位点数据库。
蛋白磷酸化作为最重要的翻译后修饰(PTMs)之一,调控着真核生物广泛的生物过程。报道的磷酸化事件的生物记录、整合和注释将为社区提供宝贵的资源。在这里,我们提出了一个更新的数据库,真核磷酸化位点数据库2.0 (EPSD 2.0),其中包括2,769,163个实验鉴定的磷酸化位点(p位点),来自223个真核生物的362,707个磷酸化蛋白。从文献中,首次收集到高通量磷蛋白组学研究鉴定的873718个新p位点,保留了1,078,888个原始磷酸肽和主要参考文献。然后,将该数据集合并到EPSD 1.0中,包括68个真核生物[1]的209,326个蛋白质中的1,616,804个p位点。我们还整合了来自10个公共数据库的362,190个已知p-site。清除冗余后,我们手动重新检查了每个p位点,并注释了32,762个p位点的88,074个功能事件,涵盖了58种对磷酸化蛋白的下游作用,以及对107个生物过程的调节影响。此外,利用涵盖15个领域的100个外部平台提供的信息,对8种模式生物的磷酸化蛋白和p位点进行了细致的注释。这些领域包括激酶/磷酸酶、转录调节剂、三维结构、物理化学特征、基因组变异、功能描述、蛋白质结构域、分子相互作用、药物靶标关联、疾病相关数据、同源物、转录物表达水平、蛋白质组学、亚细胞定位和调控途径。我们期望EPSD 2.0能够成为一个支持真核生物磷酸化综合研究的有用数据库。环保署2.0数据库可在网上免费查阅,网址为https://epsd.biocuckoo.cn/。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信