EPSD 2.0: An Updated Database of Protein Phosphorylation Sites Across Eukaryotic Species.

IF 7.9
Miaomiao Chen, Yujie Gou, Ming Lei, Leming Xiao, Miaoying Zhao, Xinhe Huang, Dan Liu, Zihao Feng, Di Peng, Yu Xue
{"title":"EPSD 2.0: An Updated Database of Protein Phosphorylation Sites Across Eukaryotic Species.","authors":"Miaomiao Chen, Yujie Gou, Ming Lei, Leming Xiao, Miaoying Zhao, Xinhe Huang, Dan Liu, Zihao Feng, Di Peng, Yu Xue","doi":"10.1093/gpbjnl/qzaf057","DOIUrl":null,"url":null,"abstract":"<p><p>As one of the most crucial post-translational modifications, protein phosphorylation regulates a broad range of biological processes in eukaryotes. Biocuration, integration, and annotation of reported phosphorylation events will deliver a valuable resource for the community. Here, we present an updated database, the eukaryotic phosphorylation site database 2.0 (EPSD 2.0), which includes 2,769,163 experimentally identified phosphorylation sites (p-sites) in 362,707 phosphoproteins from 223 eukaryotes. From the literature, 873,718 new p-sites identified through high-throughput phosphoproteomic research were first collected, and 1,078,888 original phosphopeptides together with primary references were reserved. Then, this dataset was merged into EPSD 1.0, comprising 1,616,804 p-sites within 209,326 proteins across 68 eukaryotic organisms. We also integrated 362,190 additional known p-sites from 10 public databases. After redundancy clearance, we manually re-checked each p-site and annotated 88,074 functional events for 32,762 p-sites, covering 58 types of downstream effects on phosphoproteins, and regulatory impacts on 107 biological processes. In addition, phosphoproteins and p-sites in 8 model organisms were meticulously annotated utilizing information supplied by 100 external platforms encompassing 15 areas. These areas included kinase/phosphatase, transcription regulators, three-dimensional structures, physicochemical characteristics, genomic variations, functional descriptions, protein domains, molecular interactions, drug-target associations, disease-related data, orthologs, transcript expression levels, proteomics, subcellular localization, and regulatory pathways. We expect that EPSD 2.0 will become a useful database supporting comprehensive studies on phosphorylation in eukaryotes. The EPSD 2.0 database is freely accessible online at https://epsd.biocuckoo.cn/.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12448286/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics, proteomics & bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gpbjnl/qzaf057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

As one of the most crucial post-translational modifications, protein phosphorylation regulates a broad range of biological processes in eukaryotes. Biocuration, integration, and annotation of reported phosphorylation events will deliver a valuable resource for the community. Here, we present an updated database, the eukaryotic phosphorylation site database 2.0 (EPSD 2.0), which includes 2,769,163 experimentally identified phosphorylation sites (p-sites) in 362,707 phosphoproteins from 223 eukaryotes. From the literature, 873,718 new p-sites identified through high-throughput phosphoproteomic research were first collected, and 1,078,888 original phosphopeptides together with primary references were reserved. Then, this dataset was merged into EPSD 1.0, comprising 1,616,804 p-sites within 209,326 proteins across 68 eukaryotic organisms. We also integrated 362,190 additional known p-sites from 10 public databases. After redundancy clearance, we manually re-checked each p-site and annotated 88,074 functional events for 32,762 p-sites, covering 58 types of downstream effects on phosphoproteins, and regulatory impacts on 107 biological processes. In addition, phosphoproteins and p-sites in 8 model organisms were meticulously annotated utilizing information supplied by 100 external platforms encompassing 15 areas. These areas included kinase/phosphatase, transcription regulators, three-dimensional structures, physicochemical characteristics, genomic variations, functional descriptions, protein domains, molecular interactions, drug-target associations, disease-related data, orthologs, transcript expression levels, proteomics, subcellular localization, and regulatory pathways. We expect that EPSD 2.0 will become a useful database supporting comprehensive studies on phosphorylation in eukaryotes. The EPSD 2.0 database is freely accessible online at https://epsd.biocuckoo.cn/.

EPSD 2.0:一个更新的真核生物物种蛋白质磷酸化位点数据库。
蛋白磷酸化作为最重要的翻译后修饰(PTMs)之一,调控着真核生物广泛的生物过程。报道的磷酸化事件的生物记录、整合和注释将为社区提供宝贵的资源。在这里,我们提出了一个更新的数据库,真核磷酸化位点数据库2.0 (EPSD 2.0),其中包括2,769,163个实验鉴定的磷酸化位点(p位点),来自223个真核生物的362,707个磷酸化蛋白。从文献中,首次收集到高通量磷蛋白组学研究鉴定的873718个新p位点,保留了1,078,888个原始磷酸肽和主要参考文献。然后,将该数据集合并到EPSD 1.0中,包括68个真核生物[1]的209,326个蛋白质中的1,616,804个p位点。我们还整合了来自10个公共数据库的362,190个已知p-site。清除冗余后,我们手动重新检查了每个p位点,并注释了32,762个p位点的88,074个功能事件,涵盖了58种对磷酸化蛋白的下游作用,以及对107个生物过程的调节影响。此外,利用涵盖15个领域的100个外部平台提供的信息,对8种模式生物的磷酸化蛋白和p位点进行了细致的注释。这些领域包括激酶/磷酸酶、转录调节剂、三维结构、物理化学特征、基因组变异、功能描述、蛋白质结构域、分子相互作用、药物靶标关联、疾病相关数据、同源物、转录物表达水平、蛋白质组学、亚细胞定位和调控途径。我们期望EPSD 2.0能够成为一个支持真核生物磷酸化综合研究的有用数据库。环保署2.0数据库可在网上免费查阅,网址为https://epsd.biocuckoo.cn/。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信