DeepMVP:经过高质量数据训练的深度学习模型可以准确预测PTM位点和变异引起的变化。

IF 32.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Bo Wen, Chenwei Wang, Kai Li, Ping Han, Matthew V. Holt, Sara R. Savage, Jonathan T. Lei, Yongchao Dou, Zhiao Shi, Yi Li, Bing Zhang
{"title":"DeepMVP:经过高质量数据训练的深度学习模型可以准确预测PTM位点和变异引起的变化。","authors":"Bo Wen, Chenwei Wang, Kai Li, Ping Han, Matthew V. Holt, Sara R. Savage, Jonathan T. Lei, Yongchao Dou, Zhiao Shi, Yi Li, Bing Zhang","doi":"10.1038/s41592-025-02797-x","DOIUrl":null,"url":null,"abstract":"Post-translational modifications (PTMs) are critical regulators of protein function, and their disruption is a key mechanism by which missense variants contribute to disease. Accurate PTM site prediction using deep learning can help identify PTM-altering variants, but progress has been limited by the lack of large, high-quality training datasets. Here, we introduce PTMAtlas, a curated compendium of 397,524 PTM sites generated through systematic reprocessing of 241 public mass-spectrometry datasets, and DeepMVP, a deep learning framework trained on PTMAtlas to predict PTM sites for phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation. DeepMVP substantially outperforms existing tools across all six PTM types. Its application to predicting PTM-altering missense variants shows strong concordance with experimental results, validated using literature-curated variants and cancer proteogenomic datasets. Together, PTMAtlas and DeepMVP provide a robust platform for PTM research and a scalable framework for assessing the functional consequences of coding variants through the lens of PTMs. DeepMVP is a deep learning framework for predicting PTM sites and variant-induced alterations across six modification types, including phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation.","PeriodicalId":18981,"journal":{"name":"Nature Methods","volume":"22 9","pages":"1857-1867"},"PeriodicalIF":32.1000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446062/pdf/","citationCount":"0","resultStr":"{\"title\":\"DeepMVP: deep learning models trained on high-quality data accurately predict PTM sites and variant-induced alterations\",\"authors\":\"Bo Wen, Chenwei Wang, Kai Li, Ping Han, Matthew V. Holt, Sara R. Savage, Jonathan T. Lei, Yongchao Dou, Zhiao Shi, Yi Li, Bing Zhang\",\"doi\":\"10.1038/s41592-025-02797-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Post-translational modifications (PTMs) are critical regulators of protein function, and their disruption is a key mechanism by which missense variants contribute to disease. Accurate PTM site prediction using deep learning can help identify PTM-altering variants, but progress has been limited by the lack of large, high-quality training datasets. Here, we introduce PTMAtlas, a curated compendium of 397,524 PTM sites generated through systematic reprocessing of 241 public mass-spectrometry datasets, and DeepMVP, a deep learning framework trained on PTMAtlas to predict PTM sites for phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation. DeepMVP substantially outperforms existing tools across all six PTM types. Its application to predicting PTM-altering missense variants shows strong concordance with experimental results, validated using literature-curated variants and cancer proteogenomic datasets. Together, PTMAtlas and DeepMVP provide a robust platform for PTM research and a scalable framework for assessing the functional consequences of coding variants through the lens of PTMs. DeepMVP is a deep learning framework for predicting PTM sites and variant-induced alterations across six modification types, including phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation.\",\"PeriodicalId\":18981,\"journal\":{\"name\":\"Nature Methods\",\"volume\":\"22 9\",\"pages\":\"1857-1867\"},\"PeriodicalIF\":32.1000,\"publicationDate\":\"2025-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446062/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.nature.com/articles/s41592-025-02797-x\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Methods","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41592-025-02797-x","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

翻译后修饰(PTMs)是蛋白质功能的关键调节因子,其破坏是错义变异导致疾病的关键机制。使用深度学习进行准确的PTM位点预测可以帮助识别PTM改变的变体,但由于缺乏大型、高质量的训练数据集,进展受到限制。在这里,我们介绍了PTMAtlas,一个通过系统再处理241个公共质谱数据集生成的397,524个PTM位点的精选纲要,以及DeepMVP,一个基于PTMAtlas训练的深度学习框架,用于预测PTM位点的磷酸化、乙酰化、甲基化、sumoylation、泛素化和n -糖基化。DeepMVP在所有六种PTM类型中都大大优于现有工具。它在预测ptm改变错义变异方面的应用显示了与实验结果的强烈一致性,并使用文献整理的变异和癌症蛋白质基因组数据集进行了验证。PTMAtlas和DeepMVP共同为PTM研究提供了一个强大的平台,并为通过PTM来评估编码变体的功能后果提供了一个可扩展的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

DeepMVP: deep learning models trained on high-quality data accurately predict PTM sites and variant-induced alterations

DeepMVP: deep learning models trained on high-quality data accurately predict PTM sites and variant-induced alterations
Post-translational modifications (PTMs) are critical regulators of protein function, and their disruption is a key mechanism by which missense variants contribute to disease. Accurate PTM site prediction using deep learning can help identify PTM-altering variants, but progress has been limited by the lack of large, high-quality training datasets. Here, we introduce PTMAtlas, a curated compendium of 397,524 PTM sites generated through systematic reprocessing of 241 public mass-spectrometry datasets, and DeepMVP, a deep learning framework trained on PTMAtlas to predict PTM sites for phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation. DeepMVP substantially outperforms existing tools across all six PTM types. Its application to predicting PTM-altering missense variants shows strong concordance with experimental results, validated using literature-curated variants and cancer proteogenomic datasets. Together, PTMAtlas and DeepMVP provide a robust platform for PTM research and a scalable framework for assessing the functional consequences of coding variants through the lens of PTMs. DeepMVP is a deep learning framework for predicting PTM sites and variant-induced alterations across six modification types, including phosphorylation, acetylation, methylation, sumoylation, ubiquitination and N-glycosylation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Nature Methods
Nature Methods 生物-生化研究方法
CiteScore
58.70
自引率
1.70%
发文量
326
审稿时长
1 months
期刊介绍: Nature Methods is a monthly journal that focuses on publishing innovative methods and substantial enhancements to fundamental life sciences research techniques. Geared towards a diverse, interdisciplinary readership of researchers in academia and industry engaged in laboratory work, the journal offers new tools for research and emphasizes the immediate practical significance of the featured work. It publishes primary research papers and reviews recent technical and methodological advancements, with a particular interest in primary methods papers relevant to the biological and biomedical sciences. This includes methods rooted in chemistry with practical applications for studying biological problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信