Subash C Pakhrin, Moriah R Beck, Punjan Subedi, Rabina Lama, Simonsha Shrestha
{"title":"多模态深度学习预测蛋白质泛素化位点。","authors":"Subash C Pakhrin, Moriah R Beck, Punjan Subedi, Rabina Lama, Simonsha Shrestha","doi":"10.1093/bioadv/vbaf200","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Ubiquitination is a crucial post-translational modification that regulates various biological functions, including protein degradation, signal transduction, and cellular homeostasis. Accurate identification of ubiquitination sites is essential for understanding these mechanisms, yet existing prediction tools often lack generalizability across diverse datasets. To address this limitation, we developed Multimodal Ubiquitination Predictor, a deep learning-based approach capable of predicting ubiquitination sites across general, human-specific, and plant-specific datasets. By integrating diverse protein sequence representations-one-hot encoding, embeddings, and physicochemical properties-within a unified deep-learning framework, the proposed method significantly enhances prediction accuracy and robustness, offering a valuable resource for both research and applications in ubiquitination site discovery.</p><p><strong>Results: </strong>Multimodal Ubiquitination Predictor achieved superior performance across general, human-specific, and plant-specific datasets, with 77.25% accuracy, 74.98% sensitivity, 80.67% specificity, an MCC of 0.54, and an AUC of 0.87 on an independent human ubiquitination test dataset. It outperformed existing methods, demonstrating enhanced reliability for ubiquitination site prediction. This robust predictor and dataset serve as valuable resources for future research and discovery.</p><p><strong>Availability and implementation: </strong>The developed tool, programs, training, and test dataset are available at https://github.com/PakhrinLab/MMUbiPred.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf200"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408473/pdf/","citationCount":"0","resultStr":"{\"title\":\"Multimodal deep learning for predicting protein ubiquitination sites.\",\"authors\":\"Subash C Pakhrin, Moriah R Beck, Punjan Subedi, Rabina Lama, Simonsha Shrestha\",\"doi\":\"10.1093/bioadv/vbaf200\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Motivation: </strong>Ubiquitination is a crucial post-translational modification that regulates various biological functions, including protein degradation, signal transduction, and cellular homeostasis. Accurate identification of ubiquitination sites is essential for understanding these mechanisms, yet existing prediction tools often lack generalizability across diverse datasets. To address this limitation, we developed Multimodal Ubiquitination Predictor, a deep learning-based approach capable of predicting ubiquitination sites across general, human-specific, and plant-specific datasets. By integrating diverse protein sequence representations-one-hot encoding, embeddings, and physicochemical properties-within a unified deep-learning framework, the proposed method significantly enhances prediction accuracy and robustness, offering a valuable resource for both research and applications in ubiquitination site discovery.</p><p><strong>Results: </strong>Multimodal Ubiquitination Predictor achieved superior performance across general, human-specific, and plant-specific datasets, with 77.25% accuracy, 74.98% sensitivity, 80.67% specificity, an MCC of 0.54, and an AUC of 0.87 on an independent human ubiquitination test dataset. It outperformed existing methods, demonstrating enhanced reliability for ubiquitination site prediction. This robust predictor and dataset serve as valuable resources for future research and discovery.</p><p><strong>Availability and implementation: </strong>The developed tool, programs, training, and test dataset are available at https://github.com/PakhrinLab/MMUbiPred.</p>\",\"PeriodicalId\":72368,\"journal\":{\"name\":\"Bioinformatics advances\",\"volume\":\"5 1\",\"pages\":\"vbaf200\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408473/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics advances\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioadv/vbaf200\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
Multimodal deep learning for predicting protein ubiquitination sites.
Motivation: Ubiquitination is a crucial post-translational modification that regulates various biological functions, including protein degradation, signal transduction, and cellular homeostasis. Accurate identification of ubiquitination sites is essential for understanding these mechanisms, yet existing prediction tools often lack generalizability across diverse datasets. To address this limitation, we developed Multimodal Ubiquitination Predictor, a deep learning-based approach capable of predicting ubiquitination sites across general, human-specific, and plant-specific datasets. By integrating diverse protein sequence representations-one-hot encoding, embeddings, and physicochemical properties-within a unified deep-learning framework, the proposed method significantly enhances prediction accuracy and robustness, offering a valuable resource for both research and applications in ubiquitination site discovery.
Results: Multimodal Ubiquitination Predictor achieved superior performance across general, human-specific, and plant-specific datasets, with 77.25% accuracy, 74.98% sensitivity, 80.67% specificity, an MCC of 0.54, and an AUC of 0.87 on an independent human ubiquitination test dataset. It outperformed existing methods, demonstrating enhanced reliability for ubiquitination site prediction. This robust predictor and dataset serve as valuable resources for future research and discovery.
Availability and implementation: The developed tool, programs, training, and test dataset are available at https://github.com/PakhrinLab/MMUbiPred.