Subash C Pakhrin, Moriah R Beck, Punjan Subedi, Rabina Lama, Simonsha Shrestha
{"title":"Multimodal deep learning for predicting protein ubiquitination sites.","authors":"Subash C Pakhrin, Moriah R Beck, Punjan Subedi, Rabina Lama, Simonsha Shrestha","doi":"10.1093/bioadv/vbaf200","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Ubiquitination is a crucial post-translational modification that regulates various biological functions, including protein degradation, signal transduction, and cellular homeostasis. Accurate identification of ubiquitination sites is essential for understanding these mechanisms, yet existing prediction tools often lack generalizability across diverse datasets. To address this limitation, we developed Multimodal Ubiquitination Predictor, a deep learning-based approach capable of predicting ubiquitination sites across general, human-specific, and plant-specific datasets. By integrating diverse protein sequence representations-one-hot encoding, embeddings, and physicochemical properties-within a unified deep-learning framework, the proposed method significantly enhances prediction accuracy and robustness, offering a valuable resource for both research and applications in ubiquitination site discovery.</p><p><strong>Results: </strong>Multimodal Ubiquitination Predictor achieved superior performance across general, human-specific, and plant-specific datasets, with 77.25% accuracy, 74.98% sensitivity, 80.67% specificity, an MCC of 0.54, and an AUC of 0.87 on an independent human ubiquitination test dataset. It outperformed existing methods, demonstrating enhanced reliability for ubiquitination site prediction. This robust predictor and dataset serve as valuable resources for future research and discovery.</p><p><strong>Availability and implementation: </strong>The developed tool, programs, training, and test dataset are available at https://github.com/PakhrinLab/MMUbiPred.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf200"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12408473/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Ubiquitination is a crucial post-translational modification that regulates various biological functions, including protein degradation, signal transduction, and cellular homeostasis. Accurate identification of ubiquitination sites is essential for understanding these mechanisms, yet existing prediction tools often lack generalizability across diverse datasets. To address this limitation, we developed Multimodal Ubiquitination Predictor, a deep learning-based approach capable of predicting ubiquitination sites across general, human-specific, and plant-specific datasets. By integrating diverse protein sequence representations-one-hot encoding, embeddings, and physicochemical properties-within a unified deep-learning framework, the proposed method significantly enhances prediction accuracy and robustness, offering a valuable resource for both research and applications in ubiquitination site discovery.
Results: Multimodal Ubiquitination Predictor achieved superior performance across general, human-specific, and plant-specific datasets, with 77.25% accuracy, 74.98% sensitivity, 80.67% specificity, an MCC of 0.54, and an AUC of 0.87 on an independent human ubiquitination test dataset. It outperformed existing methods, demonstrating enhanced reliability for ubiquitination site prediction. This robust predictor and dataset serve as valuable resources for future research and discovery.
Availability and implementation: The developed tool, programs, training, and test dataset are available at https://github.com/PakhrinLab/MMUbiPred.