ProstaNet: A Novel Geometric Vector Perceptrons-Graph Neural Network Algorithm for Protein Stability Prediction in Single- and Multiple-Point Mutations with Experimental Validation.
{"title":"ProstaNet: A Novel Geometric Vector Perceptrons-Graph Neural Network Algorithm for Protein Stability Prediction in Single- and Multiple-Point Mutations with Experimental Validation.","authors":"Tianjian Liang, Ze-Yu Sun, Rieko Ishima, Xiang-Qun Xie, Ying Xue, Wei Li, Zhiwei Feng","doi":"10.34133/research.0674","DOIUrl":null,"url":null,"abstract":"<p><p>Proteins play a critical role in biology and biopharma due to their specificity and minimal side effects. Predicting the effects of mutations on protein stability is vital but experimentally challenging. Deep learning offers an efficient solution to this problem. In the present work, we introduced ProstaNet, a deep learning framework that predicts stability changes resulting from single- and multiple-point mutations using geometric vector perceptrons-graph neural network for 3-dimensional feature processing. For training ProstaNet, we meticulously crafted ProstaDB, a comprehensive and pristine thermodynamics repository, including 3,784 single-point mutations and 1,642 multiple-point mutations. We also created thermodynamic looping for enlarging the limited data size of multiple-point mutation and applied an innovative clustering method to generate a standard testing set of multiple-point mutation. Besides, we identified residue scoring as the most important encoding method in protein properties prediction. With these innovations, ProstaNet accurately predicts thermostability changes for both single-point and multiple-point mutations without showing any bias. ProstaNet achieves an accuracy of 0.75, outperforming existing methods for single-point mutation prediction, including ThermoMPNN (0.63), PoPMuSiC<sup>sym</sup> (0.66), MUPRO (0.52), and FoldX (0.71). ProstaNet also achieves a 1.3-fold increase in accuracy compared to FoldX for multiple-point mutation predictions. Validated by experiment, 4 out of 5 single-point mutation predictions (80%) and all multiple-point mutation predictions (100%) for HuJ3 mutants were accurate, demonstrating the potential benefits of ProstaNet for protein engineering and drug development.</p>","PeriodicalId":21120,"journal":{"name":"Research","volume":"8 ","pages":"0674"},"PeriodicalIF":11.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11997553/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0674","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0
Abstract
Proteins play a critical role in biology and biopharma due to their specificity and minimal side effects. Predicting the effects of mutations on protein stability is vital but experimentally challenging. Deep learning offers an efficient solution to this problem. In the present work, we introduced ProstaNet, a deep learning framework that predicts stability changes resulting from single- and multiple-point mutations using geometric vector perceptrons-graph neural network for 3-dimensional feature processing. For training ProstaNet, we meticulously crafted ProstaDB, a comprehensive and pristine thermodynamics repository, including 3,784 single-point mutations and 1,642 multiple-point mutations. We also created thermodynamic looping for enlarging the limited data size of multiple-point mutation and applied an innovative clustering method to generate a standard testing set of multiple-point mutation. Besides, we identified residue scoring as the most important encoding method in protein properties prediction. With these innovations, ProstaNet accurately predicts thermostability changes for both single-point and multiple-point mutations without showing any bias. ProstaNet achieves an accuracy of 0.75, outperforming existing methods for single-point mutation prediction, including ThermoMPNN (0.63), PoPMuSiCsym (0.66), MUPRO (0.52), and FoldX (0.71). ProstaNet also achieves a 1.3-fold increase in accuracy compared to FoldX for multiple-point mutation predictions. Validated by experiment, 4 out of 5 single-point mutation predictions (80%) and all multiple-point mutation predictions (100%) for HuJ3 mutants were accurate, demonstrating the potential benefits of ProstaNet for protein engineering and drug development.
期刊介绍:
Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe.
Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.