{"title":"Improved Prediction of Stabilizing Mutations in Proteins by Incorporation of Mutational Effects on Ligand Binding.","authors":"Srivarshini Ganesan, Nidhi Mittal, Akash Bhat, Rachana S Adiga, Ananthakrishnan Ganesan, Deepesh Nagarajan, Raghavan Varadarajan","doi":"10.1002/prot.26738","DOIUrl":null,"url":null,"abstract":"<p><p>While many computational methods accurately predict destabilizing mutations, identifying stabilizing mutations has remained a challenge, because of their relative rarity. We tested ΔΔG <sup>0</sup> predictions from computational predictors such as Rosetta, ThermoMPNN, RaSP, and DeepDDG, using 82 mutants of the bacterial toxin CcdB as a test case. On this dataset, the best computational predictor is ThermoMPNN, which identifies stabilizing mutations with a precision of 68%. However, the average increase in T <sub>m</sub> for these predicted mutations was only 1°C for CcdB, and predictions were poorer for a more challenging target, influenza neuraminidase. Using data from multiple previously described yeast surface display libraries and in vitro thermal stability measurements, we trained logistic regression models to identify stabilizing mutations with a precision of 90% and an average increase in T <sub>m</sub> of 3°C for CcdB. When such libraries contain a population of mutants with significantly enhanced binding relative to the corresponding wild type, there is no benefit in using computational predictors. It is then possible to predict stabilizing mutations without any training, simply by examining the distribution of mutational binding scores. This avoids laborious steps of in vitro expression, purification, and stability characterization. When this is not the case, combining data from computational predictors with high-throughput experimental binding data enhances the prediction of stabilizing mutations. However, this requires training on stability data measured in vitro with known stabilized mutants. It is thus feasible to predict stabilizing mutations rapidly and accurately for any system of interest that can be subjected to a binding selection or screen.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":"384-395"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.26738","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
While many computational methods accurately predict destabilizing mutations, identifying stabilizing mutations has remained a challenge, because of their relative rarity. We tested ΔΔG 0 predictions from computational predictors such as Rosetta, ThermoMPNN, RaSP, and DeepDDG, using 82 mutants of the bacterial toxin CcdB as a test case. On this dataset, the best computational predictor is ThermoMPNN, which identifies stabilizing mutations with a precision of 68%. However, the average increase in T m for these predicted mutations was only 1°C for CcdB, and predictions were poorer for a more challenging target, influenza neuraminidase. Using data from multiple previously described yeast surface display libraries and in vitro thermal stability measurements, we trained logistic regression models to identify stabilizing mutations with a precision of 90% and an average increase in T m of 3°C for CcdB. When such libraries contain a population of mutants with significantly enhanced binding relative to the corresponding wild type, there is no benefit in using computational predictors. It is then possible to predict stabilizing mutations without any training, simply by examining the distribution of mutational binding scores. This avoids laborious steps of in vitro expression, purification, and stability characterization. When this is not the case, combining data from computational predictors with high-throughput experimental binding data enhances the prediction of stabilizing mutations. However, this requires training on stability data measured in vitro with known stabilized mutants. It is thus feasible to predict stabilizing mutations rapidly and accurately for any system of interest that can be subjected to a binding selection or screen.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.