Qianqian Wang, Zhiwei Miao, Xiongjie Xiao, Xu Zhang, Daiwen Yang, Bin Jiang, Maili Liu
{"title":"Prediction of order parameters based on protein NMR structure ensemble and machine learning","authors":"Qianqian Wang, Zhiwei Miao, Xiongjie Xiao, Xu Zhang, Daiwen Yang, Bin Jiang, Maili Liu","doi":"10.1007/s10858-024-00435-w","DOIUrl":null,"url":null,"abstract":"<div><p>The fast motions of proteins at the picosecond to nanosecond timescale, known as fast dynamics, are closely related to protein conformational entropy and rearrangement, which in turn affect catalysis, ligand binding and protein allosteric effects. The most used NMR approach to study fast protein dynamics is the model free method, which uses order parameter <i>S</i><sup>2</sup> to describe the amplitude of the internal motion of local group. However, to obtain order parameter through NMR experiments is quite complex and lengthy. In this paper, we present a machine learning approach for predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters based on protein NMR structure ensemble. A random forest model is used to learn the relationship between order parameters and structural features. Our method achieves high accuracy in predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters for a test dataset of 10 proteins, with a Pearson correlation coefficient of 0.817 and a root-mean-square error of 0.131.</p></div>","PeriodicalId":613,"journal":{"name":"Journal of Biomolecular NMR","volume":"78 2","pages":"87 - 94"},"PeriodicalIF":1.3000,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomolecular NMR","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10858-024-00435-w","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The fast motions of proteins at the picosecond to nanosecond timescale, known as fast dynamics, are closely related to protein conformational entropy and rearrangement, which in turn affect catalysis, ligand binding and protein allosteric effects. The most used NMR approach to study fast protein dynamics is the model free method, which uses order parameter S2 to describe the amplitude of the internal motion of local group. However, to obtain order parameter through NMR experiments is quite complex and lengthy. In this paper, we present a machine learning approach for predicting backbone 1H-15N order parameters based on protein NMR structure ensemble. A random forest model is used to learn the relationship between order parameters and structural features. Our method achieves high accuracy in predicting backbone 1H-15N order parameters for a test dataset of 10 proteins, with a Pearson correlation coefficient of 0.817 and a root-mean-square error of 0.131.
期刊介绍:
The Journal of Biomolecular NMR provides a forum for publishing research on technical developments and innovative applications of nuclear magnetic resonance spectroscopy for the study of structure and dynamic properties of biopolymers in solution, liquid crystals, solids and mixed environments, e.g., attached to membranes. This may include:
Three-dimensional structure determination of biological macromolecules (polypeptides/proteins, DNA, RNA, oligosaccharides) by NMR.
New NMR techniques for studies of biological macromolecules.
Novel approaches to computer-aided automated analysis of multidimensional NMR spectra.
Computational methods for the structural interpretation of NMR data, including structure refinement.
Comparisons of structures determined by NMR with those obtained by other methods, e.g. by diffraction techniques with protein single crystals.
New techniques of sample preparation for NMR experiments (biosynthetic and chemical methods for isotope labeling, preparation of nutrients for biosynthetic isotope labeling, etc.). An NMR characterization of the products must be included.