Qianqian Wang, Zhiwei Miao, Xiongjie Xiao, Xu Zhang, Daiwen Yang, Bin Jiang, Maili Liu
{"title":"基于蛋白质核磁共振结构集合和机器学习的阶次参数预测。","authors":"Qianqian Wang, Zhiwei Miao, Xiongjie Xiao, Xu Zhang, Daiwen Yang, Bin Jiang, Maili Liu","doi":"10.1007/s10858-024-00435-w","DOIUrl":null,"url":null,"abstract":"<div><p>The fast motions of proteins at the picosecond to nanosecond timescale, known as fast dynamics, are closely related to protein conformational entropy and rearrangement, which in turn affect catalysis, ligand binding and protein allosteric effects. The most used NMR approach to study fast protein dynamics is the model free method, which uses order parameter <i>S</i><sup>2</sup> to describe the amplitude of the internal motion of local group. However, to obtain order parameter through NMR experiments is quite complex and lengthy. In this paper, we present a machine learning approach for predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters based on protein NMR structure ensemble. A random forest model is used to learn the relationship between order parameters and structural features. Our method achieves high accuracy in predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters for a test dataset of 10 proteins, with a Pearson correlation coefficient of 0.817 and a root-mean-square error of 0.131.</p></div>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of order parameters based on protein NMR structure ensemble and machine learning\",\"authors\":\"Qianqian Wang, Zhiwei Miao, Xiongjie Xiao, Xu Zhang, Daiwen Yang, Bin Jiang, Maili Liu\",\"doi\":\"10.1007/s10858-024-00435-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The fast motions of proteins at the picosecond to nanosecond timescale, known as fast dynamics, are closely related to protein conformational entropy and rearrangement, which in turn affect catalysis, ligand binding and protein allosteric effects. The most used NMR approach to study fast protein dynamics is the model free method, which uses order parameter <i>S</i><sup>2</sup> to describe the amplitude of the internal motion of local group. However, to obtain order parameter through NMR experiments is quite complex and lengthy. In this paper, we present a machine learning approach for predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters based on protein NMR structure ensemble. A random forest model is used to learn the relationship between order parameters and structural features. Our method achieves high accuracy in predicting backbone <sup>1</sup>H-<sup>15</sup>N order parameters for a test dataset of 10 proteins, with a Pearson correlation coefficient of 0.817 and a root-mean-square error of 0.131.</p></div>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2024-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10858-024-00435-w\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10858-024-00435-w","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Prediction of order parameters based on protein NMR structure ensemble and machine learning
The fast motions of proteins at the picosecond to nanosecond timescale, known as fast dynamics, are closely related to protein conformational entropy and rearrangement, which in turn affect catalysis, ligand binding and protein allosteric effects. The most used NMR approach to study fast protein dynamics is the model free method, which uses order parameter S2 to describe the amplitude of the internal motion of local group. However, to obtain order parameter through NMR experiments is quite complex and lengthy. In this paper, we present a machine learning approach for predicting backbone 1H-15N order parameters based on protein NMR structure ensemble. A random forest model is used to learn the relationship between order parameters and structural features. Our method achieves high accuracy in predicting backbone 1H-15N order parameters for a test dataset of 10 proteins, with a Pearson correlation coefficient of 0.817 and a root-mean-square error of 0.131.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.