Joseph I. J. Ellaway, Stephen Anyango, S. Nair, Hossam A. Zaki, Nurul Nadzirin, Harold R. Powell, Aleksandras Gutmanas, Mihaly Varadi, Sameer Velankar
{"title":"在蛋白质数据库中识别蛋白质构象状态:挖掘综合动力学研究的潜力","authors":"Joseph I. J. Ellaway, Stephen Anyango, S. Nair, Hossam A. Zaki, Nurul Nadzirin, Harold R. Powell, Aleksandras Gutmanas, Mihaly Varadi, Sameer Velankar","doi":"10.1063/4.0000251","DOIUrl":null,"url":null,"abstract":"Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe – Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.","PeriodicalId":21992,"journal":{"name":"Structural Dynamics","volume":"14 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies\",\"authors\":\"Joseph I. J. Ellaway, Stephen Anyango, S. Nair, Hossam A. Zaki, Nurul Nadzirin, Harold R. Powell, Aleksandras Gutmanas, Mihaly Varadi, Sameer Velankar\",\"doi\":\"10.1063/4.0000251\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe – Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.\",\"PeriodicalId\":21992,\"journal\":{\"name\":\"Structural Dynamics\",\"volume\":\"14 9\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Structural Dynamics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1063/4.0000251\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Structural Dynamics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/4.0000251","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
研究蛋白质动力学和构象异质性对于了解生物分子系统和治疗疾病至关重要。尽管蛋白质数据库(Protein Data Bank)中储存了 215,000 多种大分子结构,而且出现了 AlphaFold2、RoseTTAFold 和 ESMFold 等基于人工智能的结构预测工具,但这些工具通常生成的是静态表征,无法完全捕捉大分子的运动。在此,我们讨论了将实验结构与计算聚类相结合以探索体现蛋白质功能的构象景观的重要性。我们介绍了欧洲蛋白质数据库--知识库(Protein Data Bank in Europe - Knowledge Base)开发的识别不同构象状态的方法,通过实例展示了该资源的主要用例,并讨论了进一步努力为蛋白质构象注释功能信息的必要性。这些举措对于释放蛋白质动力学数据的潜力、加快药物发现研究以及加深我们对大分子机理的理解至关重要。
Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies
Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe – Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.