Systematic analysis of biomolecular conformational ensembles with PENSA.

IF 3.1 2区 化学 Q3 CHEMISTRY, PHYSICAL
Martin Vögele, Neil J Thomson, Sang T Truong, Jasper McAvity, Ulrich Zachariae, Ron O Dror
{"title":"Systematic analysis of biomolecular conformational ensembles with PENSA.","authors":"Martin Vögele, Neil J Thomson, Sang T Truong, Jasper McAvity, Ulrich Zachariae, Ron O Dror","doi":"10.1063/5.0235544","DOIUrl":null,"url":null,"abstract":"<p><p>Atomic-level simulations are widely used to study biomolecules and their dynamics. A common goal in such studies is to compare simulations of a molecular system under several conditions-for example, with various mutations or bound ligands-in order to identify differences between the molecular conformations adopted under these conditions. However, the large amount of data produced by simulations of ever larger and more complex systems often renders it difficult to identify the structural features that are relevant to a particular biochemical phenomenon. We present a flexible software package named Python ENSemble Analysis (PENSA) that enables a comprehensive and thorough investigation into biomolecular conformational ensembles. It provides featurization and feature transformations that allow for a complete representation of biomolecules such as proteins and nucleic acids, including water and ion binding sites, thus avoiding the bias that would come with manual feature selection. PENSA implements methods to systematically compare the distributions of molecular features across ensembles to find the significant differences between them and identify regions of interest. It also includes a novel approach to quantify the state-specific information between two regions of a biomolecule, which allows, for example, tracing information flow to identify allosteric pathways. PENSA also comes with convenient tools for loading data and visualizing results, making them quick to process and easy to interpret. PENSA is an open-source Python library maintained at https://github.com/drorlab/pensa along with an example workflow and a tutorial. We demonstrate its usefulness in real-world examples by showing how it helps us determine molecular mechanisms efficiently.</p>","PeriodicalId":15313,"journal":{"name":"Journal of Chemical Physics","volume":"162 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698571/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Physics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1063/5.0235544","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Atomic-level simulations are widely used to study biomolecules and their dynamics. A common goal in such studies is to compare simulations of a molecular system under several conditions-for example, with various mutations or bound ligands-in order to identify differences between the molecular conformations adopted under these conditions. However, the large amount of data produced by simulations of ever larger and more complex systems often renders it difficult to identify the structural features that are relevant to a particular biochemical phenomenon. We present a flexible software package named Python ENSemble Analysis (PENSA) that enables a comprehensive and thorough investigation into biomolecular conformational ensembles. It provides featurization and feature transformations that allow for a complete representation of biomolecules such as proteins and nucleic acids, including water and ion binding sites, thus avoiding the bias that would come with manual feature selection. PENSA implements methods to systematically compare the distributions of molecular features across ensembles to find the significant differences between them and identify regions of interest. It also includes a novel approach to quantify the state-specific information between two regions of a biomolecule, which allows, for example, tracing information flow to identify allosteric pathways. PENSA also comes with convenient tools for loading data and visualizing results, making them quick to process and easy to interpret. PENSA is an open-source Python library maintained at https://github.com/drorlab/pensa along with an example workflow and a tutorial. We demonstrate its usefulness in real-world examples by showing how it helps us determine molecular mechanisms efficiently.

生物分子构象系的PENSA系统分析。
原子水平模拟被广泛用于研究生物分子及其动力学。这类研究的一个共同目标是比较不同条件下的分子系统模拟,例如,不同的突变或结合配体,以确定在这些条件下所采用的分子构象之间的差异。然而,模拟越来越大、越来越复杂的系统所产生的大量数据,往往使识别与特定生化现象有关的结构特征变得困难。我们提出了一个名为Python ENSemble Analysis (PENSA)的灵活软件包,可以对生物分子构象集成进行全面彻底的调查。它提供了特征和特征转换,允许完整地表示生物分子,如蛋白质和核酸,包括水和离子结合位点,从而避免了手动特征选择带来的偏差。PENSA实现了系统地比较分子特征分布的方法,以发现它们之间的显着差异,并确定感兴趣的区域。它还包括一种新的方法来量化生物分子两个区域之间的特定状态信息,例如,它允许跟踪信息流来识别变弹性途径。PENSA还提供了方便的工具,用于加载数据和可视化结果,使其快速处理和易于解释。PENSA是一个开源Python库,在https://github.com/drorlab/pensa上维护,附带一个示例工作流和教程。我们通过展示它如何帮助我们有效地确定分子机制来证明它在现实世界中的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chemical Physics
Journal of Chemical Physics 物理-物理:原子、分子和化学物理
CiteScore
7.40
自引率
15.90%
发文量
1615
审稿时长
2 months
期刊介绍: The Journal of Chemical Physics publishes quantitative and rigorous science of long-lasting value in methods and applications of chemical physics. The Journal also publishes brief Communications of significant new findings, Perspectives on the latest advances in the field, and Special Topic issues. The Journal focuses on innovative research in experimental and theoretical areas of chemical physics, including spectroscopy, dynamics, kinetics, statistical mechanics, and quantum mechanics. In addition, topical areas such as polymers, soft matter, materials, surfaces/interfaces, and systems of biological relevance are of increasing importance. Topical coverage includes: Theoretical Methods and Algorithms Advanced Experimental Techniques Atoms, Molecules, and Clusters Liquids, Glasses, and Crystals Surfaces, Interfaces, and Materials Polymers and Soft Matter Biological Molecules and Networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信