Roodmus: a toolkit for benchmarking heterogeneous electron cryo-microscopy reconstructions

IF 2.9 2区 材料科学 Q2 CHEMISTRY, MULTIDISCIPLINARY
IUCrJ Pub Date : 2024-11-01 DOI:10.1107/S2052252524009321
{"title":"Roodmus: a toolkit for benchmarking heterogeneous electron cryo-microscopy reconstructions","authors":"","doi":"10.1107/S2052252524009321","DOIUrl":null,"url":null,"abstract":"<div><div><em>Roodmus</em> is a toolkit sourcing conformational heterogeneity of biomacromolecules from molecular dynamics simulations to generate high-quality synthetic data for the development and benchmarking of heterogeneous reconstruction algorithms.</div></div><div><div>Conformational heterogeneity of biological macromolecules is a challenge in single-particle averaging (SPA). Current standard practice is to employ classification and filtering methods that may allow a discrete number of conformational states to be reconstructed. However, the conformation space accessible to these molecules is continuous and, therefore, explored incompletely by a small number of discrete classes. Recently developed heterogeneous reconstruction algorithms (HRAs) to analyse continuous heterogeneity rely on machine-learning methods that employ low-dimensional latent space representations. The non-linear nature of many of these methods poses a challenge to their validation and interpretation and to identifying functionally relevant conformational trajectories. These methods would benefit from in-depth benchmarking using high-quality synthetic data and concomitant ground truth information. We present a framework for the simulation and subsequent analysis with respect to the ground truth of cryo-EM micrographs containing particles whose conformational heterogeneity is sourced from molecular dynamics simulations. These synthetic data can be processed as if they were experimental data, allowing aspects of standard SPA workflows as well as heterogeneous reconstruction methods to be compared with known ground truth using available utilities. The simulation and analysis of several such datasets are demonstrated and an initial investigation into HRAs is presented.</div></div>","PeriodicalId":14775,"journal":{"name":"IUCrJ","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11533995/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IUCrJ","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/org/science/article/pii/S2052252524000939","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Roodmus is a toolkit sourcing conformational heterogeneity of biomacromolecules from molecular dynamics simulations to generate high-quality synthetic data for the development and benchmarking of heterogeneous reconstruction algorithms.
Conformational heterogeneity of biological macromolecules is a challenge in single-particle averaging (SPA). Current standard practice is to employ classification and filtering methods that may allow a discrete number of conformational states to be reconstructed. However, the conformation space accessible to these molecules is continuous and, therefore, explored incompletely by a small number of discrete classes. Recently developed heterogeneous reconstruction algorithms (HRAs) to analyse continuous heterogeneity rely on machine-learning methods that employ low-dimensional latent space representations. The non-linear nature of many of these methods poses a challenge to their validation and interpretation and to identifying functionally relevant conformational trajectories. These methods would benefit from in-depth benchmarking using high-quality synthetic data and concomitant ground truth information. We present a framework for the simulation and subsequent analysis with respect to the ground truth of cryo-EM micrographs containing particles whose conformational heterogeneity is sourced from molecular dynamics simulations. These synthetic data can be processed as if they were experimental data, allowing aspects of standard SPA workflows as well as heterogeneous reconstruction methods to be compared with known ground truth using available utilities. The simulation and analysis of several such datasets are demonstrated and an initial investigation into HRAs is presented.
Roodmus:异构电子冷冻显微镜重建基准工具包。
生物大分子的构象异质性是单粒子平均法(SPA)面临的一个挑战。目前的标准做法是采用分类和过滤方法,这样可以重建离散的构象状态。然而,这些分子所能进入的构象空间是连续的,因此少量的离散类别对其进行的探索是不完整的。最近开发的异构重构算法(HRAs)分析连续异构性依赖于采用低维潜在空间表示的机器学习方法。其中许多方法的非线性性质对其验证和解释以及识别功能相关的构象轨迹构成了挑战。利用高质量的合成数据和相关的地面实况信息对这些方法进行深入的基准测试将使它们受益匪浅。我们提出了一个框架,用于模拟和随后分析含有构象异质性来自分子动力学模拟的颗粒的低温电子显微镜显微照片的基本真相。可以像处理实验数据一样处理这些合成数据,从而利用现有工具将标准 SPA 工作流程以及异构重建方法的各个方面与已知的基本事实进行比较。本文演示了几个此类数据集的模拟和分析,并介绍了对 HRA 的初步研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IUCrJ
IUCrJ CHEMISTRY, MULTIDISCIPLINARYCRYSTALLOGRAPH-CRYSTALLOGRAPHY
CiteScore
7.50
自引率
5.10%
发文量
95
审稿时长
10 weeks
期刊介绍: IUCrJ is a new fully open-access peer-reviewed journal from the International Union of Crystallography (IUCr). The journal will publish high-profile articles on all aspects of the sciences and technologies supported by the IUCr via its commissions, including emerging fields where structural results underpin the science reported in the article. Our aim is to make IUCrJ the natural home for high-quality structural science results. Chemists, biologists, physicists and material scientists will be actively encouraged to report their structural studies in IUCrJ.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信