IF 2.6 4区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
Pranav N M Shah, Ruben Sanchez-Garcia, David I Stuart
{"title":"TomoCPT: a generalizable model for 3D particle detection and localization in cryo-electron tomograms.","authors":"Pranav N M Shah, Ruben Sanchez-Garcia, David I Stuart","doi":"10.1107/S2059798325000865","DOIUrl":null,"url":null,"abstract":"<p><p>Cryo-electron tomography is a rapidly developing field for studying macromolecular complexes in their native environments and has the potential to revolutionize our understanding of protein function. However, fast and accurate identification of particles in cryo-tomograms is challenging and represents a significant bottleneck in downstream processes such as subtomogram averaging. Here, we present tomoCPT (Tomogram Centroid Prediction Tool), a transformer-based solution that reformulates particle detection as a centroid-prediction task using Gaussian labels. Our approach, which is built upon the SwinUNETR architecture, demonstrates superior performance compared with both conventional binary labelling strategies and template matching. We show that tomoCPT effectively generalizes to novel particle types through zero-shot inference and can be significantly enhanced through fine-tuning with limited data. The efficacy of tomoCPT is validated using three case studies: apoferritin, achieving a resolution of 3.0 Å compared with 3.3 Å using template matching, SARS-CoV-2 spike proteins on cell surfaces, yielding an 18.3 Å resolution map where template matching proved unsuccessful, and rubisco molecules within carboxysomes, reaching 8.0 Å resolution. These results demonstrate the ability of tomoCPT to handle varied scenarios, including densely packed environments and membrane-bound proteins. The implementation of the tool as a command-line program, coupled with its minimal data requirements for fine-tuning, makes it a practical solution for high-throughput cryo-ET data-processing workflows.</p>","PeriodicalId":7116,"journal":{"name":"Acta Crystallographica. Section D, Structural Biology","volume":" ","pages":"63-76"},"PeriodicalIF":2.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11804251/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Crystallographica. Section D, Structural Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1107/S2059798325000865","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

低温电子断层成像是一个快速发展的领域,用于研究原生环境中的大分子复合物,并有可能彻底改变我们对蛋白质功能的认识。然而,快速准确地识别低温断层图中的颗粒是一项挑战,也是子断层图平均化等下游过程中的一个重要瓶颈。在这里,我们提出了 tomoCPT(断层扫描中心点预测工具),这是一种基于变换器的解决方案,它利用高斯标签将粒子检测重新表述为中心点预测任务。我们的方法建立在 SwinUNETR 架构之上,与传统的二进制标签策略和模板匹配相比,表现出更优越的性能。我们的研究表明,tomoCPT 可通过零点推理有效地泛化到新的粒子类型,并可通过对有限数据的微调显著提高性能。我们通过三个案例研究验证了 tomoCPT 的功效:apoferritin 的分辨率为 3.0 Å,而模板匹配的分辨率为 3.3 Å;细胞表面的 SARS-CoV-2 棘突蛋白的分辨率为 18.3 Å,而模板匹配的分辨率为 18.3 Å;羧基体中的 rubisco 分子的分辨率为 8.0 Å。这些结果证明了 tomoCPT 处理各种情况的能力,包括密集环境和膜结合蛋白。该工具以命令行程序的形式实施,加上其对微调数据的最低要求,使其成为高通量低温电子显微镜数据处理工作流程的实用解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TomoCPT: a generalizable model for 3D particle detection and localization in cryo-electron tomograms.

Cryo-electron tomography is a rapidly developing field for studying macromolecular complexes in their native environments and has the potential to revolutionize our understanding of protein function. However, fast and accurate identification of particles in cryo-tomograms is challenging and represents a significant bottleneck in downstream processes such as subtomogram averaging. Here, we present tomoCPT (Tomogram Centroid Prediction Tool), a transformer-based solution that reformulates particle detection as a centroid-prediction task using Gaussian labels. Our approach, which is built upon the SwinUNETR architecture, demonstrates superior performance compared with both conventional binary labelling strategies and template matching. We show that tomoCPT effectively generalizes to novel particle types through zero-shot inference and can be significantly enhanced through fine-tuning with limited data. The efficacy of tomoCPT is validated using three case studies: apoferritin, achieving a resolution of 3.0 Å compared with 3.3 Å using template matching, SARS-CoV-2 spike proteins on cell surfaces, yielding an 18.3 Å resolution map where template matching proved unsuccessful, and rubisco molecules within carboxysomes, reaching 8.0 Å resolution. These results demonstrate the ability of tomoCPT to handle varied scenarios, including densely packed environments and membrane-bound proteins. The implementation of the tool as a command-line program, coupled with its minimal data requirements for fine-tuning, makes it a practical solution for high-throughput cryo-ET data-processing workflows.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Acta Crystallographica. Section D, Structural Biology
Acta Crystallographica. Section D, Structural Biology BIOCHEMICAL RESEARCH METHODSBIOCHEMISTRY &-BIOCHEMISTRY & MOLECULAR BIOLOGY
CiteScore
4.50
自引率
13.60%
发文量
216
期刊介绍: Acta Crystallographica Section D welcomes the submission of articles covering any aspect of structural biology, with a particular emphasis on the structures of biological macromolecules or the methods used to determine them. Reports on new structures of biological importance may address the smallest macromolecules to the largest complex molecular machines. These structures may have been determined using any structural biology technique including crystallography, NMR, cryoEM and/or other techniques. The key criterion is that such articles must present significant new insights into biological, chemical or medical sciences. The inclusion of complementary data that support the conclusions drawn from the structural studies (such as binding studies, mass spectrometry, enzyme assays, or analysis of mutants or other modified forms of biological macromolecule) is encouraged. Methods articles may include new approaches to any aspect of biological structure determination or structure analysis but will only be accepted where they focus on new methods that are demonstrated to be of general applicability and importance to structural biology. Articles describing particularly difficult problems in structural biology are also welcomed, if the analysis would provide useful insights to others facing similar problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信