AutoPD: an integrated meta-pipeline for high-throughput X-ray crystallography data processing and structure determination

IF 5.2 3区 材料科学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Xin Zhang, Haikai Sun, Yu Hu, Zengru Li, Zhi Geng, Zengqiang Gao, Quan Hao, Fazhi Qi, Wei Ding
{"title":"AutoPD: an integrated meta-pipeline for high-throughput X-ray crystallography data processing and structure determination","authors":"Xin Zhang,&nbsp;Haikai Sun,&nbsp;Yu Hu,&nbsp;Zengru Li,&nbsp;Zhi Geng,&nbsp;Zengqiang Gao,&nbsp;Quan Hao,&nbsp;Fazhi Qi,&nbsp;Wei Ding","doi":"10.1107/S1600576725003218","DOIUrl":null,"url":null,"abstract":"<p>The advent of hybrid pixel array detectors and fully automated data acquisition workflows has revolutionized synchrotron light sources, enabling high-throughput collection of diffraction data from biological macromolecular crystals. However, these advancements have also created an urgent need for efficient and fully automated data processing pipelines. To address this challenge, we introduce <i>AutoPD</i>, an open-source high-throughput meta-pipeline for automated data processing and structure determination. Developed for the biological macromolecular crystallography beamline at the High Energy Photon Source in Beijing, <i>AutoPD</i> is also accessible to other academic and synchrotron users. By integrating cutting-edge parallel computing strategies, <i>AlphaFold</i>-assisted molecular replacement, a direct-method-based dual-space-iteration approach for model building, and an adaptive decision-making strategy that dynamically selects the optimal modeling pathway based on data quality and intermediate results, <i>AutoPD</i> streamlines the process from raw diffraction data and sequence files to high-precision structural models. When benchmarked against 186 recently deposited X-ray diffraction datasets from the Protein Data Bank, <i>AutoPD</i> successfully determined structures for 92% of cases, achieving map–model correlation values of at least 0.5 between density-modified electron density maps and the generated models. These results highlight the robustness and efficiency of <i>AutoPD</i> in addressing the challenges of modern structural biology, setting a new standard for automated structure determination.</p>","PeriodicalId":48737,"journal":{"name":"Journal of Applied Crystallography","volume":"58 3","pages":"746-758"},"PeriodicalIF":5.2000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Crystallography","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1107/S1600576725003218","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The advent of hybrid pixel array detectors and fully automated data acquisition workflows has revolutionized synchrotron light sources, enabling high-throughput collection of diffraction data from biological macromolecular crystals. However, these advancements have also created an urgent need for efficient and fully automated data processing pipelines. To address this challenge, we introduce AutoPD, an open-source high-throughput meta-pipeline for automated data processing and structure determination. Developed for the biological macromolecular crystallography beamline at the High Energy Photon Source in Beijing, AutoPD is also accessible to other academic and synchrotron users. By integrating cutting-edge parallel computing strategies, AlphaFold-assisted molecular replacement, a direct-method-based dual-space-iteration approach for model building, and an adaptive decision-making strategy that dynamically selects the optimal modeling pathway based on data quality and intermediate results, AutoPD streamlines the process from raw diffraction data and sequence files to high-precision structural models. When benchmarked against 186 recently deposited X-ray diffraction datasets from the Protein Data Bank, AutoPD successfully determined structures for 92% of cases, achieving map–model correlation values of at least 0.5 between density-modified electron density maps and the generated models. These results highlight the robustness and efficiency of AutoPD in addressing the challenges of modern structural biology, setting a new standard for automated structure determination.

用于高通量x射线晶体学数据处理和结构确定的集成元管道
混合像素阵列探测器和全自动数据采集工作流程的出现彻底改变了同步加速器光源,使高通量收集生物大分子晶体的衍射数据成为可能。然而,这些进步也产生了对高效和全自动数据处理管道的迫切需求。为了应对这一挑战,我们引入了AutoPD,一种用于自动数据处理和结构确定的开源高通量元管道。为北京高能光子源的生物大分子晶体学光束线而开发的AutoPD也可供其他学术和同步加速器用户使用。通过集成先进的并行计算策略、alphafold辅助的分子替换、基于直接方法的双空间迭代建模方法,以及基于数据质量和中间结果动态选择最优建模路径的自适应决策策略,AutoPD简化了从原始衍射数据和序列文件到高精度结构模型的过程。当以来自蛋白质数据库的186个最近沉积的x射线衍射数据集为基准时,AutoPD成功地确定了92%的病例的结构,在密度修正的电子密度图和生成的模型之间实现了至少0.5的图-模型相关值。这些结果突出了AutoPD在解决现代结构生物学挑战方面的稳健性和效率,为自动结构确定设定了新的标准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Applied Crystallography
Journal of Applied Crystallography CHEMISTRY, MULTIDISCIPLINARYCRYSTALLOGRAPH-CRYSTALLOGRAPHY
CiteScore
7.80
自引率
3.30%
发文量
178
期刊介绍: Many research topics in condensed matter research, materials science and the life sciences make use of crystallographic methods to study crystalline and non-crystalline matter with neutrons, X-rays and electrons. Articles published in the Journal of Applied Crystallography focus on these methods and their use in identifying structural and diffusion-controlled phase transformations, structure-property relationships, structural changes of defects, interfaces and surfaces, etc. Developments of instrumentation and crystallographic apparatus, theory and interpretation, numerical analysis and other related subjects are also covered. The journal is the primary place where crystallographic computer program information is published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信