{"title":"AutoPD: an integrated meta-pipeline for high-throughput X-ray crystallography data processing and structure determination","authors":"Xin Zhang, Haikai Sun, Yu Hu, Zengru Li, Zhi Geng, Zengqiang Gao, Quan Hao, Fazhi Qi, Wei Ding","doi":"10.1107/S1600576725003218","DOIUrl":null,"url":null,"abstract":"<p>The advent of hybrid pixel array detectors and fully automated data acquisition workflows has revolutionized synchrotron light sources, enabling high-throughput collection of diffraction data from biological macromolecular crystals. However, these advancements have also created an urgent need for efficient and fully automated data processing pipelines. To address this challenge, we introduce <i>AutoPD</i>, an open-source high-throughput meta-pipeline for automated data processing and structure determination. Developed for the biological macromolecular crystallography beamline at the High Energy Photon Source in Beijing, <i>AutoPD</i> is also accessible to other academic and synchrotron users. By integrating cutting-edge parallel computing strategies, <i>AlphaFold</i>-assisted molecular replacement, a direct-method-based dual-space-iteration approach for model building, and an adaptive decision-making strategy that dynamically selects the optimal modeling pathway based on data quality and intermediate results, <i>AutoPD</i> streamlines the process from raw diffraction data and sequence files to high-precision structural models. When benchmarked against 186 recently deposited X-ray diffraction datasets from the Protein Data Bank, <i>AutoPD</i> successfully determined structures for 92% of cases, achieving map–model correlation values of at least 0.5 between density-modified electron density maps and the generated models. These results highlight the robustness and efficiency of <i>AutoPD</i> in addressing the challenges of modern structural biology, setting a new standard for automated structure determination.</p>","PeriodicalId":48737,"journal":{"name":"Journal of Applied Crystallography","volume":"58 3","pages":"746-758"},"PeriodicalIF":5.2000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Crystallography","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1107/S1600576725003218","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The advent of hybrid pixel array detectors and fully automated data acquisition workflows has revolutionized synchrotron light sources, enabling high-throughput collection of diffraction data from biological macromolecular crystals. However, these advancements have also created an urgent need for efficient and fully automated data processing pipelines. To address this challenge, we introduce AutoPD, an open-source high-throughput meta-pipeline for automated data processing and structure determination. Developed for the biological macromolecular crystallography beamline at the High Energy Photon Source in Beijing, AutoPD is also accessible to other academic and synchrotron users. By integrating cutting-edge parallel computing strategies, AlphaFold-assisted molecular replacement, a direct-method-based dual-space-iteration approach for model building, and an adaptive decision-making strategy that dynamically selects the optimal modeling pathway based on data quality and intermediate results, AutoPD streamlines the process from raw diffraction data and sequence files to high-precision structural models. When benchmarked against 186 recently deposited X-ray diffraction datasets from the Protein Data Bank, AutoPD successfully determined structures for 92% of cases, achieving map–model correlation values of at least 0.5 between density-modified electron density maps and the generated models. These results highlight the robustness and efficiency of AutoPD in addressing the challenges of modern structural biology, setting a new standard for automated structure determination.
期刊介绍:
Many research topics in condensed matter research, materials science and the life sciences make use of crystallographic methods to study crystalline and non-crystalline matter with neutrons, X-rays and electrons. Articles published in the Journal of Applied Crystallography focus on these methods and their use in identifying structural and diffusion-controlled phase transformations, structure-property relationships, structural changes of defects, interfaces and surfaces, etc. Developments of instrumentation and crystallographic apparatus, theory and interpretation, numerical analysis and other related subjects are also covered. The journal is the primary place where crystallographic computer program information is published.