Real-time data processing for serial crystallography experiments

IF 2.9 2区 材料科学 Q2 CHEMISTRY, MULTIDISCIPLINARY
IUCrJ Pub Date : 2025-01-01 DOI:10.1107/S2052252524011837
Thomas White , Tim Schoof , Sergey Yakubov , Aleksandra Tolstikova , Philipp Middendorf , Mikhail Karnevskiy , Valerio Mariani , Alessandra Henkel , Bjarne Klopprogge , Juergen Hannappel , Dominik Oberthuer , Ivan De Gennaro Aquino , Dmitry Egorov , Anna Munke , Janina Sprenger , Guillaume Pompidor , Helena Taberman , Andrey Gruzinov , Jan Meyer , Johanna Hakanpää , Martin Gasthuber
{"title":"Real-time data processing for serial crystallography experiments","authors":"Thomas White ,&nbsp;Tim Schoof ,&nbsp;Sergey Yakubov ,&nbsp;Aleksandra Tolstikova ,&nbsp;Philipp Middendorf ,&nbsp;Mikhail Karnevskiy ,&nbsp;Valerio Mariani ,&nbsp;Alessandra Henkel ,&nbsp;Bjarne Klopprogge ,&nbsp;Juergen Hannappel ,&nbsp;Dominik Oberthuer ,&nbsp;Ivan De Gennaro Aquino ,&nbsp;Dmitry Egorov ,&nbsp;Anna Munke ,&nbsp;Janina Sprenger ,&nbsp;Guillaume Pompidor ,&nbsp;Helena Taberman ,&nbsp;Andrey Gruzinov ,&nbsp;Jan Meyer ,&nbsp;Johanna Hakanpää ,&nbsp;Martin Gasthuber","doi":"10.1107/S2052252524011837","DOIUrl":null,"url":null,"abstract":"<div><div>We report the use of streaming data interfaces to process data in real time from serial crystallography experiments, with a latency of less than 1 s per frame and without requiring intermediate data storage on disk.</div></div><div><div>We report the use of streaming data interfaces to perform fully online data processing for serial crystallography experiments, without storing intermediate data on disk. The system produces Bragg reflection intensity measurements suitable for scaling and merging, with a latency of less than 1 s per frame. Our system uses the <em>CrystFEL</em> software in combination with the ASAP::O data framework. In a series of user experiments at PETRA III, frames from a 16 megapixel Dectris EIGER2 X detector were searched for peaks, indexed and integrated at the maximum full-frame readout speed of 133 frames per second. The computational resources required depend on various factors, most significantly the fraction of non-blank frames (‘hits’). The average single-thread processing time per frame was 242 ms for blank frames and 455 ms for hits, meaning that a single 96-core computing node was sufficient to keep up with the data, with ample headroom for unexpected throughput reductions. Further significant improvements are expected, for example by binning pixel intensities together to reduce the pixel count. We discuss the implications of real-time data processing on the ‘data deluge’ problem from recent and future photon-science experiments, in particular on calibration requirements, computing access patterns and the need for the preservation of raw data.</div></div>","PeriodicalId":14775,"journal":{"name":"IUCrJ","volume":"12 1","pages":"Pages 97-108"},"PeriodicalIF":2.9000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11707691/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IUCrJ","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/org/science/article/pii/S2052252525000065","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

We report the use of streaming data interfaces to process data in real time from serial crystallography experiments, with a latency of less than 1 s per frame and without requiring intermediate data storage on disk.
We report the use of streaming data interfaces to perform fully online data processing for serial crystallography experiments, without storing intermediate data on disk. The system produces Bragg reflection intensity measurements suitable for scaling and merging, with a latency of less than 1 s per frame. Our system uses the CrystFEL software in combination with the ASAP::O data framework. In a series of user experiments at PETRA III, frames from a 16 megapixel Dectris EIGER2 X detector were searched for peaks, indexed and integrated at the maximum full-frame readout speed of 133 frames per second. The computational resources required depend on various factors, most significantly the fraction of non-blank frames (‘hits’). The average single-thread processing time per frame was 242 ms for blank frames and 455 ms for hits, meaning that a single 96-core computing node was sufficient to keep up with the data, with ample headroom for unexpected throughput reductions. Further significant improvements are expected, for example by binning pixel intensities together to reduce the pixel count. We discuss the implications of real-time data processing on the ‘data deluge’ problem from recent and future photon-science experiments, in particular on calibration requirements, computing access patterns and the need for the preservation of raw data.
连续晶体学实验的实时数据处理。
我们报告使用流数据接口来执行串行晶体学实验的完全在线数据处理,而不将中间数据存储在磁盘上。该系统产生适合缩放和合并的布拉格反射强度测量,每帧延迟小于1秒。我们的系统使用CrystFEL软件结合ASAP::O数据框架。在PETRA III的一系列用户实验中,来自1600万像素Dectris EIGER2 X探测器的帧被搜索到峰值,索引并以每秒133帧的最大全帧读出速度集成。所需的计算资源取决于各种因素,最重要的是非空白帧(“命中”)的比例。对于空白帧,每帧的平均单线程处理时间为242 ms,对于命中帧为455 ms,这意味着单个96核计算节点足以处理数据,并且有足够的空间应对意外的吞吐量减少。期望进一步的显著改进,例如将像素强度合并在一起以减少像素计数。我们讨论了实时数据处理对最近和未来光子科学实验中“数据泛滥”问题的影响,特别是在校准要求、计算访问模式和保存原始数据的需要方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IUCrJ
IUCrJ CHEMISTRY, MULTIDISCIPLINARYCRYSTALLOGRAPH-CRYSTALLOGRAPHY
CiteScore
7.50
自引率
5.10%
发文量
95
审稿时长
10 weeks
期刊介绍: IUCrJ is a new fully open-access peer-reviewed journal from the International Union of Crystallography (IUCr). The journal will publish high-profile articles on all aspects of the sciences and technologies supported by the IUCr via its commissions, including emerging fields where structural results underpin the science reported in the article. Our aim is to make IUCrJ the natural home for high-quality structural science results. Chemists, biologists, physicists and material scientists will be actively encouraged to report their structural studies in IUCrJ.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信