A service-based approach to cryoEM facility processing pipelines at eBIC.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
ACS Applied Bio Materials Pub Date : 2024-03-01 Epub Date: 2024-02-20 DOI:10.1107/S2059798324000986
Anna Horstmann, Stephen Riggs, Yuriy Chaban, Daniel K Clare, Guilherme de Freitas, David Farmer, Andrew Howe, Kyle L Morris, Daniel Hatton
{"title":"A service-based approach to cryoEM facility processing pipelines at eBIC.","authors":"Anna Horstmann, Stephen Riggs, Yuriy Chaban, Daniel K Clare, Guilherme de Freitas, David Farmer, Andrew Howe, Kyle L Morris, Daniel Hatton","doi":"10.1107/S2059798324000986","DOIUrl":null,"url":null,"abstract":"<p><p>Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10910546/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1107/S2059798324000986","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/20 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.

Abstract Image

eBIC 的低温电子显微镜设施处理管道采用基于服务的方法。
电子低温显微图像处理工作流程通常由一些元素组成,从广义上讲,这些元素可被归类为高吞吐量工作负载,随着预处理数据的汇总,这些工作负载会过渡到高性能工作负载。高吞吐量元素在实时处理中尤为重要,因为在实时处理中,最佳响应与数据收集的时间轮廓高度相关。换句话说,每部电影都应尽早尽快处理。高吞吐量问题中的高水平断开并行化直接允许在分布式计算机系统中采用完全可扩展的解决方案,唯一的技术障碍是高效可靠的实施。主要为部署高可用性网络应用程序而开发的云计算框架为此类高吞吐量处理任务提供了一个具有许多吸引人特性的环境。在此,我们将讨论使用部署在 Kubernetes 集群上的基于服务的架构为电子低温显像实验实施早期处理流水线的情况,以展示这种方法的优势,以及如何将其扩展到复杂性大大增加的场景中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信