Scipion3: A workflow engine for cryo-electron microscopy image processing and structural biology

Biological Imaging. Pub Date: 2023-06-29; eCollection Date: 2023-01-01; DOI: 10.1017/S2633903X23000132
Pablo Conesa, Yunior C Fonseca, Jorge Jiménez de la Morena, Grigory Sharov, Jose Miguel de la Rosa-Trevín, Ana Cuervo, Alberto García Mena, Borja Rodríguez de Francisco, Daniel Del Hoyo, David Herreros, Daniel Marchan, David Strelak, Estrella Fernández-Giménez, Erney Ramírez-Aportela, Federico Pedro de Isidro-Gómez, Irene Sánchez, James Krieger, José Luis Vilas, Laura Del Cano, Marcos Gragera, Mikel Iceta, Marta Martínez, Patricia Losana, Roberto Melero, Roberto Marabini, José María Carazo, Carlos Oscar Sánchez Sorzano

Abstract

Scipion3: A workflow engine for cryo-electron microscopy image processing and structural biology.

Image-processing pipelines require the design of complex workflows combining many different steps that bring the raw acquired data to a final result with biological meaning. In the image-processing domain of cryo-electron microscopy single-particle analysis (cryo-EM SPA), hundreds of steps must be performed to obtain the three-dimensional structure of a biological macromolecule by integrating data spread over thousands of micrographs containing millions of copies of allegedly the same macromolecule. The execution of such complicated workflows demands a specific tool to keep track of all these steps performed. Additionally, due to the extremely low signal-to-noise ratio (SNR), the estimation of any image parameter is heavily affected by noise resulting in a significant fraction of incorrect estimates. Although low SNR and processing millions of images by hundreds of sequential steps requiring substantial computational resources are specific to cryo-EM, these characteristics may be shared by other biological imaging domains. Here, we present Scipion, a Python generic open-source workflow engine specifically adapted for image processing. Its main characteristics are: (a) interoperability, (b) smart object model, (c) gluing operations, (d) comparison operations, (e) wide set of domain-specific operations, (f) execution in streaming, (g) smooth integration in high-performance computing environments, (h) execution with and without graphical capabilities, (i) flexible visualization, (j) user authentication and private access to private data, (k) scripting capabilities, (l) high performance, (m) traceability, (n) reproducibility, (o) self-reporting, (p) reusability, (q) extensibility, (r) software updates, and (s) non-restrictive software licensing.
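
The abstract's point about needing a tool "to keep track of all these steps performed" can be illustrated with a minimal Python sketch of step tracking. This is not Scipion's API: the `Workflow` and `StepRecord` classes, the `digest` helper, and the toy `threshold` and `scale` steps are all hypothetical names invented for illustration, and the data are plain lists rather than micrographs. The sketch only shows the general idea behind traceability: each executed step is logged with its parameters and with fingerprints of its input and output, so the chain of operations can be audited afterwards.

```python
import hashlib
import json
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class StepRecord:
    """Provenance entry for one executed step (hypothetical structure)."""
    name: str
    params: Dict[str, Any]
    input_digest: str
    output_digest: str


def digest(obj: Any) -> str:
    """Short, stable fingerprint of a JSON-serializable object."""
    payload = json.dumps(obj, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


@dataclass
class Workflow:
    """Runs steps one at a time and records what was done to the data."""
    history: List[StepRecord] = field(default_factory=list)

    def run(self, name: str, func: Callable[..., Any], data: Any, **params: Any) -> Any:
        result = func(data, **params)
        # Log the step with its parameters and input/output fingerprints.
        self.history.append(StepRecord(name, params, digest(data), digest(result)))
        return result


# Toy steps standing in for real image-processing operations (assumptions).
def threshold(values, cutoff):
    return [v for v in values if v >= cutoff]


def scale(values, factor):
    return [v * factor for v in values]


wf = Workflow()
data = [0.2, 0.9, 0.5, 0.7]
kept = wf.run("threshold", threshold, data, cutoff=0.5)
scaled = wf.run("scale", scale, kept, factor=2)

for record in wf.history:
    print(record.name, record.params, record.input_digest, "->", record.output_digest)
```

Because the output fingerprint of one step matches the input fingerprint of the next, the recorded history links the steps into a chain that can be checked when rerunning the same workflow.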
