Workflow Management for Real-Time Analysis of Lightsource Experiments

J. Deslippe, Abdelilah Essiari, S. Patton, T. Samak, C. Tull, A. Hexemer, G. Kumar, D. Parkinson, Polite Stewart
{"title":"Workflow Management for Real-Time Analysis of Lightsource Experiments","authors":"J. Deslippe, Abdelilah Essiari, S. Patton, T. Samak, C. Tull, A. Hexemer, G. Kumar, D. Parkinson, Polite Stewart","doi":"10.1109/WORKS.2014.9","DOIUrl":null,"url":null,"abstract":"The Advanced lightsource (ALS) is a X-ray synchrotron facility at Lawrence Berkeley National Laboratory. The ALS generates terabytes of raw and derived data each day and serves 1,000's of researchers each year. Only a subset of the data is analyzed due to barriers in terms of processing that small science teams are ill-equipped to surmount. In this paper, we discuss the development and application of a computational framework, termed SPOT, fed with synchrotron data, powered by storage, networking and compute resources at NERSC and ESnet. We describe issues and recommendations for an end-to-end analysis workflow for ALS data. After one year of operation, the collection contains over 90,000 datasets (550 TB) from 85 users across three beamlines. For 16 months, beamline data taken has been promptly and automatically analyzed and annotated with metadata, allowing users to focus on analysis, conclusions and experiments.","PeriodicalId":206005,"journal":{"name":"2014 9th Workshop on Workflows in Support of Large-Scale Science","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 9th Workshop on Workflows in Support of Large-Scale Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WORKS.2014.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23

Abstract

The Advanced lightsource (ALS) is a X-ray synchrotron facility at Lawrence Berkeley National Laboratory. The ALS generates terabytes of raw and derived data each day and serves 1,000's of researchers each year. Only a subset of the data is analyzed due to barriers in terms of processing that small science teams are ill-equipped to surmount. In this paper, we discuss the development and application of a computational framework, termed SPOT, fed with synchrotron data, powered by storage, networking and compute resources at NERSC and ESnet. We describe issues and recommendations for an end-to-end analysis workflow for ALS data. After one year of operation, the collection contains over 90,000 datasets (550 TB) from 85 users across three beamlines. For 16 months, beamline data taken has been promptly and automatically analyzed and annotated with metadata, allowing users to focus on analysis, conclusions and experiments.
光源实验实时分析的工作流管理
先进光源(ALS)是劳伦斯伯克利国家实验室的x射线同步加速器设备。ALS每天产生数tb的原始和衍生数据,每年为1000名研究人员提供服务。由于处理方面的障碍,小型科学团队没有能力克服,因此只有一小部分数据被分析。在本文中,我们讨论了一个计算框架的开发和应用,称为SPOT,由同步加速器数据提供,由NERSC和ESnet的存储、网络和计算资源提供动力。我们描述了ALS数据端到端分析工作流的问题和建议。经过一年的运行,该收集包含来自三个光束线的85个用户的90,000多个数据集(550 TB)。16个月来,采集的光束线数据已被及时、自动地分析和注释元数据,让用户专注于分析、结论和实验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信