Learning from the present for the future: The Jülich LOFAR Long-term Archive

IF 1.9 · Zone 4 (Physics and Astrophysics) · JCR Q2 (ASTRONOMY & ASTROPHYSICS)
C. Manzano, A. Miskolczi, H. Stiele, V. Vybornov, T. Fieseler, S. Pfalzner
Astronomy and Computing, volume 48, Article 100835. Published 2024-05-20. Journal Article.
DOI: 10.1016/j.ascom.2024.100835
https://www.sciencedirect.com/science/article/pii/S2213133724000507
Citations: 0

Abstract


The Forschungszentrum Jülich has been hosting the German part of the LOFAR archive since 2013. It is Germany’s most extensive radio astronomy archive, currently storing nearly 22 petabytes (PB) of data. Future radio telescopes are expected to require a dramatic increase in long-term data storage. Here, we take stock of the current data management of the Jülich LOFAR Data Archive, describe the ingestion, the storage system, the export to the long-term archive, and the request chain. We analysed the data availability over the last 10 years and searched for the underlying data access pattern and the energy consumption of the process. We determine hardware-related limiting factors, such as network bandwidth and cache pool availability and performance, and software aspects, e.g. workflow adjustment and parameter tuning, as the main data storage bottlenecks. By contrast, the challenge in providing the data from the archive for the users lies in retrieving the data from the tape archive and staging them. Building on this analysis, we suggest how to avoid/mitigate these problems in the future and define the requirements for future even more extensive long-term data archives.

Source journal: Astronomy and Computing
Categories: ASTRONOMY & ASTROPHYSICS; COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
CiteScore: 4.10
Self-citation rate: 8.00%
Articles published: 67
Journal description: Astronomy and Computing is a peer-reviewed journal that focuses on the broad area between astronomy, computer science and information technology. The journal aims to publish the work of scientists and (software) engineers in all aspects of astronomical computing, including the collection, analysis, reduction, visualisation, preservation and dissemination of data, and the development of astronomical software and simulations. The journal covers applications of academic computer science techniques to astronomy, as well as novel applications of information technologies within astronomy.