Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring

Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis
{"title":"Enabling High-Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring","authors":"Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, René Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis","doi":"arxiv-2408.02869","DOIUrl":null,"url":null,"abstract":"Large-scale HPC simulations of plasma dynamics in fusion devices require\nefficient parallel I/O to avoid slowing down the simulation and to enable the\npost-processing of critical information. Such complex simulations lacking\nparallel I/O capabilities may encounter performance bottlenecks, hindering\ntheir effectiveness in data-intensive computing tasks. In this work, we focus\non introducing and enhancing the efficiency of parallel I/O operations in\nParticle-in-Cell Monte Carlo simulations. We first evaluate the scalability of\nBIT1, a massively-parallel electrostatic PIC MC code, determining its initial\nwrite throughput capabilities and performance bottlenecks using an HPC I/O\nperformance monitoring tool, Darshan. We design and develop an adaptor to the\nopenPMD I/O interface that allows us to stream PIC particle and field\ninformation to I/O using the BP4 backend, aggressively optimized for I/O\nefficiency, including the highly efficient ADIOS2 interface. Next, we explore\nadvanced optimization techniques such as data compression, aggregation, and\nLustre file striping, achieving write throughput improvements while enhancing\ndata storage efficiency. Finally, we analyze the enhanced high-throughput\nparallel I/O and storage capabilities achieved through the integration of\nopenPMD with rapid metadata extraction in BP4 format. Our study demonstrates\nthat the integration of openPMD and advanced I/O optimizations significantly\nenhances BIT1's I/O performance and storage capabilities, successfully\nintroducing high throughput parallel I/O and surpassing the capabilities of\ntraditional file I/O.","PeriodicalId":501274,"journal":{"name":"arXiv - PHYS - Plasma Physics","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Plasma Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02869","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information. Such complex simulations lacking parallel I/O capabilities may encounter performance bottlenecks, hindering their effectiveness in data-intensive computing tasks. In this work, we focus on introducing and enhancing the efficiency of parallel I/O operations in Particle-in-Cell Monte Carlo simulations. We first evaluate the scalability of BIT1, a massively-parallel electrostatic PIC MC code, determining its initial write throughput capabilities and performance bottlenecks using an HPC I/O performance monitoring tool, Darshan. We design and develop an adaptor to the openPMD I/O interface that allows us to stream PIC particle and field information to I/O using the BP4 backend, aggressively optimized for I/O efficiency, including the highly efficient ADIOS2 interface. Next, we explore advanced optimization techniques such as data compression, aggregation, and Lustre file striping, achieving write throughput improvements while enhancing data storage efficiency. Finally, we analyze the enhanced high-throughput parallel I/O and storage capabilities achieved through the integration of openPMD with rapid metadata extraction in BP4 format. Our study demonstrates that the integration of openPMD and advanced I/O optimizations significantly enhances BIT1's I/O performance and storage capabilities, successfully introducing high throughput parallel I/O and surpassing the capabilities of traditional file I/O.
利用 openPMD 和 Darshan I/O 监控在细胞内粒子蒙特卡罗模拟中实现高吞吐量并行 I/O
对核聚变装置中的等离子体动力学进行大规模高性能计算仿真需要高效的并行 I/O,以避免降低仿真速度,并对关键信息进行后处理。缺乏并行 I/O 能力的此类复杂仿真可能会遇到性能瓶颈,从而影响其在数据密集型计算任务中的效率。在这项工作中,我们重点关注在粒子池蒙特卡罗模拟中引入并提高并行 I/O 操作的效率。我们首先评估了大规模并行静电 PIC MC 代码 BIT1 的可扩展性,利用 HPC I/O 性能监控工具 Darshan 确定了其初始写入吞吐能力和性能瓶颈。我们设计并开发了openPMD I/O接口的适配器,允许我们使用BP4后端将PIC粒子和场信息流传输到I/O,并针对I/O效率进行了积极优化,包括高效的ADIOS2接口。接下来,我们探索了数据压缩、聚合和 Lustre 文件条带化等高级优化技术,在提高写入吞吐量的同时提升了数据存储效率。最后,我们分析了通过将 openPMD 与 BP4 格式的快速元数据提取集成所实现的增强型高吞吐量并行 I/O 和存储能力。我们的研究表明,openPMD 与高级 I/O 优化的集成大大提高了 BIT1 的 I/O 性能和存储能力,成功引入了高吞吐量并行 I/O,并超越了传统文件 I/O 的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信