可复制的HPC软件部署、模拟和工作流程——远场深层地质储库评估的案例研究

IF 2.8 4区 环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES
Lars Bilke, Thomas Fischer, Dmitri Naumov, Tobias Meisel
{"title":"可复制的HPC软件部署、模拟和工作流程——远场深层地质储库评估的案例研究","authors":"Lars Bilke,&nbsp;Thomas Fischer,&nbsp;Dmitri Naumov,&nbsp;Tobias Meisel","doi":"10.1007/s12665-025-12501-z","DOIUrl":null,"url":null,"abstract":"<div><p>Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"84 17","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s12665-025-12501-z.pdf","citationCount":"0","resultStr":"{\"title\":\"Reproducible HPC software deployments, simulations, and workflows – a case study for far-field deep geological repository assessment\",\"authors\":\"Lars Bilke,&nbsp;Thomas Fischer,&nbsp;Dmitri Naumov,&nbsp;Tobias Meisel\",\"doi\":\"10.1007/s12665-025-12501-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.</p></div>\",\"PeriodicalId\":542,\"journal\":{\"name\":\"Environmental Earth Sciences\",\"volume\":\"84 17\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s12665-025-12501-z.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Earth Sciences\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12665-025-12501-z\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-025-12501-z","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

跨各种高性能计算(HPC)环境的再现性仍然是计算科学的主要挑战,特别是对于复杂的多物理场仿真工作流程。本研究提出了一种在OpenGeoSys (OGS)仿真套件环境中实现逐位结果可重复性的综合方法。鉴于OGS在环境科学应用中的广泛应用,例如放射性废物处理的安全评估和地热能系统的优化,我们的方法提高了这些安全关键领域模拟结果的可靠性、透明度和可接受性,这在远场深层地质储库评估的案例研究中得到了体现。我们使用GNU Guix来定义完全声明的、可验证的软件环境,并将它们部署为可移植的Apptainer容器,从而实现跨多个HPC系统的一致执行。利用AiiDA进行工作流自动化和来源跟踪,我们在三个异构集群上进行模拟和复杂的模拟工作流,确认相同的二进制级输出。结果表明,可复制和可移植的软件环境可以为科学高性能计算提供长期可验证性的途径。我们还展示了如何实现源自软件源代码和模型输入数据的完整数据来源,从而实现完整的仿真工作流结果数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Reproducible HPC software deployments, simulations, and workflows – a case study for far-field deep geological repository assessment

Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Earth Sciences
Environmental Earth Sciences 环境科学-地球科学综合
CiteScore
5.10
自引率
3.60%
发文量
494
审稿时长
8.3 months
期刊介绍: Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth: Water and soil contamination caused by waste management and disposal practices Environmental problems associated with transportation by land, air, or water Geological processes that may impact biosystems or humans Man-made or naturally occurring geological or hydrological hazards Environmental problems associated with the recovery of materials from the earth Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials Management of environmental data and information in data banks and information systems Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信