Lars Bilke, Thomas Fischer, Dmitri Naumov, Tobias Meisel
{"title":"可复制的HPC软件部署、模拟和工作流程——远场深层地质储库评估的案例研究","authors":"Lars Bilke, Thomas Fischer, Dmitri Naumov, Tobias Meisel","doi":"10.1007/s12665-025-12501-z","DOIUrl":null,"url":null,"abstract":"<div><p>Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.</p></div>","PeriodicalId":542,"journal":{"name":"Environmental Earth Sciences","volume":"84 17","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s12665-025-12501-z.pdf","citationCount":"0","resultStr":"{\"title\":\"Reproducible HPC software deployments, simulations, and workflows – a case study for far-field deep geological repository assessment\",\"authors\":\"Lars Bilke, Thomas Fischer, Dmitri Naumov, Tobias Meisel\",\"doi\":\"10.1007/s12665-025-12501-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.</p></div>\",\"PeriodicalId\":542,\"journal\":{\"name\":\"Environmental Earth Sciences\",\"volume\":\"84 17\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s12665-025-12501-z.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Earth Sciences\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12665-025-12501-z\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Earth Sciences","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s12665-025-12501-z","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Reproducible HPC software deployments, simulations, and workflows – a case study for far-field deep geological repository assessment
Reproducibility across diverse high-performance computing (HPC) environments remains a major challenge in computational science, particularly for complex, multi-physics simulation workflows. This study presents a comprehensive approach to achieving bit-for-bit result reproducibility in the context of the OpenGeoSys (OGS) simulation suite. Given the widespread use of OGS in environmental science applications such as safety assessments for radioactive waste disposal and the optimisation of geothermal energy systems our approach enhances the reliability, transparency, and acceptance of simulation results in these safety-critical domains, shown in a case study for far-field deep geological repository assessment. We use GNU Guix to define fully declarative, verifiable software environments and deploy them as portable Apptainer containers, enabling consistent execution across multiple HPC systems. Leveraging AiiDA for workflow automation and provenance tracking, we conduct simulations and complex simulation workflows on three heterogeneous clusters, confirming identical binary-level outputs. The results demonstrate that reproducible and portable software environments can offer a pathway toward long-term verifiability in scientific high-performance computing. We also show how full data provenance originating from software source code and model input data resulting in full simulation workflow result data can be achieved.
期刊介绍:
Environmental Earth Sciences is an international multidisciplinary journal concerned with all aspects of interaction between humans, natural resources, ecosystems, special climates or unique geographic zones, and the earth:
Water and soil contamination caused by waste management and disposal practices
Environmental problems associated with transportation by land, air, or water
Geological processes that may impact biosystems or humans
Man-made or naturally occurring geological or hydrological hazards
Environmental problems associated with the recovery of materials from the earth
Environmental problems caused by extraction of minerals, coal, and ores, as well as oil and gas, water and alternative energy sources
Environmental impacts of exploration and recultivation – Environmental impacts of hazardous materials
Management of environmental data and information in data banks and information systems
Dissemination of knowledge on techniques, methods, approaches and experiences to improve and remediate the environment
In pursuit of these topics, the geoscientific disciplines are invited to contribute their knowledge and experience. Major disciplines include: hydrogeology, hydrochemistry, geochemistry, geophysics, engineering geology, remediation science, natural resources management, environmental climatology and biota, environmental geography, soil science and geomicrobiology.