International Journal of High Performance Computing Applications: latest articles, page 3

Corrigendum to large-scale direct numerical simulations of turbulence using GPUs and modern Fortran

Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-05-05 DOI: 10.1177/10943420231173573

Citations: 0

A study on the performance of distributed training of data-driven CFD simulations

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-05-04 DOI: 10.1177/10943420231160557

Sergio Iserte, Alejandro González-Barberá, Paloma Barreda, K. Rojek

Abstract: Data-driven methods for computer simulations are blooming in many scientific areas. The traditional approach to simulating physical behaviors relies on solving partial differential equations (PDEs). Since solving these equations iteratively is both computationally demanding and time-consuming, data-driven methods leverage artificial intelligence (AI) techniques to alleviate that workload. Data-driven methods have to be trained in advance to provide their subsequent fast predictions; however, the cost of the training stage is non-negligible. This article presents a predictive model for inferring future states of a specific fluid simulation that serves as a use case for evaluating different training alternatives. In particular, this study compares the performance of CPU-only, multi-GPU, and distributed approaches for training a time series forecasting deep learning model. With some slight code adaptations, the results show and compare, across different implementations, the benefits of distributed GPU-enabled training for predicting high-accuracy states in a fraction of the time needed by the computational fluid dynamics solver.

Vol. 37, pp. 503-515.

Citations: 1

Orchestration of materials science workflows for heterogeneous resources at large scale

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-04-14 DOI: 10.1177/10943420231167800

Naweiluo Zhou, G. Scorzelli, Jakob Luettgau, R. Kancharla, Joshua J. Kane, Robert Wheeler, B. Croom, P. Newell, Valerio Pascucci, M. Taufer

Abstract: In the era of big data, materials science workflows need to handle large-scale data distribution, storage, and computation. Any of these areas can become a performance bottleneck. We present a framework for analyzing internal material structures (e.g., cracks) to mitigate these bottlenecks. We demonstrate the effectiveness of our framework for a workflow performing synchrotron X-ray computed tomography reconstruction and segmentation of a silica-based structure. Our framework provides a cloud-based, cutting-edge solution to challenges such as growing intermediate and output data and heavy resource demands during image reconstruction and segmentation. Specifically, our framework efficiently manages data storage, scaling up compute resources on the cloud. The software structure of our framework comprises three layers. A top layer uses Jupyter notebooks and serves as the user interface. A middle layer uses Ansible for resource deployment and managing the execution environment. A low layer provides resource management and job scheduling on heterogeneous nodes (i.e., GPU and CPU). At the core of this layer, Kubernetes supports resource management, and Dask enables large-scale job scheduling for heterogeneous resources. The broader impact of our work is four-fold: through our framework, we hide the complexity of the cloud’s software stack from the user, who would otherwise need expertise in cloud technologies; we manage job scheduling efficiently and in a scalable manner; we enable resource elasticity and workflow orchestration at a large scale; and we facilitate moving the study of nonporous structures, which has wide applications in engineering and scientific fields, to the cloud. While we demonstrate the capability of our framework for a specific materials science application, it can be adapted for other applications and domains because of its modular, multi-layer architecture.

Vol. 37, pp. 260-271.

Citations: 1

Versatile software-defined HPC and cloud clusters on Alps supercomputer for diverse workflows

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-04-11 DOI: 10.1177/10943420231167811

S. Alam, M. Gila, Mark Klein, Maxime Martinasso, T. Schulthess

Abstract: Supercomputers have been driving innovations in performance and scaling, benefiting several scientific applications for the past few decades. Yet their ecosystems remain virtually unchanged when it comes to integrating distributed data-driven workflows, primarily due to rather rigid access methods and restricted configuration management options. The X-as-a-Service model of the cloud has introduced, among other features, a developer-centric DevOps approach that empowers developers of artefacts ranging from infrastructure and platform to software, which contemporary supercomputers unfortunately still lack. We introduce vClusters (versatile software-defined clusters), which are based on Infrastructure-as-code (IaC) technology. The vClusters approach is a unique fusion of HPC and cloud technologies resulting in a software-defined, multi-tenant cluster on a supercomputing ecosystem that, together with software-defined storage, enables DevOps for complex, data-driven workflows like grid middleware, alongside a classic HPC platform. IaC has been commonplace in cloud computing; however, it has lacked adoption within multi-Petascale ecosystems due to concerns related to performance and interoperability with classic HPC data centres’ ecosystems. We present an overview of the Swiss National Supercomputing Centre’s flagship Alps ecosystem as an implementation target for vClusters for HPC and data-driven workflows. Alps is based on the Cray-HPE Shasta EX supercomputing platform, which includes an IaC-compliant, microservices architecture (MSA) management system that we leverage to demonstrate vClusters usage for our diverse operational workflows. We provide implementation details of two operational vClusters platforms: a classic HPC platform that is used predominantly by hundreds of users running thousands of large-scale numerical simulation batch jobs; and a widely used, data-intensive, Grid computing middleware platform used for CERN Worldwide LHC Computing Grid (WLCG) operations. The resulting solution showcases reuse and reduction of common configuration recipes across vCluster implementations, minimising operational change management overheads while introducing flexibility for managing artefacts for DevOps required by diverse workflows.

Vol. 37, pp. 288-305.

Citations: 0

A Survey of Graph Comparison Methods with Applications to Nondeterminism in High-Performance Computing

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-04-05 DOI: 10.1177/10943420231166610

S. Bhowmick, Patrick Bell, M. Taufer

Abstract: The convergence of extremely high levels of hardware concurrency and the effective overlap of computation and communication in asynchronous executions has resulted in increasing nondeterminism in High-Performance Computing (HPC) applications. Nondeterminism can manifest at multiple levels: from low-level communication primitives to libraries to application-level functions. No matter its source, nondeterminism can drastically increase the cost of result reproducibility, debugging workflows, testing parallel programs, or ensuring fault-tolerance. Nondeterministic executions of HPC applications can be modeled as event graphs, and the applications’ nondeterministic behavior can be understood and, in some cases, mitigated using graph comparison algorithms. However, a connection between graph comparison algorithms and approaches to understanding nondeterminism in HPC still needs to be established. This survey article takes the first steps toward establishing that connection with its three contributions: it provides a survey of different graph comparison algorithms and a timeline for each category’s significant works; it discusses how existing graph comparison methods do not fully support properties needed to understand nondeterministic patterns in HPC applications; and it presents the open challenges that should be addressed to leverage the power of graph comparisons for the study of nondeterminism in HPC applications.

Vol. 37, pp. 306-327.
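As a rough illustration of what comparing event graphs can mean, the sketch below scores two executions by the Jaccard similarity of their edge sets. This is a deliberately simple stand-in chosen for illustration, not a method taken from the survey; the event names, the two toy runs, and the function `jaccard_edge_similarity` are all hypothetical.

```python
# Hypothetical sketch: each execution is an event graph given as a list of
# directed edges (u, v), meaning event u precedes or is matched with event v.

def jaccard_edge_similarity(edges_a, edges_b):
    """Return |A & B| / |A | B| for two edge sets (1.0 if both are empty)."""
    a, b = set(edges_a), set(edges_b)
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)

# Two toy runs of the same program. Rank 0 posts two wildcard receives, so the
# sends from ranks 1 and 2 may be matched in either order (assumed scenario).
run_1 = [("send_r1", "recv0_a"), ("send_r2", "recv0_b"), ("recv0_a", "recv0_b")]
run_2 = [("send_r2", "recv0_a"), ("send_r1", "recv0_b"), ("recv0_a", "recv0_b")]

# Only the program-order edge is shared, so the score is 1 shared / 5 total.
similarity = jaccard_edge_similarity(run_1, run_2)
print(f"edge-set similarity between runs: {similarity:.2f}")
```

A score of 1.0 would indicate identical edge sets across runs; lower values indicate that message matching differed. An edge-set measure like this ignores graph structure beyond individual edges, which is one kind of limitation that motivates richer comparison methods.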

Citations: 0

Combining multitask and transfer learning with deep Gaussian processes for autotuning-based performance engineering

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-03-30 DOI: 10.1177/10943420231166365

P. Luszczek, Wissam M. Sid-Lakhdar, J. Dongarra

Citations: 1

Automatizing the creation of specialized high-performance computing containers

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-03-29 DOI: 10.1177/10943420231165729

J. Ejarque, Rosa M. Badia

Citations: 1

Accelerating cluster dynamics simulation of fission gas behavior in nuclear fuel on deep computing unit–based heterogeneous architecture supercomputer

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-03-14 DOI: 10.1177/10943420231162831

He Bai, Changjun Hu, Yuhan Zhu, Dandan Chen, Genshen Chu, Shuai Ren

Abstract: High-fidelity simulation of fission gas behavior can help us understand and predict the performance of nuclear fuel under different irradiation conditions. Cluster dynamics (CD) is a mesoscale simulation method that has developed rapidly in nuclear fuel research in recent years, and it can effectively describe the microdynamic behavior of fission gas in nuclear fuel; however, the huge computational cost of solving the CD model has limited its application scenarios. Thus, how to design an acceleration algorithm for the given computing resources to improve computing efficiency and simulation scale has become a key problem in CD simulation. In this work, we present an accelerated cluster dynamics model based on the spatially dependent cluster dynamics model, combined with multiple optimization methods on a DCU (deep computing unit)-based heterogeneous architecture supercomputer. The correctness of the model is verified by comparison with experimental data and with Xolotl, software from the SciDAC program of the U.S. Department of Energy’s Office of Science. Furthermore, our model implementation has better computing performance than Xolotl’s GPU version. Our code achieves strong/weak scaling with more than 72.75%/84.07% parallel efficiency on 1024 compute nodes. This work developed a new efficient model for CD simulation of fission gas in nuclear fuel.

Vol. 37, pp. 516-529.
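For readers unfamiliar with the strong/weak scaling figures quoted above, the following minimal sketch shows the commonly used definitions of parallel efficiency. The abstract does not state which baseline or formula the authors used, so these definitions are an assumption, and the timings below are invented for illustration and are not the paper's measurements.

```python
# Commonly used definitions (assumed here, not confirmed by the abstract).

def strong_scaling_efficiency(t_base, n_base, t_n, n):
    """Fixed total problem size: ideal runtime shrinks in proportion to node count."""
    return (t_base * n_base) / (t_n * n)

def weak_scaling_efficiency(t_base, t_n):
    """Fixed problem size per node: ideal runtime stays constant as nodes grow."""
    return t_base / t_n

# Hypothetical timings in seconds (made up for this example).
strong = strong_scaling_efficiency(t_base=100.0, n_base=1, t_n=0.125, n=1024)
weak = weak_scaling_efficiency(t_base=100.0, t_n=125.0)

print(f"strong scaling efficiency: {strong:.2%}")
print(f"weak scaling efficiency:   {weak:.2%}")
```

Under these definitions, an efficiency of 100% means perfect scaling, and values such as those reported in the abstract indicate how much of the ideal speedup is retained at 1024 nodes.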

Citations: 1

Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-03-10 DOI: 10.1177/10943420231157653

Rafael Rodríguez-Sánchez, Adrián Castelló, Sandra Catalán, Francisco D. Igual, E. S. Quintana‐Ortí

Citations: 0

Myths and legends in high-performance computing

IF 3.1, Zone 3, Computer Science

International Journal of High Performance Computing Applications Pub Date : 2023-01-06 DOI: 10.1177/10943420231166608

S. Matsuoka, Jens Domke, M. Wahib, Aleksandr Drozd, T. Hoefler

Abstract: In this thought-provoking article, we discuss certain myths and legends that are folklore among members of the high-performance computing community. We gathered these myths from conversations at conferences and meetings, product advertisements, papers, and other communications such as tweets, blogs, and news articles within and beyond our community. We believe they represent the zeitgeist of the current era of massive change, driven by the end of many scaling laws such as Dennard scaling and Moore’s law. While some laws end, new directions are emerging, such as algorithmic scaling or novel architecture research. Nevertheless, these myths are rarely based on scientific facts, but rather on some evidence or argumentation. In fact, we believe that this is the very reason for the existence of many myths and why they cannot be answered clearly. While it feels like there should be clear answers for each, some may remain endless philosophical debates, such as whether Beethoven was better than Mozart. We would like to see our collection of myths as a discussion of possible new directions for research and industry investment.

Vol. 37, pp. 245-259.

Citations: 5