A Comprehensive Analysis of User Job Data on a Petascale Supercomputer Dedicated to CFD

2019 IEEE 5th International Conference on Computer and Communications (ICCC) Pub Date : 2019-12-01 DOI:10.1109/ICCC47050.2019.9064094

Wenxiang Yang, Zhigong Yang, Yongguo Zhou, F. Wang, Cheng Chen, Yueqing Wang

引用次数: 1

Abstract

High performance computing (HPC) systems play a crucial role in performing large-scale scientific applications and their efficiencies are imperative to be improved. This paper aims to comprehensively understand job characteristics and the factors that affect system efficiency and performance, which lays a solid foundation for proposing and evaluating job scheduling and resource management methods. To achieve this goal, we collect job data covering two years from a petascale HPC system that is dedicated to computational fluid dynamics (CFD) applications. Furthermore, a detailed analysis about failed jobs and waiting time is conducted based on the dataset. Our analysis excavates some important characteristics of submitted jobs, which can not only help system owners understand and master the situation about CFD applications in the system, but also provide good guidance and ideas for optimizing job scheduling and resource management algorithms.

查看原文本刊更多论文

千兆级CFD专用超级计算机用户作业数据的综合分析

高性能计算(HPC)系统在执行大规模科学应用中起着至关重要的作用，其效率亟待提高。本文旨在全面了解作业特征以及影响系统效率和性能的因素，为作业调度和资源管理方法的提出和评价奠定坚实的基础。为了实现这一目标，我们从专用于计算流体动力学(CFD)应用的petascale HPC系统中收集了两年的作业数据。在此基础上，对失败作业和等待时间进行了详细的分析。我们的分析挖掘了提交作业的一些重要特征，不仅可以帮助系统所有者了解和掌握CFD在系统中的应用情况，而且可以为优化作业调度和资源管理算法提供良好的指导和思路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE 5th International Conference on Computer and Communications (ICCC)

自引率

0.00%

发文量