Using On-Demand File Systems in HPC Environments

Mehmet Soysal, M. Berghoff, T. Zirwes, Marc-André Vef, Sebastian Oeste, A. Brinkmann, W. Nagel, A. Streit
Published in: 2019 International Conference on High Performance Computing & Simulation (HPCS)
Publication date: 2019-07-01
DOI: 10.1109/HPCS48598.2019.9188216 (https://doi.org/10.1109/HPCS48598.2019.9188216)
Citations: 5

Abstract

In modern HPC systems, parallel (distributed) file systems provide fast access to the storage infrastructure. However, I/O performance in large-scale HPC systems has failed to keep up with the increase in computational power. As a result, the I/O subsystem, which also has to cope with a large number of demanding metadata operations, is often the bottleneck of the entire HPC system. In some cases, even a single badly behaving application can slow down the entire HPC system and disrupt other applications that use the same I/O subsystem. Such situations are likely to become more frequent as HPC systems grow larger and more powerful. In this work, we present a simple solution for applications with very high I/O demands: creating a private parallel file system on demand for an HPC job, using the node-local storage devices, e.g., solid-state drives (SSDs). We show that this feature is easy to add to an existing HPC environment and requires only minimal changes to the system configuration. We conclude that the impact on running applications is manageable and that, for applications that generate a high load, the advantages outweigh the disadvantages. In some cases applications may run slower, but in these cases the reduction of load on the global file system prevails.
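
The idea of building a job-private file system from node-local devices can be illustrated with a small planning sketch. It is not the authors' implementation: the class `OnDemandLayout`, the function `plan_layout`, the node names, and the SSD size are all hypothetical, and the role assignment (every node acts as storage target and client, the first node also hosts a metadata service) is an assumption made only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OnDemandLayout:
    """Role assignment for a job-private file system (hypothetical structure)."""
    metadata_nodes: tuple
    storage_nodes: tuple
    client_nodes: tuple
    capacity_gb: int


def plan_layout(nodes, ssd_gb_per_node, metadata_count=1):
    """Assign file-system roles to the nodes allocated to one job.

    Assumption: every node contributes its local SSD as a storage target
    and also mounts the file system as a client; the first
    `metadata_count` nodes additionally host a metadata service.
    """
    nodes = tuple(nodes)
    if not nodes:
        raise ValueError("a job needs at least one node")
    if not 1 <= metadata_count <= len(nodes):
        raise ValueError("metadata_count must be between 1 and the node count")
    return OnDemandLayout(
        metadata_nodes=nodes[:metadata_count],
        storage_nodes=nodes,
        client_nodes=nodes,
        # Aggregate capacity is the sum of the node-local devices.
        capacity_gb=ssd_gb_per_node * len(nodes),
    )


# Hypothetical four-node job with 400 GB of local SSD per node.
layout = plan_layout(["node01", "node02", "node03", "node04"], ssd_gb_per_node=400)
print(layout.metadata_nodes)  # ('node01',)
print(layout.capacity_gb)     # 1600
```

In a real deployment, a plan like this would presumably be turned into service start-up and mount steps when the job begins, and torn down when it ends; those steps depend on the chosen file system and batch system and are left out here.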