Mapping and scheduling HPC applications for optimizing I/O

Proceedings of the 34th ACM International Conference on Supercomputing Pub Date : 2020-06-29 DOI:10.1145/3392717.3392764

J. Carretero, E. Jeannot, Guillaume Pallez, D. E. Singh, Nicolas Vidal

{"title":"Mapping and scheduling HPC applications for optimizing I/O","authors":"J. Carretero, E. Jeannot, Guillaume Pallez, D. E. Singh, Nicolas Vidal","doi":"10.1145/3392717.3392764","DOIUrl":null,"url":null,"abstract":"In HPC platforms, concurrent applications are sharing the same file system. This can lead to conflicts, especially as applications are more and more data intensive. I/O contention can represent a performance bottleneck. The access to bandwidth can be split in two complementary yet distinct problems. The mapping problem and the scheduling problem. The mapping problem consists in selecting the set of applications that are in competition for the I/O resource. The scheduling problem consists then, given I/O requests on the same resource, in determining the order to these accesses to minimize the I/O time. In this work we propose to couple a novel bandwidth-aware mapping algorithm to I/O list-scheduling policies to develop a cross-layer optimization solution. We study this solution experimentally using an I/O middleware: CLARISSE. We show that naive policies such as FIFO perform relatively well in order to schedule I/O movements, and that the important part to reduce congestion lies mostly on the mapping part. We evaluate the algorithm that we propose using a simulator that we validated experimentally. This evaluation shows important gains for the simple, bandwidth-aware mapping solution that we provide compared to its non bandwidth-aware counterpart. The gains are both in terms of machine efficiency (makespan) and application efficiency (stretch). This stresses even more the importance of designing efficient, bandwidth-aware mapping strategies to alleviate the cost of I/O congestion.","PeriodicalId":346687,"journal":{"name":"Proceedings of the 34th ACM International Conference on Supercomputing","volume":" 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 34th ACM International Conference on Supercomputing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3392717.3392764","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

In HPC platforms, concurrent applications are sharing the same file system. This can lead to conflicts, especially as applications are more and more data intensive. I/O contention can represent a performance bottleneck. The access to bandwidth can be split in two complementary yet distinct problems. The mapping problem and the scheduling problem. The mapping problem consists in selecting the set of applications that are in competition for the I/O resource. The scheduling problem consists then, given I/O requests on the same resource, in determining the order to these accesses to minimize the I/O time. In this work we propose to couple a novel bandwidth-aware mapping algorithm to I/O list-scheduling policies to develop a cross-layer optimization solution. We study this solution experimentally using an I/O middleware: CLARISSE. We show that naive policies such as FIFO perform relatively well in order to schedule I/O movements, and that the important part to reduce congestion lies mostly on the mapping part. We evaluate the algorithm that we propose using a simulator that we validated experimentally. This evaluation shows important gains for the simple, bandwidth-aware mapping solution that we provide compared to its non bandwidth-aware counterpart. The gains are both in terms of machine efficiency (makespan) and application efficiency (stretch). This stresses even more the importance of designing efficient, bandwidth-aware mapping strategies to alleviate the cost of I/O congestion.

查看原文本刊更多论文

映射和调度HPC应用程序以优化I/O

在HPC平台中，并发应用程序共享相同的文件系统。这可能导致冲突，特别是当应用程序的数据越来越密集时。I/O争用可能是性能瓶颈。对带宽的访问可以分为两个互补但又截然不同的问题。映射问题和调度问题。映射问题包括选择一组竞争I/O资源的应用程序。调度问题包括，给定相同资源上的I/O请求，确定这些访问的顺序以最小化I/O时间。在这项工作中，我们提出将一种新的带宽感知映射算法与I/O列表调度策略相结合，以开发一种跨层优化解决方案。我们使用I/O中间件CLARISSE对该解决方案进行了实验研究。我们表明，为了调度I/O运动，诸如FIFO之类的朴素策略表现相对较好，并且减少拥塞的重要部分主要在于映射部分。我们使用经过实验验证的模拟器来评估我们提出的算法。此评估显示，与非带宽感知的对应方案相比，我们提供的简单、带宽感知的映射解决方案获得了重要的收益。收益体现在机器效率(makespan)和应用效率(stretch)两方面。这更加强调了设计高效、带宽感知的映射策略以减轻I/O拥塞成本的重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 34th ACM International Conference on Supercomputing

自引率

0.00%

发文量