Fine-grained Policy-driven I/O Sharing for Burst Buffers

E. Karrels, Lei Huang, Yuhong Kan, Ishank Arora, Yinzhi Wang, D. Katz, W. Gropp, Zhao Zhang
{"title":"Fine-grained Policy-driven I/O Sharing for Burst Buffers","authors":"E. Karrels, Lei Huang, Yuhong Kan, Ishank Arora, Yinzhi Wang, D. Katz, W. Gropp, Zhao Zhang","doi":"10.48550/arXiv.2306.11615","DOIUrl":null,"url":null,"abstract":"A burst buffer is a common method to bridge the performance gap between the I/O needs of modern supercomputing applications and the performance of the shared file system on large-scale supercomputers. However, existing I/O sharing methods require resource isolation, offline profiling, or repeated execution that significantly limit the utilization and applicability of these systems. Here we present ThemisIO, a policy-driven I/O sharing framework for a remote-shared burst buffer: a dedicated group of I/O nodes, each with a local storage device. ThemisIO preserves high utilization by implementing opportunity fairness so that it can reallocate unused I/O resources to other applications. ThemisIO accurately and efficiently allocates I/O cycles among applications, purely based on real-time I/O behavior without requiring user-supplied information or offline-profiled application characteristics. ThemisIO supports a variety of fair sharing policies, such as user-fair, size-fair, as well as composite policies, e.g., group-then-user-fair. All these features are enabled by its statistical token design. ThemisIO can alter the execution order of incoming I/O requests based on assigned tokens to precisely balance I/O cycles between applications via time slicing, thereby enforcing processing isolation. Experiments using I/O benchmarks show that ThemisIO sustains 13.5--13.7% higher I/O throughput and 19.5--40.4% lower performance variation than existing algorithms. For real applications, ThemisIO significantly reduces the slowdown by 59.1--99.8% caused by I/O interference.","PeriodicalId":124077,"journal":{"name":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2306.11615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A burst buffer is a common method to bridge the performance gap between the I/O needs of modern supercomputing applications and the performance of the shared file system on large-scale supercomputers. However, existing I/O sharing methods require resource isolation, offline profiling, or repeated execution that significantly limit the utilization and applicability of these systems. Here we present ThemisIO, a policy-driven I/O sharing framework for a remote-shared burst buffer: a dedicated group of I/O nodes, each with a local storage device. ThemisIO preserves high utilization by implementing opportunity fairness so that it can reallocate unused I/O resources to other applications. ThemisIO accurately and efficiently allocates I/O cycles among applications, purely based on real-time I/O behavior without requiring user-supplied information or offline-profiled application characteristics. ThemisIO supports a variety of fair sharing policies, such as user-fair, size-fair, as well as composite policies, e.g., group-then-user-fair. All these features are enabled by its statistical token design. ThemisIO can alter the execution order of incoming I/O requests based on assigned tokens to precisely balance I/O cycles between applications via time slicing, thereby enforcing processing isolation. Experiments using I/O benchmarks show that ThemisIO sustains 13.5--13.7% higher I/O throughput and 19.5--40.4% lower performance variation than existing algorithms. For real applications, ThemisIO significantly reduces the slowdown by 59.1--99.8% caused by I/O interference.
突发缓冲区的细粒度策略驱动I/O共享
突发缓冲区是弥合现代超级计算应用程序的I/O需求与大型超级计算机上共享文件系统的性能之间的性能差距的常用方法。但是,现有的I/O共享方法需要资源隔离、脱机分析或重复执行,这极大地限制了这些系统的利用率和适用性。这里我们介绍ThemisIO,一个策略驱动的I/O共享框架,用于远程共享突发缓冲区:一组专用的I/O节点,每个节点都有一个本地存储设备。ThemisIO通过实现机会公平来保持高利用率,这样它就可以将未使用的I/O资源重新分配给其他应用程序。ThemisIO完全基于实时I/O行为,在应用程序之间准确有效地分配I/O周期,而不需要用户提供的信息或离线配置的应用程序特征。ThemisIO支持各种公平共享策略,例如user-fair、size-fair以及组合策略,例如group-then-user-fair。所有这些特性都是通过其统计令牌设计实现的。ThemisIO可以根据分配的令牌改变传入I/O请求的执行顺序,通过时间切片精确地平衡应用程序之间的I/O周期,从而强制处理隔离。使用I/O基准测试的实验表明,与现有算法相比,ThemisIO的I/O吞吐量提高了13.5—13.7%,性能变化降低了19.5—40.4%。对于实际应用,ThemisIO显著降低了由I/O干扰引起的59.1—99.8%的速度减慢。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信