使用服务质量通道来控制突发缓冲区内突袭流量的影响

Elsa Gonsiorowski, C. Carothers, Justin M. LaPre, P. Heidelberger, C. Minkenberg, G. Rodríguez
{"title":"使用服务质量通道来控制突发缓冲区内突袭流量的影响","authors":"Elsa Gonsiorowski, C. Carothers, Justin M. LaPre, P. Heidelberger, C. Minkenberg, G. Rodríguez","doi":"10.1109/WSC.2017.8247844","DOIUrl":null,"url":null,"abstract":"The next generation of leadership supercomputer systems will require a medium-term layer of storage. The basis for this stratum of storage will be a Storage I/O Node (SION). For increased reliability, a redundancy algorithm will be implemented on top of groups of SIONs. In addition to the overheads of implementing a redundancy mechanism, a large cost of using a RAID strategy comes from the possibility of increased network congestion due to rebuild operations. To better understand the impact of RAID rebuild traffic, we have developed a simulation model of the SIONs. After validation, we use this model to investigate the impact of several configuration parameters, including redundancy mechanism and the physical arrangement of hardware. Additionally, our model analyzes the use of Quality of Service lanes to limit the impact of RAID traffic. We conclude with a series of recommendations for configuring a resilient and high performing I/O subsystem.","PeriodicalId":145780,"journal":{"name":"2017 Winter Simulation Conference (WSC)","volume":"815 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using quality of service lanes to control the impact of raid traffic within a burst buffer\",\"authors\":\"Elsa Gonsiorowski, C. Carothers, Justin M. LaPre, P. Heidelberger, C. Minkenberg, G. Rodríguez\",\"doi\":\"10.1109/WSC.2017.8247844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The next generation of leadership supercomputer systems will require a medium-term layer of storage. The basis for this stratum of storage will be a Storage I/O Node (SION). For increased reliability, a redundancy algorithm will be implemented on top of groups of SIONs. In addition to the overheads of implementing a redundancy mechanism, a large cost of using a RAID strategy comes from the possibility of increased network congestion due to rebuild operations. To better understand the impact of RAID rebuild traffic, we have developed a simulation model of the SIONs. After validation, we use this model to investigate the impact of several configuration parameters, including redundancy mechanism and the physical arrangement of hardware. Additionally, our model analyzes the use of Quality of Service lanes to limit the impact of RAID traffic. We conclude with a series of recommendations for configuring a resilient and high performing I/O subsystem.\",\"PeriodicalId\":145780,\"journal\":{\"name\":\"2017 Winter Simulation Conference (WSC)\",\"volume\":\"815 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Winter Simulation Conference (WSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WSC.2017.8247844\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Winter Simulation Conference (WSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSC.2017.8247844","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

下一代超级计算机系统将需要一个中期存储层。这个存储层的基础将是存储I/O节点(SION)。为了提高可靠性,将在sion组之上实现冗余算法。除了实现冗余机制的开销之外,使用RAID策略的一大成本来自于由于重建操作而增加的网络拥塞的可能性。为了更好地理解RAID重建流量的影响,我们开发了一个模拟模型。经过验证后,我们使用该模型研究了几个配置参数的影响,包括冗余机制和硬件的物理安排。此外,我们的模型分析了服务质量通道的使用,以限制RAID流量的影响。最后,我们给出了一系列配置弹性和高性能I/O子系统的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using quality of service lanes to control the impact of raid traffic within a burst buffer
The next generation of leadership supercomputer systems will require a medium-term layer of storage. The basis for this stratum of storage will be a Storage I/O Node (SION). For increased reliability, a redundancy algorithm will be implemented on top of groups of SIONs. In addition to the overheads of implementing a redundancy mechanism, a large cost of using a RAID strategy comes from the possibility of increased network congestion due to rebuild operations. To better understand the impact of RAID rebuild traffic, we have developed a simulation model of the SIONs. After validation, we use this model to investigate the impact of several configuration parameters, including redundancy mechanism and the physical arrangement of hardware. Additionally, our model analyzes the use of Quality of Service lanes to limit the impact of RAID traffic. We conclude with a series of recommendations for configuring a resilient and high performing I/O subsystem.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信