N. Evans, K. Pedretti, Brian Kocoloski, J. Lange, M. Lang, P. Bridges
{"title":"A Cross-Enclave Composition Mechanism for Exascale System Software","authors":"N. Evans, K. Pedretti, Brian Kocoloski, J. Lange, M. Lang, P. Bridges","doi":"10.1145/2931088.2931094","DOIUrl":null,"url":null,"abstract":"As supercomputers move to exascale, the number of cores per node continues to increase, but the I/O bandwidth between nodes is increasing more slowly. This leads to computational power outstripping I/O bandwidth. This growth, in turn, encourages moving as much of an HPC workflow as possible onto the node in order to minimize data movement. One particular method of application composition, enclaves, co-locates different operating systems and runtimes on the same node where they communicate by in situ communication mechanisms. In this work, we describe a mechanism for communicating between composed applications. We implement a mechanism using Copy on Write cooperating with XEMEM shared memory to provide consistent, implicitly unsynchronized communication across enclaves. We then evaluate this mechanism using a composed application and analytics between the Kitten Lightweight Kernel and Linux on top of the Hobbes Operating System and Runtime. These results show a 3% overhead compared to an application running in isolation, demonstrating the viability of this approach.","PeriodicalId":262414,"journal":{"name":"Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2931088.2931094","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
As supercomputers move to exascale, the number of cores per node continues to increase, but the I/O bandwidth between nodes is increasing more slowly. This leads to computational power outstripping I/O bandwidth. This growth, in turn, encourages moving as much of an HPC workflow as possible onto the node in order to minimize data movement. One particular method of application composition, enclaves, co-locates different operating systems and runtimes on the same node where they communicate by in situ communication mechanisms. In this work, we describe a mechanism for communicating between composed applications. We implement a mechanism using Copy on Write cooperating with XEMEM shared memory to provide consistent, implicitly unsynchronized communication across enclaves. We then evaluate this mechanism using a composed application and analytics between the Kitten Lightweight Kernel and Linux on top of the Hobbes Operating System and Runtime. These results show a 3% overhead compared to an application running in isolation, demonstrating the viability of this approach.
随着超级计算机向百亿亿级发展,每个节点的核心数量继续增加,但节点之间的I/O带宽增加得更慢。这导致计算能力超过I/O带宽。这种增长反过来又鼓励将尽可能多的HPC工作流移动到节点上,以尽量减少数据移动。应用程序组合的一种特殊方法是将不同的操作系统和运行时放在同一个节点上,通过原位通信机制进行通信。在这项工作中,我们描述了一种在组合应用程序之间进行通信的机制。我们实现了一种使用Copy on Write与XEMEM共享内存协作的机制,以提供跨enclave的一致的、隐式的非同步通信。然后,我们在霍布斯操作系统和运行时之上,使用小猫轻量级内核和Linux之间的组合应用程序和分析来评估这种机制。与单独运行的应用程序相比,这些结果显示了3%的开销,证明了这种方法的可行性。