高性能计算异构集群的设计与实现

G. Barone, D. Bottalico, L. Carracciuolo, A. Doria, Davide Michelino, S. Pardi, G. Russo, G. Sabella, B. Spisso
{"title":"高性能计算异构集群的设计与实现","authors":"G. Barone, D. Bottalico, L. Carracciuolo, A. Doria, Davide Michelino, S. Pardi, G. Russo, G. Sabella, B. Spisso","doi":"10.1109/ICECET55527.2022.9872709","DOIUrl":null,"url":null,"abstract":"We present a new hybrid cluster, characterized by heterogeneous resources, set up in the Federico II University of Naples Data Center, funded by the IBiSCo (Infrastructure for BIg data and Scientific COmputing) project. It aims at big data analytics, high throughput and high performance processing, image processing and analysis. The purpose of the hybrid features is to guarantee the best use of resources for their applications in different scenario, so as to profit from different computational paradigms: from parallel computing to GPGPU accelerated workload and their combinations. The cluster provides 128 GPUs as well as the coexistence of technologies for High Throughput Computing (HTC) and High Performance Computing (HPC). To offer heterogeneous resources, cluster nodes have multiple network connections together with an NVLink bus between the GPUs on each node, which ensures more efficient intranode communication. The data storage is separated from the computing nodes and its efficient access is assured by Lustre distributed and parallel file system which leverages on Infini-Band technology. Our work should be useful to evaluate some promising technologies for the management and the efficient usage of computing resources under development within different Exascale Computing Projects.","PeriodicalId":249012,"journal":{"name":"2022 International Conference on Electrical, Computer and Energy Technologies (ICECET)","volume":"59 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Designing And Implementing A High-Performance Computing Heterogeneous Cluster\",\"authors\":\"G. Barone, D. Bottalico, L. Carracciuolo, A. Doria, Davide Michelino, S. Pardi, G. Russo, G. Sabella, B. Spisso\",\"doi\":\"10.1109/ICECET55527.2022.9872709\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a new hybrid cluster, characterized by heterogeneous resources, set up in the Federico II University of Naples Data Center, funded by the IBiSCo (Infrastructure for BIg data and Scientific COmputing) project. It aims at big data analytics, high throughput and high performance processing, image processing and analysis. The purpose of the hybrid features is to guarantee the best use of resources for their applications in different scenario, so as to profit from different computational paradigms: from parallel computing to GPGPU accelerated workload and their combinations. The cluster provides 128 GPUs as well as the coexistence of technologies for High Throughput Computing (HTC) and High Performance Computing (HPC). To offer heterogeneous resources, cluster nodes have multiple network connections together with an NVLink bus between the GPUs on each node, which ensures more efficient intranode communication. The data storage is separated from the computing nodes and its efficient access is assured by Lustre distributed and parallel file system which leverages on Infini-Band technology. Our work should be useful to evaluate some promising technologies for the management and the efficient usage of computing resources under development within different Exascale Computing Projects.\",\"PeriodicalId\":249012,\"journal\":{\"name\":\"2022 International Conference on Electrical, Computer and Energy Technologies (ICECET)\",\"volume\":\"59 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Electrical, Computer and Energy Technologies (ICECET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECET55527.2022.9872709\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical, Computer and Energy Technologies (ICECET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECET55527.2022.9872709","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们提出了一个新的混合集群,以异构资源为特征,建立在那不勒斯费德里科二世大学数据中心,由IBiSCo(大数据和科学计算基础设施)项目资助。面向大数据分析、高吞吐量高性能处理、图像处理与分析。混合特性的目的是保证在不同场景下应用程序的资源得到最佳利用,从而从不同的计算范式中获益:从并行计算到GPGPU加速工作负载及其组合。集群提供128个gpu, HTC (High Throughput Computing)和HPC (High Performance Computing)技术共存。为了提供异构资源,集群节点有多个网络连接,每个节点的gpu之间都有NVLink总线,保证了更高效的内部网通信。数据存储与计算节点分离,采用Lustre分布式并行文件系统,利用infiniband技术保证数据存储的高效访问。我们的工作应该有助于评估在不同的Exascale计算项目中正在开发的计算资源的管理和有效使用的一些有前途的技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Designing And Implementing A High-Performance Computing Heterogeneous Cluster
We present a new hybrid cluster, characterized by heterogeneous resources, set up in the Federico II University of Naples Data Center, funded by the IBiSCo (Infrastructure for BIg data and Scientific COmputing) project. It aims at big data analytics, high throughput and high performance processing, image processing and analysis. The purpose of the hybrid features is to guarantee the best use of resources for their applications in different scenario, so as to profit from different computational paradigms: from parallel computing to GPGPU accelerated workload and their combinations. The cluster provides 128 GPUs as well as the coexistence of technologies for High Throughput Computing (HTC) and High Performance Computing (HPC). To offer heterogeneous resources, cluster nodes have multiple network connections together with an NVLink bus between the GPUs on each node, which ensures more efficient intranode communication. The data storage is separated from the computing nodes and its efficient access is assured by Lustre distributed and parallel file system which leverages on Infini-Band technology. Our work should be useful to evaluate some promising technologies for the management and the efficient usage of computing resources under development within different Exascale Computing Projects.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信