{"title":"The High Performance Interconnect Architecture for Supercomputers","authors":"","doi":"10.14529/jsfi230208","DOIUrl":null,"url":null,"abstract":"c (cid:13) The Authors 2023. This paper is published with open access at SuperFri.org In this paper, we introduce the design of an advanced high-performance interconnect architecture for supercomputers. In the first part of the paper, we consider the first generation high-performance Angara interconnect (Angara G1). The Angara interconnect is based on the router ASIC, which supports a 4D torus topology, a deterministic and an adaptive routing, and has the hardware support of the RDMA technology. The interface with a processor unit is PCI Express. The Angara G1 interconnect has an extremely low communication latency of 850 ns using the MPI library, as well as a link bandwidth of 75 Gbps. In the paper, we present the scalability performance results of the considered application problems on the supercomputers equipped with the Angara G1 interconnect. In the second part of the paper, using research results and experience we present the architecture of the advanced interconnect for supercomputers (G2). The G2 architecture supports 6D torus topology, the advanced deterministic and zone adaptive routing algorithms, and a low-level interconnect operations including acknowledgments and notifications. G2 includes support for exceptions, performance counters, and SR-IOV virtualization. A G2 hardware is planned in the form factor of a 32-port switch with the QSFP-DD connectors and a two-port low profile PCI Express adapter. The switches can be combined to 4D torus topology. We show the performance evaluation of an experimental FPGA prototype, which confirm the possibility of implementing the proposed advanced high performance interconnect architecture.","PeriodicalId":52144,"journal":{"name":"Supercomputing Frontiers and Innovations","volume":"22 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Supercomputing Frontiers and Innovations","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14529/jsfi230208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
c (cid:13) The Authors 2023. This paper is published with open access at SuperFri.org In this paper, we introduce the design of an advanced high-performance interconnect architecture for supercomputers. In the first part of the paper, we consider the first generation high-performance Angara interconnect (Angara G1). The Angara interconnect is based on the router ASIC, which supports a 4D torus topology, a deterministic and an adaptive routing, and has the hardware support of the RDMA technology. The interface with a processor unit is PCI Express. The Angara G1 interconnect has an extremely low communication latency of 850 ns using the MPI library, as well as a link bandwidth of 75 Gbps. In the paper, we present the scalability performance results of the considered application problems on the supercomputers equipped with the Angara G1 interconnect. In the second part of the paper, using research results and experience we present the architecture of the advanced interconnect for supercomputers (G2). The G2 architecture supports 6D torus topology, the advanced deterministic and zone adaptive routing algorithms, and a low-level interconnect operations including acknowledgments and notifications. G2 includes support for exceptions, performance counters, and SR-IOV virtualization. A G2 hardware is planned in the form factor of a 32-port switch with the QSFP-DD connectors and a two-port low profile PCI Express adapter. The switches can be combined to 4D torus topology. We show the performance evaluation of an experimental FPGA prototype, which confirm the possibility of implementing the proposed advanced high performance interconnect architecture.
期刊介绍:
The Journal of Supercomputing Frontiers and Innovations (JSFI) is a new peer reviewed publication that addresses the urgent need for greater dissemination of research and development findings and results at the leading edge of high performance computing systems, highly parallel methods, and extreme scaled applications. Key topic areas germane include, but not limited to: Enabling technologies for high performance computing Future generation supercomputer architectures Extreme-scale concepts beyond conventional practices including exascale Parallel programming models, interfaces, languages, libraries, and tools Supercomputer applications and algorithms Distributed operating systems, kernels, supervisors, and virtualization for highly scalable computing Scalable runtime systems software Methods and means of supercomputer system management, administration, and monitoring Mass storage systems, protocols, and allocation Energy and power minimization for very large deployed computers Resilience, reliability, and fault tolerance for future generation highly parallel computing systems Parallel performance and correctness debugging Scientific visualization for massive data and computing both external and in situ Education in high performance computing and computational science.