In-network stable radix sorter using many FPGAs with high-bandwidth photonics [Invited]

IF 4 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
Kenji Mizutani;Yutaka Urino;Takanori Shimizu;Hiroshi Yamaguchi;Shigeru Nakamura;Tatsuya Usuki;Kiyo Ishii;Ryosuke Matsumoto;Takashi Inoue;Shu Namiki;Michihiro Koibuchi
{"title":"In-network stable radix sorter using many FPGAs with high-bandwidth photonics [Invited]","authors":"Kenji Mizutani;Yutaka Urino;Takanori Shimizu;Hiroshi Yamaguchi;Shigeru Nakamura;Tatsuya Usuki;Kiyo Ishii;Ryosuke Matsumoto;Takashi Inoue;Shu Namiki;Michihiro Koibuchi","doi":"10.1364/JOCN.530695","DOIUrl":null,"url":null,"abstract":"A modern field-programmable gate array (FPGA) card can be equipped with high-bandwidth memory and high-bandwidth optical interconnection networks. This paper presents an in-network stable radix sorter on an eight-FPGA cluster. Each custom Stratix10 MX2100 FPGA card has up to 819-Gbps memory bandwidth (\n<tex>${51.2}\\;{\\rm Gbps} \\times {16}\\;{\\rm channels}$</tex>\n) and up to 800-Gbps network bandwidth (\n<tex>${25}\\;{\\rm Gbps} \\times {32}\\;{\\rm channels}$</tex>\n) with eight custom embedded optical modules. Our radix sorter for a 32-bit key range consists of eight 4-bit counting sorts optimized to in-network processing. Each counting sort needs only one memory read/write access for improving its throughput. We demonstrated a sorting throughput of 37.2 GB/s and an energy efficiency of 9.2 MB/J for 32-bit key range and 16-GiB data size using eight memory channels with 409.6 Gbps memory bandwidth per FPGA. It can scale up to 256 FPGAs with a sorting throughput of 983 GB/s for a 32-bit key range and 512-GiB data size.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 1","pages":"A34-A45"},"PeriodicalIF":4.0000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10752945/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

A modern field-programmable gate array (FPGA) card can be equipped with high-bandwidth memory and high-bandwidth optical interconnection networks. This paper presents an in-network stable radix sorter on an eight-FPGA cluster. Each custom Stratix10 MX2100 FPGA card has up to 819-Gbps memory bandwidth ( ${51.2}\;{\rm Gbps} \times {16}\;{\rm channels}$ ) and up to 800-Gbps network bandwidth ( ${25}\;{\rm Gbps} \times {32}\;{\rm channels}$ ) with eight custom embedded optical modules. Our radix sorter for a 32-bit key range consists of eight 4-bit counting sorts optimized to in-network processing. Each counting sort needs only one memory read/write access for improving its throughput. We demonstrated a sorting throughput of 37.2 GB/s and an energy efficiency of 9.2 MB/J for 32-bit key range and 16-GiB data size using eight memory channels with 409.6 Gbps memory bandwidth per FPGA. It can scale up to 256 FPGAs with a sorting throughput of 983 GB/s for a 32-bit key range and 512-GiB data size.
利用高带宽光子技术的多 FPGA 网内稳定弧度分类器 [特邀]
现代现场可编程门阵列(FPGA)卡可配备高带宽存储器和高带宽光互连网络。本文介绍了一种基于八 FPGA 集群的网内稳定弧度分拣机。每个定制的 Stratix10 MX2100 FPGA 卡拥有高达 819-Gbps 的内存带宽({51.2}\;{rm Gbps} ({16}\;{rm 通道}$)和高达 800-Gbps 的网络带宽({25}\;{rm Gbps} ({32}\;{rm 通道}$),并带有八个定制的嵌入式光模块。我们用于 32 位密钥范围的 radix 排序器由 8 个 4 位计数排序器组成,并针对网络内处理进行了优化。每个计数排序只需一次内存读/写访问即可提高吞吐量。我们演示了在 32 位密钥范围和 16 GB 数据大小下,使用 8 个内存通道(每个 FPGA 拥有 409.6 Gbps 内存带宽)实现 37.2 GB/s 的排序吞吐量和 9.2 MB/J 的能效。在 32 位密钥范围和 512 GB 数据大小的情况下,它可以扩展到 256 个 FPGA,排序吞吐量为 983 GB/s。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.40
自引率
16.00%
发文量
104
审稿时长
4 months
期刊介绍: The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信