FPGA-Based Distributed Union-Find Decoder for Surface Codes

Namitha Liyanage;Yue Wu;Siona Tagare;Lin Zhong
{"title":"FPGA-Based Distributed Union-Find Decoder for Surface Codes","authors":"Namitha Liyanage;Yue Wu;Siona Tagare;Lin Zhong","doi":"10.1109/TQE.2024.3467271","DOIUrl":null,"url":null,"abstract":"A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than \n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\n. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using a field-programmable gate array (FPGA)-based implementation, we empirically show that this distributed UF decoder has a sublinear average time complexity with regard to \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n, given \n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\n parallel computing resources. The decoding time per measurement round decreases as \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n increases, the first time for a quantum error decoder. The implementation employs a scalable architecture called Helios that organizes parallel computing resources into a hybrid tree-grid structure. Using a Xilinx VCU129 FPGA, we successfully implement \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n up to 21 with an average decoding time of 11.5 ns per measurement round under 0.1% phenomenological noise and 23.7 ns for \n<inline-formula><tex-math>$d=17$</tex-math></inline-formula>\n under equivalent circuit-level noise. This performance is significantly faster than any existing decoder implementation. Furthermore, we show that Helios can optimize for resource efficiency by decoding \n<inline-formula><tex-math>$d=51$</tex-math></inline-formula>\n on a Xilinx VCU129 FPGA with an average latency of 544 ns per measurement round.","PeriodicalId":100644,"journal":{"name":"IEEE Transactions on Quantum Engineering","volume":"5 ","pages":"1-18"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10693533","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Quantum Engineering","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10693533/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than $O(d^{3})$ . We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using a field-programmable gate array (FPGA)-based implementation, we empirically show that this distributed UF decoder has a sublinear average time complexity with regard to $d$ , given $O(d^{3})$ parallel computing resources. The decoding time per measurement round decreases as $d$ increases, the first time for a quantum error decoder. The implementation employs a scalable architecture called Helios that organizes parallel computing resources into a hybrid tree-grid structure. Using a Xilinx VCU129 FPGA, we successfully implement $d$ up to 21 with an average decoding time of 11.5 ns per measurement round under 0.1% phenomenological noise and 23.7 ns for $d=17$ under equivalent circuit-level noise. This performance is significantly faster than any existing decoder implementation. Furthermore, we show that Helios can optimize for resource efficiency by decoding $d=51$ on a Xilinx VCU129 FPGA with an average latency of 544 ns per measurement round.
基于 FPGA 的分布式曲面码联合查找解码器
容错量子计算机必须以比错误出现更快的速度解码和纠错,以防止因纠错而导致指数级减速。Union-Find(UF)解码器的平均时间复杂度略高于 $O(d^{3})$,前景广阔。我们报告了 UF 解码器的分布式版本,它利用并行计算资源进一步提高了速度。利用基于现场可编程门阵列(FPGA)的实现,我们通过经验证明,在并行计算资源为 $O(d^{3}$ 的情况下,这种分布式 UF 解码器的平均时间复杂度与 $d$ 呈亚线性关系。每个测量回合的解码时间随着 $d$ 的增加而减少,这在量子误差解码器中尚属首次。实现过程采用了一种名为 Helios 的可扩展架构,该架构将并行计算资源组织成混合树状网格结构。通过使用赛灵思 VCU129 FPGA,我们成功实现了高达 21d 的 $d$,在 0.1% 的现象学噪声下,每轮测量的平均解码时间为 11.5 ns,在等效电路级噪声下,$d=17$ 的平均解码时间为 23.7 ns。这一性能明显快于任何现有的解码器实现。此外,我们还展示了 Helios 可以优化资源效率,在 Xilinx VCU129 FPGA 上解码 $d=51$,每轮测量的平均延迟时间为 544 ns。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信