{"title":"基于 FPGA 的分布式曲面码联合查找解码器","authors":"Namitha Liyanage;Yue Wu;Siona Tagare;Lin Zhong","doi":"10.1109/TQE.2024.3467271","DOIUrl":null,"url":null,"abstract":"A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than \n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\n. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using a field-programmable gate array (FPGA)-based implementation, we empirically show that this distributed UF decoder has a sublinear average time complexity with regard to \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n, given \n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\n parallel computing resources. The decoding time per measurement round decreases as \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n increases, the first time for a quantum error decoder. The implementation employs a scalable architecture called Helios that organizes parallel computing resources into a hybrid tree-grid structure. Using a Xilinx VCU129 FPGA, we successfully implement \n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n up to 21 with an average decoding time of 11.5 ns per measurement round under 0.1% phenomenological noise and 23.7 ns for \n<inline-formula><tex-math>$d=17$</tex-math></inline-formula>\n under equivalent circuit-level noise. This performance is significantly faster than any existing decoder implementation. Furthermore, we show that Helios can optimize for resource efficiency by decoding \n<inline-formula><tex-math>$d=51$</tex-math></inline-formula>\n on a Xilinx VCU129 FPGA with an average latency of 544 ns per measurement round.","PeriodicalId":100644,"journal":{"name":"IEEE Transactions on Quantum Engineering","volume":"5 ","pages":"1-18"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10693533","citationCount":"0","resultStr":"{\"title\":\"FPGA-Based Distributed Union-Find Decoder for Surface Codes\",\"authors\":\"Namitha Liyanage;Yue Wu;Siona Tagare;Lin Zhong\",\"doi\":\"10.1109/TQE.2024.3467271\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than \\n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\\n. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using a field-programmable gate array (FPGA)-based implementation, we empirically show that this distributed UF decoder has a sublinear average time complexity with regard to \\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\\n, given \\n<inline-formula><tex-math>$O(d^{3})$</tex-math></inline-formula>\\n parallel computing resources. The decoding time per measurement round decreases as \\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\\n increases, the first time for a quantum error decoder. The implementation employs a scalable architecture called Helios that organizes parallel computing resources into a hybrid tree-grid structure. Using a Xilinx VCU129 FPGA, we successfully implement \\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\\n up to 21 with an average decoding time of 11.5 ns per measurement round under 0.1% phenomenological noise and 23.7 ns for \\n<inline-formula><tex-math>$d=17$</tex-math></inline-formula>\\n under equivalent circuit-level noise. This performance is significantly faster than any existing decoder implementation. Furthermore, we show that Helios can optimize for resource efficiency by decoding \\n<inline-formula><tex-math>$d=51$</tex-math></inline-formula>\\n on a Xilinx VCU129 FPGA with an average latency of 544 ns per measurement round.\",\"PeriodicalId\":100644,\"journal\":{\"name\":\"IEEE Transactions on Quantum Engineering\",\"volume\":\"5 \",\"pages\":\"1-18\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10693533\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Quantum Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10693533/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Quantum Engineering","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10693533/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FPGA-Based Distributed Union-Find Decoder for Surface Codes
A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than
$O(d^{3})$
. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using a field-programmable gate array (FPGA)-based implementation, we empirically show that this distributed UF decoder has a sublinear average time complexity with regard to
$d$
, given
$O(d^{3})$
parallel computing resources. The decoding time per measurement round decreases as
$d$
increases, the first time for a quantum error decoder. The implementation employs a scalable architecture called Helios that organizes parallel computing resources into a hybrid tree-grid structure. Using a Xilinx VCU129 FPGA, we successfully implement
$d$
up to 21 with an average decoding time of 11.5 ns per measurement round under 0.1% phenomenological noise and 23.7 ns for
$d=17$
under equivalent circuit-level noise. This performance is significantly faster than any existing decoder implementation. Furthermore, we show that Helios can optimize for resource efficiency by decoding
$d=51$
on a Xilinx VCU129 FPGA with an average latency of 544 ns per measurement round.