Jing Ren, Pei-Yao Qu, Jia-Hong Yang, Xiang-Yu Zheng, Hui Zhang, Jie Ren, Guang-Ming Tang
Superconductivity, vol. 11, Article 100116, published 2024-09-01. DOI: 10.1016/j.supcon.2024.100116. https://www.sciencedirect.com/science/article/pii/S2772830724000334
High performance rapid single-flux-quantum bit-slice arithmetic logic unit
Two optimization techniques, bypass and carry-control optimization, are demonstrated for enhancing the performance of a bit-slice Arithmetic Logic Unit (ALU) in 2ⁿ-bit Rapid Single-Flux-Quantum (RSFQ) microprocessors. Both techniques shorten the calculation time and resolve data hazards. The proposed bypass technique applies to any 2ⁿ-bit ALU, whether bit-serial, bit-slice, or bit-parallel. The high-performance bit-slice ALU was implemented in the 6 kA/cm² Nb/AlOₓ/Nb junction fabrication process of the Superconducting Electronics Facility of the Shanghai Institute of Microsystem and Information Technology. It consists of 1693 Josephson junctions and occupies an area of 2.46 × 0.81 mm². All ALU operations of the MIPS32 instruction set are implemented, along with two extended instructions: addition with carry (ADDC) and subtraction with borrow (SUBB). All ALU operations were successfully demonstrated in SFQ testing based on OCTOPUX, with a measured DC bias current margin of 86% to 104%. The ALU achieves a 100% utilization rate regardless of carry/borrow read-after-write dependencies between instructions.
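The carry/borrow read-after-write dependency mentioned in the abstract can be illustrated with a small behavioral sketch. This is not the authors' circuit and models neither RSFQ timing nor the bypass or carry-control hardware. It only shows how a bit-slice adder passes a carry from slice to slice, and how ADDC and SUBB read the flag written by the previous instruction. The 8-bit slice width, the names `add_bit_slice` and `SliceAlu`, and the "carry = 1 means no borrow" convention are all assumptions made for illustration; the abstract specifies none of them.

```python
from dataclasses import dataclass
from typing import Tuple

WORD_BITS = 32    # MIPS32 operand width
SLICE_BITS = 8    # assumed slice width; not stated in the abstract
SLICE_MASK = (1 << SLICE_BITS) - 1
WORD_MASK = (1 << WORD_BITS) - 1


def add_bit_slice(a: int, b: int, carry_in: int = 0) -> Tuple[int, int]:
    """Add two words one slice at a time, least significant slice first.

    Returns (result, carry_out). The carry produced by each slice is
    consumed by the next slice.
    """
    result = 0
    carry = carry_in
    for shift in range(0, WORD_BITS, SLICE_BITS):
        total = ((a >> shift) & SLICE_MASK) + ((b >> shift) & SLICE_MASK) + carry
        result |= (total & SLICE_MASK) << shift
        carry = total >> SLICE_BITS
    return result, carry


@dataclass
class SliceAlu:
    """Hypothetical behavioral model: each operation writes the carry flag,
    and ADDC/SUBB read the flag left by the previous operation."""
    carry: int = 0

    def add(self, a: int, b: int) -> int:
        result, self.carry = add_bit_slice(a, b, 0)
        return result

    def addc(self, a: int, b: int) -> int:
        # Reads the carry written by the previous instruction (read-after-write).
        result, self.carry = add_bit_slice(a, b, self.carry)
        return result

    def sub(self, a: int, b: int) -> int:
        # a - b computed as a + ~b + 1; carry out of 1 means "no borrow" (assumed).
        result, self.carry = add_bit_slice(a, ~b & WORD_MASK, 1)
        return result

    def subb(self, a: int, b: int) -> int:
        # Reads the borrow state written by the previous instruction.
        result, self.carry = add_bit_slice(a, ~b & WORD_MASK, self.carry)
        return result


# 64-bit addition built from ADD (low words) followed by ADDC (high words).
alu = SliceAlu()
low = alu.add(0xFFFFFFFF, 0x00000001)
high = alu.addc(0x00000001, 0x00000000)
print(hex((high << 32) | low))
```

In this model the second instruction cannot start its least significant slice until the first instruction has produced its final carry. That back-to-back dependency is the kind the abstract reports the ALU handles without lowering its utilization rate.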