Seok-Ju Yun;Jaehyuk Lee;Sungmeen Myung;Jangho An;Daekun Yoon;Seungchul Jung;Soonwan Kwon;Sangjoon Kim
{"title":"用于AI加速器的5nm高效高密度数字SRAM内存计算宏","authors":"Seok-Ju Yun;Jaehyuk Lee;Sungmeen Myung;Jangho An;Daekun Yoon;Seungchul Jung;Soonwan Kwon;Sangjoon Kim","doi":"10.1109/LSSC.2025.3532788","DOIUrl":null,"url":null,"abstract":"Two specialized digital SRAM In-memory computing (IMC) macros were implemented using a 5-nm process: 1) a high efficiency (HE) macro and 2) a high-density (HD) macro. The HE macro achieves an energy efficiency of 274 TOPS/W under 90% input bit sparsity, 50% weight bit sparsity, and 0.46-V supply voltage, by adopting a compact Wallace tree adder (WTA) and a tristate buffer optimized for pipelined operation. The HD macro achieves a state-of-the-art memory density performance of 5.67 Mb/mm2 by employing a multiply-cell consisting of a single transistor. Extensive sample measurements have confirmed the robust and reliable performance of the two digital IMC macros. The proposed HE and HD macros have demonstrated their significant potential as key building blocks for next-generation Artificial intelligence (AI) accelerators.","PeriodicalId":13032,"journal":{"name":"IEEE Solid-State Circuits Letters","volume":"8 ","pages":"269-272"},"PeriodicalIF":2.0000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"5-nm High-Efficiency and High-Density Digital SRAM In-Memory-Computing Macros for AI Accelerators\",\"authors\":\"Seok-Ju Yun;Jaehyuk Lee;Sungmeen Myung;Jangho An;Daekun Yoon;Seungchul Jung;Soonwan Kwon;Sangjoon Kim\",\"doi\":\"10.1109/LSSC.2025.3532788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Two specialized digital SRAM In-memory computing (IMC) macros were implemented using a 5-nm process: 1) a high efficiency (HE) macro and 2) a high-density (HD) macro. The HE macro achieves an energy efficiency of 274 TOPS/W under 90% input bit sparsity, 50% weight bit sparsity, and 0.46-V supply voltage, by adopting a compact Wallace tree adder (WTA) and a tristate buffer optimized for pipelined operation. The HD macro achieves a state-of-the-art memory density performance of 5.67 Mb/mm2 by employing a multiply-cell consisting of a single transistor. Extensive sample measurements have confirmed the robust and reliable performance of the two digital IMC macros. The proposed HE and HD macros have demonstrated their significant potential as key building blocks for next-generation Artificial intelligence (AI) accelerators.\",\"PeriodicalId\":13032,\"journal\":{\"name\":\"IEEE Solid-State Circuits Letters\",\"volume\":\"8 \",\"pages\":\"269-272\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2025-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Solid-State Circuits Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10849601/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Solid-State Circuits Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10849601/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
5-nm High-Efficiency and High-Density Digital SRAM In-Memory-Computing Macros for AI Accelerators
Two specialized digital SRAM In-memory computing (IMC) macros were implemented using a 5-nm process: 1) a high efficiency (HE) macro and 2) a high-density (HD) macro. The HE macro achieves an energy efficiency of 274 TOPS/W under 90% input bit sparsity, 50% weight bit sparsity, and 0.46-V supply voltage, by adopting a compact Wallace tree adder (WTA) and a tristate buffer optimized for pipelined operation. The HD macro achieves a state-of-the-art memory density performance of 5.67 Mb/mm2 by employing a multiply-cell consisting of a single transistor. Extensive sample measurements have confirmed the robust and reliable performance of the two digital IMC macros. The proposed HE and HD macros have demonstrated their significant potential as key building blocks for next-generation Artificial intelligence (AI) accelerators.