Shuhan Liu;Robert M. Radway;Xinxin Wang;Filippo Moro;Jean-Francois Nodin;Koustav Jana;Lixian Yan;Shuting Du;Luke R. Upton;Wei-Chen Chen;Jimin Kang;Jian Chen;Haitong Li;Francois Andrieu;Elisa Vianello;Priyanka Raina;Subhasish Mitra;H.-S. Philip Wong
{"title":"Monolithic 3-D Integration of Diverse Memories: Resistive Switching (RRAM) and Gain Cell (GC) Memory Integrated on Si CMOS","authors":"Shuhan Liu;Robert M. Radway;Xinxin Wang;Filippo Moro;Jean-Francois Nodin;Koustav Jana;Lixian Yan;Shuting Du;Luke R. Upton;Wei-Chen Chen;Jimin Kang;Jian Chen;Haitong Li;Francois Andrieu;Elisa Vianello;Priyanka Raina;Subhasish Mitra;H.-S. Philip Wong","doi":"10.1109/TED.2025.3556113","DOIUrl":null,"url":null,"abstract":"The future memory is massive, diverse, and tightly integrated with computing. This research presents tight integration, both physically and architecturally, of two on-chip memory technologies, resistive switching random access memory (RRAM) and gain cell (GC) memory. HfO2 RRAM and indium tin oxide (ITO) GC memory are monolithically integrated on 130-nm Si CMOS technology to form a joint memory that enables low-energy training and low-standby-power inference for edge devices. High-bandwidth on-chip data transfer can have a bandwidth that is <inline-formula> <tex-math>$90\\times $ </tex-math></inline-formula> state-of-the-art (SoTA) HBM3E and <inline-formula> <tex-math>$211\\times $ </tex-math></inline-formula> PCIe 7.0, enabled by high-density monolithic 3-D interconnections between memory arrays and high-speed transfer circuit within the integrated joint memory macro. Fabricated atomic layer deposition (ALD) ITO FET exhibits positive <inline-formula> <tex-math>${V}_{\\text {TH}}$ </tex-math></inline-formula> of 0.67 V, excellent subthreshold slope (SS) of 65 mV/dec, high <sc>on</small>-current of <inline-formula> <tex-math>$20~\\mu $ </tex-math></inline-formula>A/<inline-formula> <tex-math>$\\mu $ </tex-math></inline-formula>m, and low <sc>off</small>-current of <inline-formula> <tex-math>$5\\times 10^{-{18}}$ </tex-math></inline-formula> A/<inline-formula> <tex-math>$\\mu $ </tex-math></inline-formula>m, as extracted from >5000 s retention. The joint memory macro consumes 78% less standby power and 95% less training energy for MobileBERT compared to SRAM with iso-capacity. This RRAM-GC joint memory facilitates efficient continual learning in edge devices, addressing the challenges of a resource-constrained environment while supporting adaptive artificial intelligence (AI) model updates.","PeriodicalId":13092,"journal":{"name":"IEEE Transactions on Electron Devices","volume":"72 5","pages":"2685-2690"},"PeriodicalIF":2.9000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Electron Devices","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10966007/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The future memory is massive, diverse, and tightly integrated with computing. This research presents tight integration, both physically and architecturally, of two on-chip memory technologies, resistive switching random access memory (RRAM) and gain cell (GC) memory. HfO2 RRAM and indium tin oxide (ITO) GC memory are monolithically integrated on 130-nm Si CMOS technology to form a joint memory that enables low-energy training and low-standby-power inference for edge devices. High-bandwidth on-chip data transfer can have a bandwidth that is $90\times $ state-of-the-art (SoTA) HBM3E and $211\times $ PCIe 7.0, enabled by high-density monolithic 3-D interconnections between memory arrays and high-speed transfer circuit within the integrated joint memory macro. Fabricated atomic layer deposition (ALD) ITO FET exhibits positive ${V}_{\text {TH}}$ of 0.67 V, excellent subthreshold slope (SS) of 65 mV/dec, high on-current of $20~\mu $ A/$\mu $ m, and low off-current of $5\times 10^{-{18}}$ A/$\mu $ m, as extracted from >5000 s retention. The joint memory macro consumes 78% less standby power and 95% less training energy for MobileBERT compared to SRAM with iso-capacity. This RRAM-GC joint memory facilitates efficient continual learning in edge devices, addressing the challenges of a resource-constrained environment while supporting adaptive artificial intelligence (AI) model updates.
期刊介绍:
IEEE Transactions on Electron Devices publishes original and significant contributions relating to the theory, modeling, design, performance and reliability of electron and ion integrated circuit devices and interconnects, involving insulators, metals, organic materials, micro-plasmas, semiconductors, quantum-effect structures, vacuum devices, and emerging materials with applications in bioelectronics, biomedical electronics, computation, communications, displays, microelectromechanics, imaging, micro-actuators, nanoelectronics, optoelectronics, photovoltaics, power ICs and micro-sensors. Tutorial and review papers on these subjects are also published and occasional special issues appear to present a collection of papers which treat particular areas in more depth and breadth.