Anup Ashok Kedilaya;Sirish Oruganti;Nishant Gupta;Xiuhao Zhang;Ilya Karpov;Mark A. Anders;Jaydeep P. Kulkarni
{"title":"Beyond Backside Power: Backside Signal Routing as Technology Booster for Standard-Cell Scaling","authors":"Anup Ashok Kedilaya;Sirish Oruganti;Nishant Gupta;Xiuhao Zhang;Ilya Karpov;Mark A. Anders;Jaydeep P. Kulkarni","doi":"10.1109/JXCDC.2025.3617784","DOIUrl":null,"url":null,"abstract":"Advances in process technology enabling backside metals (BSMs) and contacts offer new design–technology co-optimization (DTCO) opportunities to further enhance power, performance, and area gains (PPA) in sub-3-nm nodes. This work exploits backside (BS) contact technology within standard cells to extend both signal and clock routing to BSM layers, enabling standard-cell height reduction options. We design electrically equivalent (EEQ) standard cells with multiple layout variants based on front versus BS pin access, achieving a 2-M0-Track height reduction in 3-nm gate-all-around field-effect transistor (GAAFET) technology. Experimental evaluation across representative industrial benchmarks—including high-performance CPUs, GPUs, and general-purpose systems-on-chip (SoCs) demonstrates significant benefits. Cell height reduction delivers up to 35% area savings and 10%–15% total power reduction for GPU and GP-SoC designs. For high-performance CPUs, maximum performance improves by 15% at iso-power compared to backside power with buried power rails (BSBPR). Incorporating BS signal routing with cell height reduction also reduces worst case IR drop by 32% relative to BSBPR. These results show that BS clock (BSCLK) and signal routing represent the next phase of technology innovation beyond BS power delivery, enabling continued standard-cell scaling, improved intracell and intercell routability, and generational PPA gains while maintaining similar core transistor geometries in sub-3-nm technologies.","PeriodicalId":54149,"journal":{"name":"IEEE Journal on Exploratory Solid-State Computational Devices and Circuits","volume":"11 ","pages":"107-115"},"PeriodicalIF":2.7000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11192533","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Exploratory Solid-State Computational Devices and Circuits","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11192533/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Advances in process technology enabling backside metals (BSMs) and contacts offer new design–technology co-optimization (DTCO) opportunities to further enhance power, performance, and area gains (PPA) in sub-3-nm nodes. This work exploits backside (BS) contact technology within standard cells to extend both signal and clock routing to BSM layers, enabling standard-cell height reduction options. We design electrically equivalent (EEQ) standard cells with multiple layout variants based on front versus BS pin access, achieving a 2-M0-Track height reduction in 3-nm gate-all-around field-effect transistor (GAAFET) technology. Experimental evaluation across representative industrial benchmarks—including high-performance CPUs, GPUs, and general-purpose systems-on-chip (SoCs) demonstrates significant benefits. Cell height reduction delivers up to 35% area savings and 10%–15% total power reduction for GPU and GP-SoC designs. For high-performance CPUs, maximum performance improves by 15% at iso-power compared to backside power with buried power rails (BSBPR). Incorporating BS signal routing with cell height reduction also reduces worst case IR drop by 32% relative to BSBPR. These results show that BS clock (BSCLK) and signal routing represent the next phase of technology innovation beyond BS power delivery, enabling continued standard-cell scaling, improved intracell and intercell routability, and generational PPA gains while maintaining similar core transistor geometries in sub-3-nm technologies.