Gaurav Narang, Chukwufumnanya Ogbogu, Jana Doppa, P. Pande
{"title":"TEFLON:用于在多核 PIM 架构上加速 CNN 推断的热效数据流感知 3D NoC","authors":"Gaurav Narang, Chukwufumnanya Ogbogu, Jana Doppa, P. Pande","doi":"10.1145/3665279","DOIUrl":null,"url":null,"abstract":"\n Resistive random-access memory (ReRAM) based processing-in-memory (PIM) architectures are used extensively to accelerate inferencing/training with convolutional neural networks (CNNs). Three-dimensional (3D) integration is an enabling technology to integrate many PIM cores on a single chip. In this work, we propose the design of a thermally efficient dataflow-aware monolithic 3D (M3D) NoC architecture referred to as\n \n TEFLON\n \n to accelerate CNN inferencing without creating any thermal bottlenecks.\n \n TEFLON\n \n reduces the Energy-Delay-Product (EDP) by 4\n \n 2\\%\n \n ,\n \n 46\\%\n \n , and 45\n \n \\%\n \n on an average compared to a conventional 3D mesh NoC for systems with 36-, 64-, and 100-PIM cores respectively.\n \n TEFLON\n \n reduces the peak chip temperature by 25\n \n K\n \n and improves the inference accuracy by up to 11\n \n \\%\n \n compared to sole performance-optimized SFC-based counterpart for inferencing with diverse deep CNN models using CIFAR-10/100 datasets on a 3D system with 100-PIM cores.\n","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":"32 14","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TEFLON: Thermally Efficient Dataflow-Aware 3D NoC for Accelerating CNN Inferencing on Manycore PIM Architectures\",\"authors\":\"Gaurav Narang, Chukwufumnanya Ogbogu, Jana Doppa, P. Pande\",\"doi\":\"10.1145/3665279\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Resistive random-access memory (ReRAM) based processing-in-memory (PIM) architectures are used extensively to accelerate inferencing/training with convolutional neural networks (CNNs). Three-dimensional (3D) integration is an enabling technology to integrate many PIM cores on a single chip. In this work, we propose the design of a thermally efficient dataflow-aware monolithic 3D (M3D) NoC architecture referred to as\\n \\n TEFLON\\n \\n to accelerate CNN inferencing without creating any thermal bottlenecks.\\n \\n TEFLON\\n \\n reduces the Energy-Delay-Product (EDP) by 4\\n \\n 2\\\\%\\n \\n ,\\n \\n 46\\\\%\\n \\n , and 45\\n \\n \\\\%\\n \\n on an average compared to a conventional 3D mesh NoC for systems with 36-, 64-, and 100-PIM cores respectively.\\n \\n TEFLON\\n \\n reduces the peak chip temperature by 25\\n \\n K\\n \\n and improves the inference accuracy by up to 11\\n \\n \\\\%\\n \\n compared to sole performance-optimized SFC-based counterpart for inferencing with diverse deep CNN models using CIFAR-10/100 datasets on a 3D system with 100-PIM cores.\\n\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":\"32 14\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3665279\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3665279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
TEFLON: Thermally Efficient Dataflow-Aware 3D NoC for Accelerating CNN Inferencing on Manycore PIM Architectures
Resistive random-access memory (ReRAM) based processing-in-memory (PIM) architectures are used extensively to accelerate inferencing/training with convolutional neural networks (CNNs). Three-dimensional (3D) integration is an enabling technology to integrate many PIM cores on a single chip. In this work, we propose the design of a thermally efficient dataflow-aware monolithic 3D (M3D) NoC architecture referred to as
TEFLON
to accelerate CNN inferencing without creating any thermal bottlenecks.
TEFLON
reduces the Energy-Delay-Product (EDP) by 4
2\%
,
46\%
, and 45
\%
on an average compared to a conventional 3D mesh NoC for systems with 36-, 64-, and 100-PIM cores respectively.
TEFLON
reduces the peak chip temperature by 25
K
and improves the inference accuracy by up to 11
\%
compared to sole performance-optimized SFC-based counterpart for inferencing with diverse deep CNN models using CIFAR-10/100 datasets on a 3D system with 100-PIM cores.
期刊介绍:
ACS Applied Bio Materials is an interdisciplinary journal publishing original research covering all aspects of biomaterials and biointerfaces including and beyond the traditional biosensing, biomedical and therapeutic applications.
The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrates knowledge in the areas of materials, engineering, physics, bioscience, and chemistry into important bio applications. The journal is specifically interested in work that addresses the relationship between structure and function and assesses the stability and degradation of materials under relevant environmental and biological conditions.