Aqeeb Iqbal Arka, Biresh Kumar Joardar, R. Kim, D. Kim, J. Doppa, P. Pande
{"title":"HeM3D","authors":"Aqeeb Iqbal Arka, Biresh Kumar Joardar, R. Kim, D. Kim, J. Doppa, P. Pande","doi":"10.1145/3424239","DOIUrl":null,"url":null,"abstract":"Heterogeneous manycore architectures are the key to efficiently execute compute- and data-intensive applications. Through-silicon-via (TSV)-based 3D manycore system is a promising solution in this direction as it enables the integration of disparate computing cores on a single system. Recent industry trends show the viability of 3D integration in real products (e.g., Intel Lakefield SoC Architecture, the AMD Radeon R9 Fury X graphics card, and Xilinx Virtex-7 2000T/H580T, etc.). However, the achievable performance of conventional TSV-based 3D systems is ultimately bottlenecked by the horizontal wires (wires in each planar die). Moreover, current TSV 3D architectures suffer from thermal limitations. Hence, TSV-based architectures do not realize the full potential of 3D integration. Monolithic 3D (M3D) integration, a breakthrough technology to achieve “More Moore and More Than Moore,” opens up the possibility of designing cores and associated network routers using multiple layers by utilizing monolithic inter-tier vias (MIVs) and hence, reducing the effective wire length. Compared to TSV-based 3D integrated circuits (ICs), M3D offers the “true” benefits of vertical dimension for system integration: the size of an MIV used in M3D is over 100 × smaller than a TSV. This dramatic reduction in via size and the resulting increase in density opens up numerous opportunities for design optimizations in 3D manycore systems: designers can use up to millions of MIVs for ultra-fine-grained 3D optimization, where individual cores and routers can be spread across multiple tiers for extreme power and performance optimization. In this work, we demonstrate how M3D-enabled vertical core and uncore elements offer significant performance and thermal improvements in manycore heterogeneous architectures compared to its TSV-based counterpart. To overcome the difficult optimization challenges due to the large design space and complex interactions among the heterogeneous components (CPU, GPU, Last Level Cache, etc.) in a M3D-based manycore chip, we leverage novel design-space exploration algorithms to trade off different objectives. The proposed M3D-enabled heterogeneous architecture, called HeM3D, outperforms its state-of-the-art TSV-equivalent counterpart by up to 18.3% in execution time while being up to 19°C cooler.","PeriodicalId":6933,"journal":{"name":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","volume":"120 1","pages":"1 - 21"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3424239","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
异构多核体系结构是高效执行计算和数据密集型应用程序的关键。基于硅通孔(TSV)的3D多核系统在这个方向上是一个很有前途的解决方案,因为它可以在单个系统上集成不同的计算核心。最近的行业趋势显示了3D集成在实际产品中的可行性(例如,英特尔lakfield SoC架构,AMD Radeon R9 Fury X显卡和Xilinx Virtex-7 2000T/H580T等)。然而,传统的基于tsv的三维系统的可实现性能最终受到水平线(每个平面模具中的线)的瓶颈。此外,当前的TSV 3D架构受到热限制。因此,基于tsv的架构并没有实现3D集成的全部潜力。单片3D (M3D)集成技术是实现“More Moore and More Than Moore”的突破性技术,它利用单片层间通孔(miv),开辟了使用多层设计核心和相关网络路由器的可能性,从而减少了有效导线长度。与基于TSV的3D集成电路(ic)相比,M3D为系统集成提供了垂直尺寸的“真正”优势:M3D中使用的MIV尺寸比TSV小100倍以上。通过尺寸的大幅减小以及由此带来的密度的增加,为3D多核系统的设计优化开辟了许多机会:设计人员可以使用多达数百万个miv进行超细粒度的3D优化,其中单个核心和路由器可以分布在多个层上,以实现极致的功率和性能优化。在这项工作中,我们展示了与基于tsv的组件相比,支持m3d的垂直核心和非核心组件如何在多核异构架构中提供显着的性能和散热改进。为了克服由于基于m3d的多核芯片中大型设计空间和异构组件(CPU, GPU, Last Level Cache等)之间复杂的交互而导致的困难优化挑战,我们利用新颖的设计空间探索算法来权衡不同的目标。提出的支持m3d的异构架构,称为HeM3D,在执行时间上比最先进的tsv等效产品高出18.3%,同时温度可降低19°C。
Heterogeneous manycore architectures are the key to efficiently execute compute- and data-intensive applications. Through-silicon-via (TSV)-based 3D manycore system is a promising solution in this direction as it enables the integration of disparate computing cores on a single system. Recent industry trends show the viability of 3D integration in real products (e.g., Intel Lakefield SoC Architecture, the AMD Radeon R9 Fury X graphics card, and Xilinx Virtex-7 2000T/H580T, etc.). However, the achievable performance of conventional TSV-based 3D systems is ultimately bottlenecked by the horizontal wires (wires in each planar die). Moreover, current TSV 3D architectures suffer from thermal limitations. Hence, TSV-based architectures do not realize the full potential of 3D integration. Monolithic 3D (M3D) integration, a breakthrough technology to achieve “More Moore and More Than Moore,” opens up the possibility of designing cores and associated network routers using multiple layers by utilizing monolithic inter-tier vias (MIVs) and hence, reducing the effective wire length. Compared to TSV-based 3D integrated circuits (ICs), M3D offers the “true” benefits of vertical dimension for system integration: the size of an MIV used in M3D is over 100 × smaller than a TSV. This dramatic reduction in via size and the resulting increase in density opens up numerous opportunities for design optimizations in 3D manycore systems: designers can use up to millions of MIVs for ultra-fine-grained 3D optimization, where individual cores and routers can be spread across multiple tiers for extreme power and performance optimization. In this work, we demonstrate how M3D-enabled vertical core and uncore elements offer significant performance and thermal improvements in manycore heterogeneous architectures compared to its TSV-based counterpart. To overcome the difficult optimization challenges due to the large design space and complex interactions among the heterogeneous components (CPU, GPU, Last Level Cache, etc.) in a M3D-based manycore chip, we leverage novel design-space exploration algorithms to trade off different objectives. The proposed M3D-enabled heterogeneous architecture, called HeM3D, outperforms its state-of-the-art TSV-equivalent counterpart by up to 18.3% in execution time while being up to 19°C cooler.