Analysis of asymmetric 3D DRAM architecture in combination with L2 cache size reduction

A. Schönberger, K. Hofmann
{"title":"Analysis of asymmetric 3D DRAM architecture in combination with L2 cache size reduction","authors":"A. Schönberger, K. Hofmann","doi":"10.1109/HPCSim.2015.7237030","DOIUrl":null,"url":null,"abstract":"Memory is a heterogeneous complex in modern systems. Access time and bandwidth improvement of DRAM using die-stacking technology can only be evaluated by interacting with hardware components like underlying cache, CPU and software components like executed application and processed input. In this work we analyze encoding and decoding processes of JPEG2000 algorithm execution on MIPS I core for different picture sizes. Thereby we can observe that for picture sizes below particular critical value the DRAM share of execution time reaches max. 4%. Any DRAM improvement for this case would not lead to significant performance gain of whole system. Starting with particular picture size depending on last-level cache size the acceleration effect of cache falls off and DRAM influence rises up to 25% and remains for larger pictures. System-level estimation shows that our suggested 3D DRAM architecture can reduce that rise down to a third and is partially able to adopt cache functionality.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"260 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Memory is a heterogeneous complex in modern systems. Access time and bandwidth improvement of DRAM using die-stacking technology can only be evaluated by interacting with hardware components like underlying cache, CPU and software components like executed application and processed input. In this work we analyze encoding and decoding processes of JPEG2000 algorithm execution on MIPS I core for different picture sizes. Thereby we can observe that for picture sizes below particular critical value the DRAM share of execution time reaches max. 4%. Any DRAM improvement for this case would not lead to significant performance gain of whole system. Starting with particular picture size depending on last-level cache size the acceleration effect of cache falls off and DRAM influence rises up to 25% and remains for larger pictures. System-level estimation shows that our suggested 3D DRAM architecture can reduce that rise down to a third and is partially able to adopt cache functionality.
非对称3D DRAM架构与L2缓存大小缩减的结合分析
在现代系统中,记忆是一个异构的复合体。使用芯片堆叠技术的DRAM的访问时间和带宽改进只能通过与底层缓存、CPU等硬件组件和执行应用程序和处理输入等软件组件的交互来评估。本文分析了JPEG2000算法在MIPS I核上对不同图像尺寸的编码和解码过程。因此,我们可以观察到,对于低于特定临界值的图片大小,DRAM的执行时间份额达到最大值。4%。在这种情况下,任何DRAM的改进都不会导致整个系统的性能显著提高。根据最后一级缓存大小,从特定的图片大小开始,缓存的加速效果下降,DRAM的影响上升到25%,并且对于更大的图片仍然保持不变。系统级估计表明,我们建议的3D DRAM架构可以将上升幅度降低到三分之一,并且部分能够采用缓存功能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信