对全局循环转换的内存层次结构使用估计

Qubo Hu, E. Brockmeyer, M. Palkovic, P. G. Kjeldsberg, F. Catthoor
{"title":"对全局循环转换的内存层次结构使用估计","authors":"Qubo Hu, E. Brockmeyer, M. Palkovic, P. G. Kjeldsberg, F. Catthoor","doi":"10.1109/NORCHP.2004.1423883","DOIUrl":null,"url":null,"abstract":"Major parts of the power dissipation for data dominated embedded system is due to huge amounts of data transfers to and from large consuming data memories. Global loop transformations play a crucial role in optimizing the memory accesses. By improving regularity and temporal locality of these memory accesses using loop transformations, data can be potentially stored closer to the datapath in smaller less power consuming memories. To steer the selection of which loop transformations to perform, high level memory estimation is used. In state of the art memory estimation techniques the mapping of data to different parts of the memory hierarchy are not considered since these decisions are made in later system design phases. However, estimates of these mapping decisions is crucial, since they greatly influence the consequences of different transformations. In this paper we propose a systematic methodology for hierarchical memory usage estimation which considers also future data mapping decisions for a memory hierarchy. The goal of the estimation is to evaluate global loop transformation decisions for different platforms and keep only the optimal decision (in terms of energy) for each platform. We demonstrate our methodology on a real- life multimedia video coder (QSD PCM) which shows that for 1k layer one memory, a factor of 2 improvement in total energy can be achieved.","PeriodicalId":208182,"journal":{"name":"Proceedings Norchip Conference, 2004.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Memory hierarchy usage estimation for global loop transformations\",\"authors\":\"Qubo Hu, E. Brockmeyer, M. Palkovic, P. G. Kjeldsberg, F. Catthoor\",\"doi\":\"10.1109/NORCHP.2004.1423883\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Major parts of the power dissipation for data dominated embedded system is due to huge amounts of data transfers to and from large consuming data memories. Global loop transformations play a crucial role in optimizing the memory accesses. By improving regularity and temporal locality of these memory accesses using loop transformations, data can be potentially stored closer to the datapath in smaller less power consuming memories. To steer the selection of which loop transformations to perform, high level memory estimation is used. In state of the art memory estimation techniques the mapping of data to different parts of the memory hierarchy are not considered since these decisions are made in later system design phases. However, estimates of these mapping decisions is crucial, since they greatly influence the consequences of different transformations. In this paper we propose a systematic methodology for hierarchical memory usage estimation which considers also future data mapping decisions for a memory hierarchy. The goal of the estimation is to evaluate global loop transformation decisions for different platforms and keep only the optimal decision (in terms of energy) for each platform. We demonstrate our methodology on a real- life multimedia video coder (QSD PCM) which shows that for 1k layer one memory, a factor of 2 improvement in total energy can be achieved.\",\"PeriodicalId\":208182,\"journal\":{\"name\":\"Proceedings Norchip Conference, 2004.\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Norchip Conference, 2004.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NORCHP.2004.1423883\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Norchip Conference, 2004.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NORCHP.2004.1423883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

数据主导的嵌入式系统功耗的主要部分是由于大量的数据传输到和从大消耗的数据存储器。全局循环转换在优化内存访问中起着至关重要的作用。通过使用循环转换改进这些内存访问的规律性和时间局部性,数据可以存储在更靠近数据路径的更小的、功耗更低的内存中。为了指导选择要执行的循环转换,需要使用高级内存估计。在最先进的内存估计技术中,不考虑数据到内存层次结构不同部分的映射,因为这些决策是在后来的系统设计阶段做出的。然而,对这些映射决策的估计是至关重要的,因为它们极大地影响了不同转换的结果。在本文中,我们提出了一种系统的分层内存使用估计方法,该方法还考虑了未来内存层次结构的数据映射决策。评估的目标是评估不同平台的全局循环转换决策,并仅保留每个平台的最优决策(就能量而言)。我们在现实生活中的多媒体视频编码器(QSD PCM)上演示了我们的方法,结果表明,对于1k的第一层存储器,总能量可以提高2倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Memory hierarchy usage estimation for global loop transformations
Major parts of the power dissipation for data dominated embedded system is due to huge amounts of data transfers to and from large consuming data memories. Global loop transformations play a crucial role in optimizing the memory accesses. By improving regularity and temporal locality of these memory accesses using loop transformations, data can be potentially stored closer to the datapath in smaller less power consuming memories. To steer the selection of which loop transformations to perform, high level memory estimation is used. In state of the art memory estimation techniques the mapping of data to different parts of the memory hierarchy are not considered since these decisions are made in later system design phases. However, estimates of these mapping decisions is crucial, since they greatly influence the consequences of different transformations. In this paper we propose a systematic methodology for hierarchical memory usage estimation which considers also future data mapping decisions for a memory hierarchy. The goal of the estimation is to evaluate global loop transformation decisions for different platforms and keep only the optimal decision (in terms of energy) for each platform. We demonstrate our methodology on a real- life multimedia video coder (QSD PCM) which shows that for 1k layer one memory, a factor of 2 improvement in total energy can be achieved.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信