自适应缓存升温更快的模拟

Gustaf Borgström, Andreas Sembrant, D. Black-Schaffer
{"title":"自适应缓存升温更快的模拟","authors":"Gustaf Borgström, Andreas Sembrant, D. Black-Schaffer","doi":"10.1145/3023973.3023974","DOIUrl":null,"url":null,"abstract":"The use of hardware-based virtualization allows modern simulators to very quickly fast-forward between sample points and regions of interest. This dramatically reduces the simulation time compared to traditional functional forwarding. However, as the fast-forwarding takes place through virtualized execution on the native hardware, it is unable to warm simulated structures, such as caches. As a result, sampled simulations taking advantage of virtualization for fast-forwarding find their execution time dominated by functional warming. To address the cost of warming, we present Adaptive Cache Warming (ACW), a new fast method that determines how much warming each sample/phase/application needs. ACW takes advantage of the virtualization-based fast-forwarding to search for the minimum warming time required during simulation. To determine when the cache is sufficiently warm, ACW uses heuristics based on the last-level cache's cold-set misses. Our results show that typical practice of conservatively warming last-level caches for around 100M instructions is a vast overkill for nearly all checkpoints. By using ACW, we can adapt the warming per-sample and speedup the simulation by 92--103× on average (512× speedup maximum) depending on cache size (2-32MB).","PeriodicalId":131314,"journal":{"name":"Proceedings of the 9th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Adaptive Cache Warming for Faster Simulations\",\"authors\":\"Gustaf Borgström, Andreas Sembrant, D. Black-Schaffer\",\"doi\":\"10.1145/3023973.3023974\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of hardware-based virtualization allows modern simulators to very quickly fast-forward between sample points and regions of interest. This dramatically reduces the simulation time compared to traditional functional forwarding. However, as the fast-forwarding takes place through virtualized execution on the native hardware, it is unable to warm simulated structures, such as caches. As a result, sampled simulations taking advantage of virtualization for fast-forwarding find their execution time dominated by functional warming. To address the cost of warming, we present Adaptive Cache Warming (ACW), a new fast method that determines how much warming each sample/phase/application needs. ACW takes advantage of the virtualization-based fast-forwarding to search for the minimum warming time required during simulation. To determine when the cache is sufficiently warm, ACW uses heuristics based on the last-level cache's cold-set misses. Our results show that typical practice of conservatively warming last-level caches for around 100M instructions is a vast overkill for nearly all checkpoints. By using ACW, we can adapt the warming per-sample and speedup the simulation by 92--103× on average (512× speedup maximum) depending on cache size (2-32MB).\",\"PeriodicalId\":131314,\"journal\":{\"name\":\"Proceedings of the 9th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 9th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3023973.3023974\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3023973.3023974","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

使用基于硬件的虚拟化允许现代模拟器在采样点和感兴趣的区域之间非常快速地快速前进。与传统的功能转发相比,这大大减少了模拟时间。但是,由于快速转发是通过本地硬件上的虚拟化执行进行的,因此无法加热模拟结构,例如缓存。因此,利用虚拟化进行快速转发的采样模拟发现,它们的执行时间主要受功能升温的影响。为了解决升温成本问题,我们提出了自适应缓存升温(ACW),这是一种新的快速方法,可以确定每个样本/阶段/应用需要多少升温。ACW利用基于虚拟化的快速转发来搜索仿真期间所需的最小升温时间。为了确定何时缓存足够热,ACW使用基于最后一级缓存冷集未命中的启发式方法。我们的结果表明,对于几乎所有检查点来说,保守地为大约100M指令预热最后一级缓存的典型做法都是过度的。通过使用ACW,我们可以调整每个样本的升温,并根据缓存大小(2-32MB)平均将模拟加速92- 103倍(最大加速512倍)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Adaptive Cache Warming for Faster Simulations
The use of hardware-based virtualization allows modern simulators to very quickly fast-forward between sample points and regions of interest. This dramatically reduces the simulation time compared to traditional functional forwarding. However, as the fast-forwarding takes place through virtualized execution on the native hardware, it is unable to warm simulated structures, such as caches. As a result, sampled simulations taking advantage of virtualization for fast-forwarding find their execution time dominated by functional warming. To address the cost of warming, we present Adaptive Cache Warming (ACW), a new fast method that determines how much warming each sample/phase/application needs. ACW takes advantage of the virtualization-based fast-forwarding to search for the minimum warming time required during simulation. To determine when the cache is sufficiently warm, ACW uses heuristics based on the last-level cache's cold-set misses. Our results show that typical practice of conservatively warming last-level caches for around 100M instructions is a vast overkill for nearly all checkpoints. By using ACW, we can adapt the warming per-sample and speedup the simulation by 92--103× on average (512× speedup maximum) depending on cache size (2-32MB).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信