Racing to Hardware-Validated Simulation

Almutaz Adileh, Cecilia González-Alvarez, J. Ruiz, L. Eeckhout
{"title":"Racing to Hardware-Validated Simulation","authors":"Almutaz Adileh, Cecilia González-Alvarez, J. Ruiz, L. Eeckhout","doi":"10.1109/ISPASS.2019.00014","DOIUrl":null,"url":null,"abstract":"Processor simulators rely on detailed timing models of the processor pipeline to evaluate performance. The diversity in real-world processor designs mandates building flexible simulators that expose parts of the underlying model to the user in the form of configurable parameters. Consequently, the accuracy of modeling a real processor relies on both the accuracy of the pipeline model itself, and the accuracy of adjusting the configuration parameters according to the modeled processor. Unfortunately, processor vendors publicly disclose only a subset of their design decisions, raising the probability of introducing specification inaccuracies when modeling these processors. Inaccurately tuning model parameters deviates the simulated processor from the actual one. In the worst case, using improper parameters may lead to imbalanced pipeline models compromising the simulation output. Therefore, simulation models should be hardware-validated before using them for performance evaluation. As processors increase in complexity and diversity, validating a simulator model against real hardware becomes increasingly more challenging and time-consuming. In this work, we propose a methodology for validating simulation models against real hardware. We create a framework that relies on micro-benchmarks to collect performance statistics on real hardware, and machine learning-based algorithms to fine-tune the unknown parameters based on the accumulated statistics. We overhaul the Sniper simulator to support the ARM AArch64 instruction-set architecture (ISA), and introduce two new timing models for ARM-based in-order and out-of-order cores. Using our proposed simulator validation framework, we tune the in-order and out-of-order models to match the performance of a real-world implementation of the Cortex-A53 and Cortex-A72 cores with an average error of 7% and 15%, respectively, across a set of SPEC CPU2017 benchmarks.","PeriodicalId":137786,"journal":{"name":"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPASS.2019.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Processor simulators rely on detailed timing models of the processor pipeline to evaluate performance. The diversity in real-world processor designs mandates building flexible simulators that expose parts of the underlying model to the user in the form of configurable parameters. Consequently, the accuracy of modeling a real processor relies on both the accuracy of the pipeline model itself, and the accuracy of adjusting the configuration parameters according to the modeled processor. Unfortunately, processor vendors publicly disclose only a subset of their design decisions, raising the probability of introducing specification inaccuracies when modeling these processors. Inaccurately tuning model parameters deviates the simulated processor from the actual one. In the worst case, using improper parameters may lead to imbalanced pipeline models compromising the simulation output. Therefore, simulation models should be hardware-validated before using them for performance evaluation. As processors increase in complexity and diversity, validating a simulator model against real hardware becomes increasingly more challenging and time-consuming. In this work, we propose a methodology for validating simulation models against real hardware. We create a framework that relies on micro-benchmarks to collect performance statistics on real hardware, and machine learning-based algorithms to fine-tune the unknown parameters based on the accumulated statistics. We overhaul the Sniper simulator to support the ARM AArch64 instruction-set architecture (ISA), and introduce two new timing models for ARM-based in-order and out-of-order cores. Using our proposed simulator validation framework, we tune the in-order and out-of-order models to match the performance of a real-world implementation of the Cortex-A53 and Cortex-A72 cores with an average error of 7% and 15%, respectively, across a set of SPEC CPU2017 benchmarks.
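To make the validation idea concrete, the sketch below is a minimal illustration under assumptions, not the paper's actual framework: it stands in a toy analytical simulator with two unknown parameters (rob_size and issue_width, both hypothetical names) and placeholder measured CPI values for a few micro-benchmarks, then uses a plain random search to minimize the error between simulated and measured CPI. The paper itself collects statistics from micro-benchmarks on real Cortex-A53/A72 hardware and tunes Sniper's parameters with machine-learning-based algorithms, which this sketch does not reproduce.

```python
import random

# Hypothetical CPI measured on the real core for a few micro-benchmarks
# (placeholder values, not taken from the paper).
measured_cpi = {"dep_chain": 1.9, "load_stream": 2.4, "branch_mix": 1.3}

def simulate_cpi(benchmark, rob_size, issue_width):
    """Toy stand-in for a detailed simulator: CPI improves with a larger ROB
    and a wider issue width. A real flow would invoke Sniper here."""
    base = {"dep_chain": 1.5, "load_stream": 1.8, "branch_mix": 1.0}[benchmark]
    return base + 64.0 / rob_size + 1.0 / issue_width

def mean_abs_error(params):
    """Average absolute CPI error of the simulated model against hardware."""
    rob_size, issue_width = params
    return sum(abs(simulate_cpi(b, rob_size, issue_width) - cpi)
               for b, cpi in measured_cpi.items()) / len(measured_cpi)

# Simple random search over the unknown micro-architectural parameters;
# the paper uses machine-learning-based tuning instead of this brute force.
best = None
for _ in range(2000):
    candidate = (random.randrange(32, 257, 8), random.choice([1, 2, 3, 4]))
    if best is None or mean_abs_error(candidate) < mean_abs_error(best):
        best = candidate

print("best (rob_size, issue_width):", best)
print("mean abs CPI error:", round(mean_abs_error(best), 3))
```

The same error-minimization loop generalizes directly: swap the toy model for a real simulator invocation and the random search for a learned model that predicts which parameter settings reduce the remaining error.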