FastFwd: An efficient hardware acceleration technique for trace-driven network-on-chip simulation

2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) Pub Date : 2010-10-24 DOI:10.1145/1878961.1879006

G. Krishnaiah, B. Silpa, P. Panda, Anshul Kumar

{"title":"FastFwd: An efficient hardware acceleration technique for trace-driven network-on-chip simulation","authors":"G. Krishnaiah, B. Silpa, P. Panda, Anshul Kumar","doi":"10.1145/1878961.1879006","DOIUrl":null,"url":null,"abstract":"We present an efficient emulation-based technique to accelerate architecture exploration of networks-on-chip (NoCs). The large design space of NoC along with its growing complexity that results in low simulation speeds on host machines have motivated the need for hardware accelerators for speeding up the simulation. For example, simulation of applications with real life problem sizes could take weeks on a host machine. FPGA acceleration is a promising strategy for speeding up NoC simulations by several orders of magnitude. However, it is required to simulate a few billion network transactions of the application during NoC exploration, and this could still take tens of minutes even with an FPGA-based emulator. With the increasing complexity of architectures and applications, reducing emulation time is a key concern. We propose a technique, FastFwd, to minimize emulation time by efficiently identifying and eliminating redundant cycles during a trace-based NoC simulation. We have studied the implications of the additional FPGA hardware required for implementing our technique. A naïve implementation could lead to poor scalability and increase the required DRAM bandwidth, both of which ultimately impact the emulation speed negatively. We propose a hierarchical controller architecture to resolve the scalability issue, and a compressed representation of traces for mitigating the increased DRAM bandwidth requirement. Our experiments with several benchmarks have shown that the FPGA emulation with our technique reduces the average emulation time by a factor of 2 when compared to a conventional emulation.","PeriodicalId":118816,"journal":{"name":"2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1878961.1879006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

We present an efficient emulation-based technique to accelerate architecture exploration of networks-on-chip (NoCs). The large design space of NoC along with its growing complexity that results in low simulation speeds on host machines have motivated the need for hardware accelerators for speeding up the simulation. For example, simulation of applications with real life problem sizes could take weeks on a host machine. FPGA acceleration is a promising strategy for speeding up NoC simulations by several orders of magnitude. However, it is required to simulate a few billion network transactions of the application during NoC exploration, and this could still take tens of minutes even with an FPGA-based emulator. With the increasing complexity of architectures and applications, reducing emulation time is a key concern. We propose a technique, FastFwd, to minimize emulation time by efficiently identifying and eliminating redundant cycles during a trace-based NoC simulation. We have studied the implications of the additional FPGA hardware required for implementing our technique. A naïve implementation could lead to poor scalability and increase the required DRAM bandwidth, both of which ultimately impact the emulation speed negatively. We propose a hierarchical controller architecture to resolve the scalability issue, and a compressed representation of traces for mitigating the increased DRAM bandwidth requirement. Our experiments with several benchmarks have shown that the FPGA emulation with our technique reduces the average emulation time by a factor of 2 when compared to a conventional emulation.

查看原文本刊更多论文

FastFwd:一种有效的硬件加速技术，用于跟踪驱动的片上网络仿真

我们提出了一种有效的基于仿真的技术来加速片上网络(noc)的架构探索。NoC的巨大设计空间及其日益增长的复杂性导致主机上的模拟速度较低，这促使人们需要硬件加速器来加速模拟。例如，在主机上模拟具有实际问题大小的应用程序可能需要数周时间。FPGA加速是一种很有前途的策略，可以将NoC模拟的速度提高几个数量级。然而，在NoC勘探期间，需要模拟应用程序的数十亿个网络事务，即使使用基于fpga的模拟器，这仍然需要数十分钟。随着体系结构和应用程序的日益复杂，减少仿真时间是一个关键问题。我们提出了一种技术，FastFwd，通过在基于跟踪的NoC仿真中有效识别和消除冗余周期来最小化仿真时间。我们已经研究了实现我们的技术所需的额外FPGA硬件的含义。naïve实现可能导致较差的可伸缩性并增加所需的DRAM带宽，这两者最终都会对仿真速度产生负面影响。我们提出了一种分层控制器架构来解决可扩展性问题，并提出了一种压缩的跟踪表示来减轻DRAM带宽需求的增加。我们对几个基准测试的实验表明，与传统仿真相比，使用我们的技术的FPGA仿真将平均仿真时间减少了2倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

自引率

0.00%

发文量