Tatum: Parallel Timing Analysis for Faster Design Cycles and Improved Optimization

2018 International Conference on Field-Programmable Technology (FPT) Pub Date : 2018-12-01 DOI:10.1109/FPT.2018.00026

Kevin E. Murray, Vaughn Betz

引用次数: 15

Abstract

Static Timing Analysis (STA) is used to evaluate the correctness and performance of a digital circuit implementation. In addition to final sign-off checks, STA is called numerous times during placement and routing to guide optimization. As a result, STA consumes a significant fraction of the time required for design implementation; to make progress reducing FPGA compile times we need faster STA. We evaluate the suitability of both GPU and multi-core CPU platforms for accelerating STA. On core STA algorithms our GPU kernel achieves a 6.2 times kernel speed-up but data transfer overhead reduces this to 0.9 times. Our best CPU implementation achieves a 9.2 times parallel speed-up on 32 cores, yielding a 15.2 times overall speed-up compared to the VPR analyzer, and a 6.9 times larger parallel speed-up than a recent parallel ASIC timing analyzer. We then show how reducing the run-time cost of STA can be leveraged to improve optimization quality, reducing critical path delay by 4%.

查看原文本刊更多论文

并行时序分析，更快的设计周期和改进的优化

静态时序分析(STA)用于评估数字电路实现的正确性和性能。除了最后的签字检查外，在放置和路由过程中还会多次调用STA来指导优化。因此，STA消耗了设计实现所需时间的很大一部分;为了减少FPGA编译时间，我们需要更快的STA。我们评估了GPU和多核CPU平台对加速STA的适用性。在核心STA算法上，我们的GPU内核实现了6.2倍的内核加速，但数据传输开销将其降低到0.9倍。我们最好的CPU实现在32核上实现了9.2倍的并行加速，与VPR分析仪相比，总体速度提高了15.2倍，比最近的并行ASIC定时分析仪并行速度提高了6.9倍。然后，我们展示了如何利用减少STA的运行时成本来提高优化质量，将关键路径延迟减少4%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 International Conference on Field-Programmable Technology (FPT)

自引率

0.00%

发文量