Enhancing the Performance Portability of Heterogeneous Circuit Analysis Programs

2022 IEEE High Performance Extreme Computing Conference (HPEC) Pub Date : 2022-09-19 DOI:10.1109/HPEC55821.2022.9926380

Tsung-Wei Huang

引用次数: 0

Abstract

Recently, CPU-GPU heterogeneous parallelism has brought transformational performance milestones to static timing analysis (STA) algorithms. As the computing ecosystem continues to proliferate, performance portability has emerged as a new challenge when deploying the result to diverse heterogeneous computing platforms. Specifically, the optimal code written on a CPU-GPU architecture may not be optimal for other CPU-GPU architectures, due to various performance, interoperability, and availability constraints. As a result, we introduce in this paper a learning-based framework to enhance the performance portability of a GPU-accelerated STA program. We parameterize important performance parameters and leverage a neural network model to adapt performance optimization to any given computing platforms. We have demonstrated the effectiveness of our framework in real STA applications.

查看原文本刊更多论文

增强异构电路分析程序的性能可移植性

近年来，CPU-GPU异构并行性为静态时序分析(STA)算法带来了革命性的性能里程碑。随着计算生态系统的不断扩展，在将结果部署到各种异构计算平台时，性能可移植性已经成为一个新的挑战。具体来说，由于各种性能、互操作性和可用性限制，在CPU-GPU架构上编写的最佳代码对于其他CPU-GPU架构可能不是最佳的。因此，我们在本文中引入了一个基于学习的框架来增强gpu加速STA程序的性能可移植性。我们将重要的性能参数参数化，并利用神经网络模型来适应任何给定计算平台的性能优化。我们已经在实际的STA应用程序中展示了我们的框架的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE High Performance Extreme Computing Conference (HPEC)

自引率

0.00%

发文量