Improving accuracy of source level timing simulation for GPUs using a probabilistic resource model

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) Pub Date : 2015-07-19 DOI:10.1109/SAMOS.2015.7363655

Christoph Gerum, W. Rosenstiel, O. Bringmann

引用次数: 1

Abstract

After their success in the high performance and desktop market, Graphic Processing Units (GPUs), that can be used for general purpose computing are introduced for embedded systems on a chip (SOCs). Due to some advanced architectural features, like massive simultaneous multithreading, static performance analysis and high-level timing simulation are difficult to apply to code running on these systems. This paper extends a method for performance simulation of GPUs. The method uses automated performance annotations in the application's OpenCL C source code, and an extended performance model for derivation of a kernels runtime from metrics produced by the execution of annotated kernels. The final results are then generated using a probabilistic resource conflict model. The model reaches an accuracy of 90% on most test cases and delivers a higher average accuracy than previous methods.

查看原文本刊更多论文

利用概率资源模型提高gpu源级时序仿真的精度

在高性能和台式机市场取得成功后，用于通用计算的图形处理单元(gpu)被引入到芯片上的嵌入式系统(soc)中。由于一些高级的体系结构特性，如大规模同步多线程、静态性能分析和高级时序模拟，很难应用于在这些系统上运行的代码。本文扩展了一种gpu性能仿真方法。该方法在应用程序的OpenCL C源代码中使用自动性能注释，并使用扩展的性能模型，从执行注释的内核产生的度量中派生内核运行时。然后使用概率资源冲突模型生成最终结果。该模型在大多数测试用例上达到了90%的准确率，并且比以前的方法提供了更高的平均准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

自引率

0.00%

发文量