Comparative benchmarking: matrix multiplication on a multicore coprocessor and a GPU

2015 Computational Electromagnetics International Workshop (CEM) Pub Date : 2015-07-01 DOI:10.1109/CEM.2015.7237429

Md. Salim, Ali O. Akkirman, Mert Hidayetoglu, L. Gurel

引用次数: 11

Abstract

This paper reports the performances of an Intel Xeon Phi coprocessor and an Nvidia Tesla GPU for multiplication of large matrices. For this purpose, various libraries, such as Intel MKL and MAGMA, are employed with different execution modes of the coprocessor. We compare the performances of the coprocessor and the GPU in terms of running time, memory requirement, and programming difficulty for the special case of matrix-matrix multiplication.

查看原文本刊更多论文

比较基准测试:多核协处理器和GPU上的矩阵乘法

本文报道了Intel Xeon Phi协处理器和Nvidia Tesla GPU处理大矩阵乘法的性能。为此，各种库(如Intel MKL和MAGMA)被用于协处理器的不同执行模式。我们比较了协处理器和GPU在运行时间、内存需求和矩阵-矩阵乘法特殊情况下的编程难度方面的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 Computational Electromagnetics International Workshop (CEM)

自引率

0.00%

发文量