自动生成和调整GPU代码稀疏矩阵向量乘法从一个高级表示

GPGPU-4 Pub Date : 2011-03-05 DOI:10.1145/1964179.1964196

Dominik Grewe, Anton Lokhmotov

{"title":"自动生成和调整GPU代码稀疏矩阵向量乘法从一个高级表示","authors":"Dominik Grewe, Anton Lokhmotov","doi":"10.1145/1964179.1964196","DOIUrl":null,"url":null,"abstract":"We propose a system-independent representation of sparse matrix formats that allows a compiler to generate efficient, system-specific code for sparse matrix operations. To show the viability of such a representation we have developed a compiler that generates and tunes code for sparse matrix-vector multiplication (SpMV) on GPUs. We evaluate our framework on six state-of-the-art matrix formats and show that the generated code performs similar to or better than hand-optimized code.","PeriodicalId":317571,"journal":{"name":"GPGPU-4","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"57","resultStr":"{\"title\":\"Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation\",\"authors\":\"Dominik Grewe, Anton Lokhmotov\",\"doi\":\"10.1145/1964179.1964196\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a system-independent representation of sparse matrix formats that allows a compiler to generate efficient, system-specific code for sparse matrix operations. To show the viability of such a representation we have developed a compiler that generates and tunes code for sparse matrix-vector multiplication (SpMV) on GPUs. We evaluate our framework on six state-of-the-art matrix formats and show that the generated code performs similar to or better than hand-optimized code.\",\"PeriodicalId\":317571,\"journal\":{\"name\":\"GPGPU-4\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"57\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GPGPU-4\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1964179.1964196\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GPGPU-4","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1964179.1964196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 57

摘要

我们提出了一种系统无关的稀疏矩阵格式表示，它允许编译器为稀疏矩阵操作生成高效的、系统特定的代码。为了证明这种表示的可行性，我们开发了一个编译器，用于在gpu上生成和调整稀疏矩阵向量乘法(SpMV)的代码。我们在六种最先进的矩阵格式上评估了我们的框架，并表明生成的代码的性能与手工优化的代码相似或更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation

We propose a system-independent representation of sparse matrix formats that allows a compiler to generate efficient, system-specific code for sparse matrix operations. To show the viability of such a representation we have developed a compiler that generates and tunes code for sparse matrix-vector multiplication (SpMV) on GPUs. We evaluate our framework on six state-of-the-art matrix formats and show that the generated code performs similar to or better than hand-optimized code.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

GPGPU-4

自引率

0.00%

发文量