gpu上稀疏矩阵向量乘法的CUDA参数自动调优

2010 International Conference on Computational and Information Sciences Pub Date : 2010-12-17 DOI:10.1109/ICCIS.2010.285

Ping Guo, Liqiang Wang

{"title":"gpu上稀疏矩阵向量乘法的CUDA参数自动调优","authors":"Ping Guo, Liqiang Wang","doi":"10.1109/ICCIS.2010.285","DOIUrl":null,"url":null,"abstract":"Graphics Processing Unit (GPU) has become an attractive coprocessor for scientific computing due to its massive processing capability. The sparse matrix-vector multiplication (SpMV) is a critical operation in a wide variety of scientific and engineering applications, such as sparse linear algebra and image processing. This paper presents an auto-tuning framework that can automatically compute and select CUDA parameters for SpMV to obtain the optimal performance on specific GPUs. The framework is evaluated on two NVIDIA GPU platforms, GeForce 9500 GTX and GeForce GTX 295.","PeriodicalId":227848,"journal":{"name":"2010 International Conference on Computational and Information Sciences","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"52","resultStr":"{\"title\":\"Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs\",\"authors\":\"Ping Guo, Liqiang Wang\",\"doi\":\"10.1109/ICCIS.2010.285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graphics Processing Unit (GPU) has become an attractive coprocessor for scientific computing due to its massive processing capability. The sparse matrix-vector multiplication (SpMV) is a critical operation in a wide variety of scientific and engineering applications, such as sparse linear algebra and image processing. This paper presents an auto-tuning framework that can automatically compute and select CUDA parameters for SpMV to obtain the optimal performance on specific GPUs. The framework is evaluated on two NVIDIA GPU platforms, GeForce 9500 GTX and GeForce GTX 295.\",\"PeriodicalId\":227848,\"journal\":{\"name\":\"2010 International Conference on Computational and Information Sciences\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"52\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Computational and Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIS.2010.285\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Computational and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIS.2010.285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 52

摘要

图形处理器(GPU)由于其庞大的处理能力，已成为科学计算领域中极具吸引力的协处理器。稀疏矩阵-向量乘法(SpMV)是一项在各种科学和工程应用中非常重要的运算，如稀疏线性代数和图像处理。本文提出了一个自动调优框架，可以自动计算和选择SpMV的CUDA参数，以在特定gpu上获得最佳性能。该框架在两个NVIDIA GPU平台GeForce 9500 GTX和GeForce GTX 295上进行了评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs

Graphics Processing Unit (GPU) has become an attractive coprocessor for scientific computing due to its massive processing capability. The sparse matrix-vector multiplication (SpMV) is a critical operation in a wide variety of scientific and engineering applications, such as sparse linear algebra and image processing. This paper presents an auto-tuning framework that can automatically compute and select CUDA parameters for SpMV to obtain the optimal performance on specific GPUs. The framework is evaluated on two NVIDIA GPU platforms, GeForce 9500 GTX and GeForce GTX 295.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2010 International Conference on Computational and Information Sciences

自引率

0.00%

发文量