使用CUDA在GPU上进行高性能计算和模拟

2012 International Conference on High Performance Computing & Simulation (HPCS) Pub Date : 2012-07-02 DOI:10.1109/HPCSim.2012.6266884

M. Ujaldón

{"title":"使用CUDA在GPU上进行高性能计算和模拟","authors":"M. Ujaldón","doi":"10.1109/HPCSim.2012.6266884","DOIUrl":null,"url":null,"abstract":"The computational power and memory bandwidth of graphics processing units (GPUs) have turned them into attractive platforms for general-purpose applications at significant speed gains versus their CPU counterparts [1]. In addition, an increasing number of today's state-of-the-art supercomputers include commodity GPUs to bring us unprecedented levels of performance in terms of raw GFLOPS and GFLOPS/cost. In this paper, we provide an introduction to CUDA programming paradigm with an emphasis on simulations which can exploit SIMD parallelism and high memory bandwidth on GPUs. OpenCL is also briefly described as a recent standardization effort to set up an open standard API for general-purpose manycore architectures.","PeriodicalId":428764,"journal":{"name":"2012 International Conference on High Performance Computing & Simulation (HPCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"High performance computing and simulations on the GPU using CUDA\",\"authors\":\"M. Ujaldón\",\"doi\":\"10.1109/HPCSim.2012.6266884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The computational power and memory bandwidth of graphics processing units (GPUs) have turned them into attractive platforms for general-purpose applications at significant speed gains versus their CPU counterparts [1]. In addition, an increasing number of today's state-of-the-art supercomputers include commodity GPUs to bring us unprecedented levels of performance in terms of raw GFLOPS and GFLOPS/cost. In this paper, we provide an introduction to CUDA programming paradigm with an emphasis on simulations which can exploit SIMD parallelism and high memory bandwidth on GPUs. OpenCL is also briefly described as a recent standardization effort to set up an open standard API for general-purpose manycore architectures.\",\"PeriodicalId\":428764,\"journal\":{\"name\":\"2012 International Conference on High Performance Computing & Simulation (HPCS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 International Conference on High Performance Computing & Simulation (HPCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HPCSim.2012.6266884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2012.6266884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

摘要

图形处理单元(gpu)的计算能力和内存带宽使它们成为通用应用程序的有吸引力的平台，与CPU相比，速度有显著提高[1]。此外，越来越多的当今最先进的超级计算机包括商品gpu，为我们带来前所未有的性能水平，在原始GFLOPS和GFLOPS/成本方面。在本文中，我们介绍了CUDA编程范例，重点介绍了可以在gpu上利用SIMD并行性和高内存带宽的仿真。OpenCL还被简要描述为最近的一项标准化工作，旨在为通用多核架构建立开放标准API。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

High performance computing and simulations on the GPU using CUDA

The computational power and memory bandwidth of graphics processing units (GPUs) have turned them into attractive platforms for general-purpose applications at significant speed gains versus their CPU counterparts [1]. In addition, an increasing number of today's state-of-the-art supercomputers include commodity GPUs to bring us unprecedented levels of performance in terms of raw GFLOPS and GFLOPS/cost. In this paper, we provide an introduction to CUDA programming paradigm with an emphasis on simulations which can exploit SIMD parallelism and high memory bandwidth on GPUs. OpenCL is also briefly described as a recent standardization effort to set up an open standard API for general-purpose manycore architectures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 International Conference on High Performance Computing & Simulation (HPCS)

自引率

0.00%

发文量