Solving tridiagonal systems on a GPU

20th Annual International Conference on High Performance Computing. Pub Date: 2013-12-01. DOI: 10.1109/HiPC.2013.6799117

B. J. Murphy


Citations: 5

Abstract


We implement a parallel tridiagonal solver based on cyclic reduction (CR) for a graphics processing unit (GPU). The bane of such solvers is a low computation-to-communication ratio. With this as our main consideration, we focus our effort on lowering communication costs, and in so doing we accelerate system solving. Further, in the diagonally dominant case, computation is decoupled into independent partitions, allowing for efficient processing of larger systems.
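For readers unfamiliar with cyclic reduction, the following is a minimal sequential sketch of the textbook algorithm, not the GPU implementation described in the abstract. The function name `cyclic_reduction`, the array names `a`, `b`, `c`, `d` (sub-diagonal, diagonal, super-diagonal, right-hand side) and the restriction to systems of size 2^k - 1 are assumptions made for illustration. Within each level, the iterations of the inner loop are independent of one another, which is the parallelism a GPU solver would presumably exploit; the reads of neighbouring rows at distance `s` are where the communication cost arises.

```python
def cyclic_reduction(a, b, c, d):
    """Solve a[i]*x[i-1] + b[i]*x[i] + c[i]*x[i+1] = d[i] by cyclic reduction.

    Illustrative sketch only. Assumes len(b) == 2**k - 1, a[0] == 0,
    c[-1] == 0, and that no pivot b[i] becomes zero (for example, a
    diagonally dominant system).
    """
    n = len(b)
    if n == 0 or (n & (n + 1)) != 0:
        raise ValueError("this sketch requires n = 2**k - 1")
    # Work on copies so the caller's lists are not modified.
    a, b, c, d = list(a), list(b), list(c), list(d)

    # Forward reduction: at stride s, eliminate the neighbours at distance s
    # from every row i with (i + 1) divisible by 2*s.
    s = 1
    while 2 * s <= n:
        for i in range(2 * s - 1, n, 2 * s):  # iterations are independent
            alpha = a[i] / b[i - s]
            gamma = c[i] / b[i + s]
            b[i] -= alpha * c[i - s] + gamma * a[i + s]
            d[i] -= alpha * d[i - s] + gamma * d[i + s]
            a[i] = -alpha * a[i - s]  # row i now couples to x[i - 2*s]
            c[i] = -gamma * c[i + s]  # and to x[i + 2*s]
        s *= 2

    # Back substitution: solve the single remaining middle equation first,
    # then fill in the rows that were skipped at each coarser level.
    x = [0.0] * n
    s = (n + 1) // 2
    while s >= 1:
        for i in range(s - 1, n, 2 * s):  # iterations are independent
            left = x[i - s] if i - s >= 0 else 0.0
            right = x[i + s] if i + s < n else 0.0
            x[i] = (d[i] - a[i] * left - c[i] * right) / b[i]
        s //= 2
    return x
```

Calling `cyclic_reduction(a, b, c, d)` with four lists of equal length 2^k - 1 returns the solution as a list of floats. The sketch performs about log2(n) forward levels and the same number of backward levels, each touching a shrinking (then growing) set of rows.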
