{"title":"Solving tridiagonal systems on a GPU","authors":"B. J. Murphy","doi":"10.1109/HiPC.2013.6799117","DOIUrl":null,"url":null,"abstract":"We implement a parallel tridiagonal solver based on cyclic reduction (CR) for a graphics processing unit (GPU). The bane of such solvers is a low computation to communication ratio. With this our main consideration we focus our effort on lowering communication costs. In so doing we accelerate system solving. Further, in the diagonally dominant case computation is decoupled into independent partitions allowing for efficient processing of larger systems.","PeriodicalId":206307,"journal":{"name":"20th Annual International Conference on High Performance Computing","volume":"3 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"20th Annual International Conference on High Performance Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HiPC.2013.6799117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
We implement a parallel tridiagonal solver based on cyclic reduction (CR) for a graphics processing unit (GPU). The bane of such solvers is a low computation to communication ratio. With this our main consideration we focus our effort on lowering communication costs. In so doing we accelerate system solving. Further, in the diagonally dominant case computation is decoupled into independent partitions allowing for efficient processing of larger systems.