Implementation of Variable Preconditioned GCR with mixed precision on GPU using CUDA
S. Ikuno, N. Fujita, Susumu Yamamoto, S. Nakata
Digests of the 2010 14th Biennial IEEE Conference on Electromagnetic Field Computation, 2010-05-09
DOI: 10.1109/CEFC.2010.5481534 (https://doi.org/10.1109/CEFC.2010.5481534)
Abstract
Variable Preconditioned GCR (VPGCR) with mixed precision is implemented on a Graphics Processing Unit (GPU) using Compute Unified Device Architecture (CUDA) and investigated numerically. By the convergence theorem of VPGCR, it suffices to solve the residual equation of the preconditioning procedure in single precision. The computational results show that VPGCR with mixed-precision arithmetic on the GPU significantly outperforms the Central Processing Unit (CPU) implementation: it is 22.53 times faster than on the CPU.
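To make the mixed-precision idea concrete, the following is a minimal sketch, not the authors' CUDA implementation. It runs on the CPU in pure Python, emulates single precision by rounding to float32 with `struct`, and uses a few Jacobi sweeps as the inner solver. All of these are assumptions made for illustration, as are the names `f32`, `inner_solve_single`, and `vpgcr`. The outer GCR loop runs in double precision; the inner solve only has to produce a rough solution `z` of `A z = r`. The sufficient condition usually stated for variable preconditioning is `||r - A z|| < ||r||`, which is assumed here to be the theorem the abstract refers to.

```python
import struct


def f32(x):
    """Round a double to the nearest single-precision value (emulation)."""
    return struct.unpack("f", struct.pack("f", x))[0]


def matvec(A, v):
    """Dense matrix-vector product in double precision."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def inner_solve_single(A, r, sweeps=5):
    """Roughly solve A z = r with Jacobi sweeps, rounding every result to float32.

    Jacobi is an assumption of this sketch; any rough solver that reduces the
    inner residual below ||r|| could take its place.
    """
    n = len(r)
    A32 = [[f32(a) for a in row] for row in A]
    r32 = [f32(x) for x in r]
    z = [0.0] * n
    for _ in range(sweeps):
        z_new = []
        for i in range(n):
            s = r32[i]
            for j in range(n):
                if j != i:
                    s = f32(s - f32(A32[i][j] * z[j]))
            z_new.append(f32(s / A32[i][i]))
        z = z_new
    return z


def vpgcr(A, b, tol=1e-12, max_iter=50):
    """Outer GCR in double precision with a single-precision inner solve."""
    x = [0.0] * len(b)
    r = list(b)  # residual for the initial guess x = 0
    b_norm = dot(b, b) ** 0.5
    ps, qs = [], []  # search directions p_i and their images q_i = A p_i
    for k in range(max_iter):
        if dot(r, r) ** 0.5 <= tol * b_norm:
            return x, k
        z = inner_solve_single(A, r)  # variable preconditioning step
        p, q = z, matvec(A, z)
        # Make q orthogonal to all previous q_i (full GCR, no restart).
        for p_i, q_i in zip(ps, qs):
            beta = -dot(q, q_i) / dot(q_i, q_i)
            p = [a + beta * c for a, c in zip(p, p_i)]
            q = [a + beta * c for a, c in zip(q, q_i)]
        alpha = dot(r, q) / dot(q, q)  # minimises ||r - alpha * q||
        x = [a + alpha * c for a, c in zip(x, p)]
        r = [a - alpha * c for a, c in zip(r, q)]
        ps.append(p)
        qs.append(q)
    return x, max_iter
```

Calling `vpgcr(A, b)` on a small diagonally dominant system returns the approximate solution and the number of outer iterations used. In a GPU version, the inner solve is the natural candidate for single-precision kernels, since it accounts for most of the arithmetic while the outer double-precision loop preserves the final accuracy.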