{"title":"A runtime/memory trade-off of the continous Ziggurat method on GPUs","authors":"C. Riesinger, T. Neckel","doi":"10.1109/HPCSim.2015.7237018","DOIUrl":null,"url":null,"abstract":"Pseudo random number generators are intensively used in many computational applications, e.g. the treatment of Uncertainty Quantification problems. For this reason, the optimization of such generators for various hardware architectures is of big interest. We present a runtime/memory trade-off for the popular Ziggurat method with focus on GPUs. Such a trade-off means that the runtime of pseudo random number generation can be reduced by investing more memory and vice versa. Especially GPUs benefit from this approach since it reduces warp divergence which occurs for rejection methods such as the Ziggurat method. To our knowledge, such a trade-off for the Ziggurat method has never been investigated before for GPUs. It is shown that this approach makes the Ziggurat method competitive against well established normal pseudo random number generators on GPUs. Optimal implementations and grid configurations are given for different GPU architectures.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pseudo random number generators are intensively used in many computational applications, e.g. the treatment of Uncertainty Quantification problems. For this reason, the optimization of such generators for various hardware architectures is of big interest. We present a runtime/memory trade-off for the popular Ziggurat method with focus on GPUs. Such a trade-off means that the runtime of pseudo random number generation can be reduced by investing more memory and vice versa. Especially GPUs benefit from this approach since it reduces warp divergence which occurs for rejection methods such as the Ziggurat method. To our knowledge, such a trade-off for the Ziggurat method has never been investigated before for GPUs. It is shown that this approach makes the Ziggurat method competitive against well established normal pseudo random number generators on GPUs. Optimal implementations and grid configurations are given for different GPU architectures.