Pei-Jen Wang, C. Liu, Chia-Heng Tu, Chen-Pang Lee, Shih-Hao Hung
{"title":"Acceleration of Monte-Carlo simulation on high performance computing platforms","authors":"Pei-Jen Wang, C. Liu, Chia-Heng Tu, Chen-Pang Lee, Shih-Hao Hung","doi":"10.1145/3264746.3264765","DOIUrl":null,"url":null,"abstract":"Monte Carlo methods are often used to solve computational problems with randomness. The random sampling helps avoid the deterministic results, but it requires intensive computations to obtain the results. Several attempts have been made to boost the performance of the Monte Carlo based algorithms by taking advantage of the parallel computers. In this paper, we use the photonic simulation application, MCML, as a case study to 1) parallelize the Monte Carlo method with OpenMP and vectorization, 2) compare the parallelization techniques, and 3) evaluate the parallelized programs on the platforms with the Xeon Phi processor. In particular, the OpenMP version incorporates the vectorization technique that utilizes the AVX-512 vector instructions on the Xeon Phi processor. Our experimental results show that the OpenMP code achieves up to 345x speedup on the Xeon Phi processor, compared with the original code runs on the Xeon E5 processor.","PeriodicalId":186790,"journal":{"name":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 Conference on Research in Adaptive and Convergent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3264746.3264765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Monte Carlo methods are often used to solve computational problems with randomness. The random sampling helps avoid the deterministic results, but it requires intensive computations to obtain the results. Several attempts have been made to boost the performance of the Monte Carlo based algorithms by taking advantage of the parallel computers. In this paper, we use the photonic simulation application, MCML, as a case study to 1) parallelize the Monte Carlo method with OpenMP and vectorization, 2) compare the parallelization techniques, and 3) evaluate the parallelized programs on the platforms with the Xeon Phi processor. In particular, the OpenMP version incorporates the vectorization technique that utilizes the AVX-512 vector instructions on the Xeon Phi processor. Our experimental results show that the OpenMP code achieves up to 345x speedup on the Xeon Phi processor, compared with the original code runs on the Xeon E5 processor.