Bin Huang, Anjun Liu, Min Tian, Jingshan Pan, Yu Zhang
{"title":"基于CUDA的格子玻尔兹曼方法软件Palabos并行性能及优化","authors":"Bin Huang, Anjun Liu, Min Tian, Jingshan Pan, Yu Zhang","doi":"10.1145/3546000.3546014","DOIUrl":null,"url":null,"abstract":"The open-source fluid dynamics software Palabos based on the Lattice Boltzmann Method (LBM) has been widely used in porous media, biological fluids, free interfaces and other physical problems. Palabos has excellent MPI parallel performance and can complete large scale computation of computational fluid dynamics. To realize the heterogeneous parallelism of Palabos, we test the performance of Palabos on a large-scale simulation on a general-purpose cluster at first. The experimental results show that Palabos has a good performance in 16000 MPI processes. Then, we designed a CUDA parallel optimization algorithm for the case of cavity flow according to address mapping and shared memory optimization. Numerical experiments results show that the speedup ratio can achieve about a 1.5x acceleration ratio when the number of the grid is 512×512×512.","PeriodicalId":196955,"journal":{"name":"Proceedings of the 6th International Conference on High Performance Compilation, Computing and Communications","volume":"54 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Parallel Performance and Optimization of the Lattice Boltzmann Method Software Palabos Using CUDA\",\"authors\":\"Bin Huang, Anjun Liu, Min Tian, Jingshan Pan, Yu Zhang\",\"doi\":\"10.1145/3546000.3546014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The open-source fluid dynamics software Palabos based on the Lattice Boltzmann Method (LBM) has been widely used in porous media, biological fluids, free interfaces and other physical problems. Palabos has excellent MPI parallel performance and can complete large scale computation of computational fluid dynamics. To realize the heterogeneous parallelism of Palabos, we test the performance of Palabos on a large-scale simulation on a general-purpose cluster at first. The experimental results show that Palabos has a good performance in 16000 MPI processes. Then, we designed a CUDA parallel optimization algorithm for the case of cavity flow according to address mapping and shared memory optimization. Numerical experiments results show that the speedup ratio can achieve about a 1.5x acceleration ratio when the number of the grid is 512×512×512.\",\"PeriodicalId\":196955,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on High Performance Compilation, Computing and Communications\",\"volume\":\"54 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on High Performance Compilation, Computing and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3546000.3546014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on High Performance Compilation, Computing and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3546000.3546014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Parallel Performance and Optimization of the Lattice Boltzmann Method Software Palabos Using CUDA
The open-source fluid dynamics software Palabos based on the Lattice Boltzmann Method (LBM) has been widely used in porous media, biological fluids, free interfaces and other physical problems. Palabos has excellent MPI parallel performance and can complete large scale computation of computational fluid dynamics. To realize the heterogeneous parallelism of Palabos, we test the performance of Palabos on a large-scale simulation on a general-purpose cluster at first. The experimental results show that Palabos has a good performance in 16000 MPI processes. Then, we designed a CUDA parallel optimization algorithm for the case of cavity flow according to address mapping and shared memory optimization. Numerical experiments results show that the speedup ratio can achieve about a 1.5x acceleration ratio when the number of the grid is 512×512×512.