Yaning Wang , Yuchen Wu , Yadong Zeng , Maoqiang Jiang , Zhaohui Liu
{"title":"基于块结构自适应网格的浸入边界晶格玻尔兹曼方法在cpu / gpu上模拟微粒流","authors":"Yaning Wang , Yuchen Wu , Yadong Zeng , Maoqiang Jiang , Zhaohui Liu","doi":"10.1016/j.cpc.2025.109674","DOIUrl":null,"url":null,"abstract":"<div><div>We developed a highly efficient CPUs/GPUs solver for the fully resolved simulation of particle-laden flows by combining the lattice Boltzmann method (LBM) for fluid and the immersed boundary method (IBM) for fluid-structure interaction into the framework of adaptive mesh refinement (AMR). The cell-centered finite volume LBM method is adopted to guarantee the mass conservation. The boundary thickening direct force-immersed boundary method is used to capture the surface of solids to satisfy the no-slip and no-permeability boundary conditions while retaining computational simplicity. The AMR algorithm is implemented on the open-source framework <em>AMReX</em>, where the solids are placed only on the finest level, and different levels advance the solutions with varying steps of time, greatly reducing the computational cost and improving stability. The developed solver achieves 97.95% weak scaling efficiency on up to 128 GPUs by optimization and specific selection of BoxSize. An 18.2-fold speedup is obtained in the case of 3-level AMR meshes, compared to that on a uniform mesh, while reducing memory usage by 97.4%. Several classic cases also validated the solver, including flow past a two-dimensional fixed/oscillating circle cylinder, flow past a three-dimensional fixed sphere, and particles freely settling with stable and unstable patterns. The qualitative and quantitative results show that this highly efficient solver has good accuracy, robustness, scalability and extensibility for the complex particle-laden flows conventionally with heavy computing cost.</div></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"314 ","pages":"Article 109674"},"PeriodicalIF":3.4000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An immersed boundary lattice Boltzmann method on block-structured adaptive grids for the simulation of particle-laden flows on CPUs/GPUs\",\"authors\":\"Yaning Wang , Yuchen Wu , Yadong Zeng , Maoqiang Jiang , Zhaohui Liu\",\"doi\":\"10.1016/j.cpc.2025.109674\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We developed a highly efficient CPUs/GPUs solver for the fully resolved simulation of particle-laden flows by combining the lattice Boltzmann method (LBM) for fluid and the immersed boundary method (IBM) for fluid-structure interaction into the framework of adaptive mesh refinement (AMR). The cell-centered finite volume LBM method is adopted to guarantee the mass conservation. The boundary thickening direct force-immersed boundary method is used to capture the surface of solids to satisfy the no-slip and no-permeability boundary conditions while retaining computational simplicity. The AMR algorithm is implemented on the open-source framework <em>AMReX</em>, where the solids are placed only on the finest level, and different levels advance the solutions with varying steps of time, greatly reducing the computational cost and improving stability. The developed solver achieves 97.95% weak scaling efficiency on up to 128 GPUs by optimization and specific selection of BoxSize. An 18.2-fold speedup is obtained in the case of 3-level AMR meshes, compared to that on a uniform mesh, while reducing memory usage by 97.4%. Several classic cases also validated the solver, including flow past a two-dimensional fixed/oscillating circle cylinder, flow past a three-dimensional fixed sphere, and particles freely settling with stable and unstable patterns. The qualitative and quantitative results show that this highly efficient solver has good accuracy, robustness, scalability and extensibility for the complex particle-laden flows conventionally with heavy computing cost.</div></div>\",\"PeriodicalId\":285,\"journal\":{\"name\":\"Computer Physics Communications\",\"volume\":\"314 \",\"pages\":\"Article 109674\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Physics Communications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010465525001766\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Physics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010465525001766","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
An immersed boundary lattice Boltzmann method on block-structured adaptive grids for the simulation of particle-laden flows on CPUs/GPUs
We developed a highly efficient CPUs/GPUs solver for the fully resolved simulation of particle-laden flows by combining the lattice Boltzmann method (LBM) for fluid and the immersed boundary method (IBM) for fluid-structure interaction into the framework of adaptive mesh refinement (AMR). The cell-centered finite volume LBM method is adopted to guarantee the mass conservation. The boundary thickening direct force-immersed boundary method is used to capture the surface of solids to satisfy the no-slip and no-permeability boundary conditions while retaining computational simplicity. The AMR algorithm is implemented on the open-source framework AMReX, where the solids are placed only on the finest level, and different levels advance the solutions with varying steps of time, greatly reducing the computational cost and improving stability. The developed solver achieves 97.95% weak scaling efficiency on up to 128 GPUs by optimization and specific selection of BoxSize. An 18.2-fold speedup is obtained in the case of 3-level AMR meshes, compared to that on a uniform mesh, while reducing memory usage by 97.4%. Several classic cases also validated the solver, including flow past a two-dimensional fixed/oscillating circle cylinder, flow past a three-dimensional fixed sphere, and particles freely settling with stable and unstable patterns. The qualitative and quantitative results show that this highly efficient solver has good accuracy, robustness, scalability and extensibility for the complex particle-laden flows conventionally with heavy computing cost.
期刊介绍:
The focus of CPC is on contemporary computational methods and techniques and their implementation, the effectiveness of which will normally be evidenced by the author(s) within the context of a substantive problem in physics. Within this setting CPC publishes two types of paper.
Computer Programs in Physics (CPiP)
These papers describe significant computer programs to be archived in the CPC Program Library which is held in the Mendeley Data repository. The submitted software must be covered by an approved open source licence. Papers and associated computer programs that address a problem of contemporary interest in physics that cannot be solved by current software are particularly encouraged.
Computational Physics Papers (CP)
These are research papers in, but are not limited to, the following themes across computational physics and related disciplines.
mathematical and numerical methods and algorithms;
computational models including those associated with the design, control and analysis of experiments; and
algebraic computation.
Each will normally include software implementation and performance details. The software implementation should, ideally, be available via GitHub, Zenodo or an institutional repository.In addition, research papers on the impact of advanced computer architecture and special purpose computers on computing in the physical sciences and software topics related to, and of importance in, the physical sciences may be considered.