复杂纤维增强超弹性心脏模型的全gpu加速、无基质浸入边界法

IF 7.3 1区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY
Pengfei Ma , Li Cai , Xuan Wang , Hao Gao
{"title":"复杂纤维增强超弹性心脏模型的全gpu加速、无基质浸入边界法","authors":"Pengfei Ma ,&nbsp;Li Cai ,&nbsp;Xuan Wang ,&nbsp;Hao Gao","doi":"10.1016/j.cma.2025.118353","DOIUrl":null,"url":null,"abstract":"<div><div>The immersed boundary (IB) method has become a leading approach in cardiac fluid-structure interaction (FSI) modeling due to its ability to handle large deformations and complex geometries without requiring mesh regeneration. However, the use of nonlinear, fiber-reinforced hyperelastic materials for modeling soft cardiac tissues introduces challenges in computational efficiency, particularly due to the additional projection steps required for stability in the IB framework. These steps often involve sparse matrix storage and computation, which can degrade GPU performance. In this work, we present a fully GPU-accelerated, matrix-free IB method for FSI in anatomically realistic cardiac models, which novelly integrates established components into a unified, GPU-optimized system. By employing nodal coupling, our method eliminates the need for projection operations in the finite element space. Additionally, we solve the Navier-Stokes equations using Chorin’s projection method combined with a matrix-free geometric multigrid solver, ensuring the entire FSI algorithm remains matrix-free and highly compatible with GPU acceleration. Our implementation features several GPU-specific optimizations, including the use of constant memory to store values of nodal basis functions and their derivatives at quadrature points, and texture memory to efficiently implement the semi-Lagrangian discretization of convection terms. These innovations maximize GPU utilization while preserving the complex mechanical behavior of soft cardiac tissue. Benchmark tests demonstrate that our GPU-accelerated solver achieves a <span><math><mrow><mn>50</mn><mo>×</mo><mspace></mspace><mo>−</mo><mspace></mspace><mn>100</mn><mo>×</mo></mrow></math></span> speedup compared to a 20-core CPU implementation, with comparable accuracy. Critically, this performance enables clinically viable cardiac valve FSI simulations to be completed within a few hours on a single consumer-grade GPU-an achievement that was previously infeasible using traditional CPU-based frameworks.</div></div>","PeriodicalId":55222,"journal":{"name":"Computer Methods in Applied Mechanics and Engineering","volume":"447 ","pages":"Article 118353"},"PeriodicalIF":7.3000,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fully GPU-accelerated, matrix-free immersed boundary method for complex fiber-reinforced hyperelastic cardiac models\",\"authors\":\"Pengfei Ma ,&nbsp;Li Cai ,&nbsp;Xuan Wang ,&nbsp;Hao Gao\",\"doi\":\"10.1016/j.cma.2025.118353\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The immersed boundary (IB) method has become a leading approach in cardiac fluid-structure interaction (FSI) modeling due to its ability to handle large deformations and complex geometries without requiring mesh regeneration. However, the use of nonlinear, fiber-reinforced hyperelastic materials for modeling soft cardiac tissues introduces challenges in computational efficiency, particularly due to the additional projection steps required for stability in the IB framework. These steps often involve sparse matrix storage and computation, which can degrade GPU performance. In this work, we present a fully GPU-accelerated, matrix-free IB method for FSI in anatomically realistic cardiac models, which novelly integrates established components into a unified, GPU-optimized system. By employing nodal coupling, our method eliminates the need for projection operations in the finite element space. Additionally, we solve the Navier-Stokes equations using Chorin’s projection method combined with a matrix-free geometric multigrid solver, ensuring the entire FSI algorithm remains matrix-free and highly compatible with GPU acceleration. Our implementation features several GPU-specific optimizations, including the use of constant memory to store values of nodal basis functions and their derivatives at quadrature points, and texture memory to efficiently implement the semi-Lagrangian discretization of convection terms. These innovations maximize GPU utilization while preserving the complex mechanical behavior of soft cardiac tissue. Benchmark tests demonstrate that our GPU-accelerated solver achieves a <span><math><mrow><mn>50</mn><mo>×</mo><mspace></mspace><mo>−</mo><mspace></mspace><mn>100</mn><mo>×</mo></mrow></math></span> speedup compared to a 20-core CPU implementation, with comparable accuracy. Critically, this performance enables clinically viable cardiac valve FSI simulations to be completed within a few hours on a single consumer-grade GPU-an achievement that was previously infeasible using traditional CPU-based frameworks.</div></div>\",\"PeriodicalId\":55222,\"journal\":{\"name\":\"Computer Methods in Applied Mechanics and Engineering\",\"volume\":\"447 \",\"pages\":\"Article 118353\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Methods in Applied Mechanics and Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045782525006255\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Applied Mechanics and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045782525006255","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

浸入边界(IB)方法已成为心脏流体-结构相互作用(FSI)建模的主要方法,因为它能够处理大变形和复杂的几何形状,而不需要网格再生。然而,使用非线性、纤维增强的超弹性材料对软心脏组织建模带来了计算效率方面的挑战,特别是由于在IB框架中需要额外的投影步骤来保持稳定性。这些步骤通常涉及稀疏矩阵存储和计算,这可能会降低GPU的性能。在这项工作中,我们提出了一种完全gpu加速的无基质IB方法,用于解剖逼真的心脏模型的FSI,该方法将已建立的组件新颖地集成到一个统一的gpu优化系统中。通过采用节点耦合,我们的方法消除了在有限元空间中投影运算的需要。此外,我们使用Chorin的投影方法结合无矩阵几何多网格求解器来求解Navier-Stokes方程,确保整个FSI算法保持无矩阵并且与GPU加速高度兼容。我们的实现具有几个特定gpu的优化,包括使用恒定存储器来存储节点基函数及其在正交点的导数的值,以及纹理存储器来有效地实现对流项的半拉格朗日离散化。这些创新最大限度地提高了GPU的利用率,同时保留了心脏软组织的复杂力学行为。基准测试表明,与20核CPU实现相比,我们的gpu加速求解器实现了50 - 100倍的加速,并且具有相当的精度。关键的是,这种性能使得临床可行的心脏瓣膜FSI模拟可以在几个小时内在单个消费级gpu上完成,这一成就以前使用传统的基于cpu的框架是不可行的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fully GPU-accelerated, matrix-free immersed boundary method for complex fiber-reinforced hyperelastic cardiac models
The immersed boundary (IB) method has become a leading approach in cardiac fluid-structure interaction (FSI) modeling due to its ability to handle large deformations and complex geometries without requiring mesh regeneration. However, the use of nonlinear, fiber-reinforced hyperelastic materials for modeling soft cardiac tissues introduces challenges in computational efficiency, particularly due to the additional projection steps required for stability in the IB framework. These steps often involve sparse matrix storage and computation, which can degrade GPU performance. In this work, we present a fully GPU-accelerated, matrix-free IB method for FSI in anatomically realistic cardiac models, which novelly integrates established components into a unified, GPU-optimized system. By employing nodal coupling, our method eliminates the need for projection operations in the finite element space. Additionally, we solve the Navier-Stokes equations using Chorin’s projection method combined with a matrix-free geometric multigrid solver, ensuring the entire FSI algorithm remains matrix-free and highly compatible with GPU acceleration. Our implementation features several GPU-specific optimizations, including the use of constant memory to store values of nodal basis functions and their derivatives at quadrature points, and texture memory to efficiently implement the semi-Lagrangian discretization of convection terms. These innovations maximize GPU utilization while preserving the complex mechanical behavior of soft cardiac tissue. Benchmark tests demonstrate that our GPU-accelerated solver achieves a 50×100× speedup compared to a 20-core CPU implementation, with comparable accuracy. Critically, this performance enables clinically viable cardiac valve FSI simulations to be completed within a few hours on a single consumer-grade GPU-an achievement that was previously infeasible using traditional CPU-based frameworks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
12.70
自引率
15.30%
发文量
719
审稿时长
44 days
期刊介绍: Computer Methods in Applied Mechanics and Engineering stands as a cornerstone in the realm of computational science and engineering. With a history spanning over five decades, the journal has been a key platform for disseminating papers on advanced mathematical modeling and numerical solutions. Interdisciplinary in nature, these contributions encompass mechanics, mathematics, computer science, and various scientific disciplines. The journal welcomes a broad range of computational methods addressing the simulation, analysis, and design of complex physical problems, making it a vital resource for researchers in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信