Jiachen Guo , Gino Domel , Chanwook Park , Hantao Zhang , Ozgur Can Gumus , Ye Lu , Gregory J. Wagner , Dong Qian , Jian Cao , Thomas J.R. Hughes , Wing Kam Liu
{"title":"基于张量分解的先验代理模型(TAPS)在超大规模模拟中的应用","authors":"Jiachen Guo , Gino Domel , Chanwook Park , Hantao Zhang , Ozgur Can Gumus , Ye Lu , Gregory J. Wagner , Dong Qian , Jian Cao , Thomas J.R. Hughes , Wing Kam Liu","doi":"10.1016/j.cma.2025.118101","DOIUrl":null,"url":null,"abstract":"<div><div>A data-free predictive scientific AI model, termed Tensor-decomposition-based A Priori Surrogate (TAPS), is proposed for tackling ultra large-scale engineering simulations with significant speedup, memory savings, and storage gain. TAPS does not require any training data and can effectively obtain surrogate models for high-dimensional parametric problems with equivalently zetta-scale (<span><math><mrow><mn>1</mn><msup><mrow><mn>0</mn></mrow><mrow><mn>21</mn></mrow></msup></mrow></math></span>) degrees of freedom (DoFs) using a single GPU. TAPS achieves this by directly obtaining reduced-order models through solving the weak form of the governing equations with multiple independent variables such as spatial coordinates, parameters, and time. The paper first introduces an AI-enhanced finite element-type interpolation function called convolution hierarchical deep-learning neural network (C-HiDeNN) with tensor decomposition (TD). Subsequently, the generalized space-parameter-time Galerkin weak form and the corresponding matrix form are derived. Through the choice of TAPS hyperparameters, different convergence rates can be achieved. To show the capabilities of this framework, TAPS is then used to simulate a large-scale additive manufacturing process and achieves around 1,370x speedup, 14.8x memory savings, and 955x storage gain compared to the finite difference method with 3.46 billion spatial DoFs. As a result, the TAPS framework opens a new avenue for many challenging ultra large-scale engineering problems, such as additive manufacturing and integrated circuit design, among others.</div></div>","PeriodicalId":55222,"journal":{"name":"Computer Methods in Applied Mechanics and Engineering","volume":"444 ","pages":"Article 118101"},"PeriodicalIF":6.9000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tensor-decomposition-based A Priori Surrogate (TAPS) modeling for ultra large-scale simulations\",\"authors\":\"Jiachen Guo , Gino Domel , Chanwook Park , Hantao Zhang , Ozgur Can Gumus , Ye Lu , Gregory J. Wagner , Dong Qian , Jian Cao , Thomas J.R. Hughes , Wing Kam Liu\",\"doi\":\"10.1016/j.cma.2025.118101\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A data-free predictive scientific AI model, termed Tensor-decomposition-based A Priori Surrogate (TAPS), is proposed for tackling ultra large-scale engineering simulations with significant speedup, memory savings, and storage gain. TAPS does not require any training data and can effectively obtain surrogate models for high-dimensional parametric problems with equivalently zetta-scale (<span><math><mrow><mn>1</mn><msup><mrow><mn>0</mn></mrow><mrow><mn>21</mn></mrow></msup></mrow></math></span>) degrees of freedom (DoFs) using a single GPU. TAPS achieves this by directly obtaining reduced-order models through solving the weak form of the governing equations with multiple independent variables such as spatial coordinates, parameters, and time. The paper first introduces an AI-enhanced finite element-type interpolation function called convolution hierarchical deep-learning neural network (C-HiDeNN) with tensor decomposition (TD). Subsequently, the generalized space-parameter-time Galerkin weak form and the corresponding matrix form are derived. Through the choice of TAPS hyperparameters, different convergence rates can be achieved. To show the capabilities of this framework, TAPS is then used to simulate a large-scale additive manufacturing process and achieves around 1,370x speedup, 14.8x memory savings, and 955x storage gain compared to the finite difference method with 3.46 billion spatial DoFs. As a result, the TAPS framework opens a new avenue for many challenging ultra large-scale engineering problems, such as additive manufacturing and integrated circuit design, among others.</div></div>\",\"PeriodicalId\":55222,\"journal\":{\"name\":\"Computer Methods in Applied Mechanics and Engineering\",\"volume\":\"444 \",\"pages\":\"Article 118101\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Methods in Applied Mechanics and Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0045782525003731\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Applied Mechanics and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045782525003731","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Tensor-decomposition-based A Priori Surrogate (TAPS) modeling for ultra large-scale simulations
A data-free predictive scientific AI model, termed Tensor-decomposition-based A Priori Surrogate (TAPS), is proposed for tackling ultra large-scale engineering simulations with significant speedup, memory savings, and storage gain. TAPS does not require any training data and can effectively obtain surrogate models for high-dimensional parametric problems with equivalently zetta-scale () degrees of freedom (DoFs) using a single GPU. TAPS achieves this by directly obtaining reduced-order models through solving the weak form of the governing equations with multiple independent variables such as spatial coordinates, parameters, and time. The paper first introduces an AI-enhanced finite element-type interpolation function called convolution hierarchical deep-learning neural network (C-HiDeNN) with tensor decomposition (TD). Subsequently, the generalized space-parameter-time Galerkin weak form and the corresponding matrix form are derived. Through the choice of TAPS hyperparameters, different convergence rates can be achieved. To show the capabilities of this framework, TAPS is then used to simulate a large-scale additive manufacturing process and achieves around 1,370x speedup, 14.8x memory savings, and 955x storage gain compared to the finite difference method with 3.46 billion spatial DoFs. As a result, the TAPS framework opens a new avenue for many challenging ultra large-scale engineering problems, such as additive manufacturing and integrated circuit design, among others.
期刊介绍:
Computer Methods in Applied Mechanics and Engineering stands as a cornerstone in the realm of computational science and engineering. With a history spanning over five decades, the journal has been a key platform for disseminating papers on advanced mathematical modeling and numerical solutions. Interdisciplinary in nature, these contributions encompass mechanics, mathematics, computer science, and various scientific disciplines. The journal welcomes a broad range of computational methods addressing the simulation, analysis, and design of complex physical problems, making it a vital resource for researchers in the field.