Mixed-depth physics-informed neural network with nested activation mechanism in solving partial differential equations

IF 7.3 1区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY
Tianhao Wang , Guirong Liu , Eric Li , Xu Xu
{"title":"Mixed-depth physics-informed neural network with nested activation mechanism in solving partial differential equations","authors":"Tianhao Wang ,&nbsp;Guirong Liu ,&nbsp;Eric Li ,&nbsp;Xu Xu","doi":"10.1016/j.cma.2025.118356","DOIUrl":null,"url":null,"abstract":"<div><div>Physics-informed neural networks (PINNs) have become promising tools for solving complex partial differential equations (PDEs), but traditional PINNs suffered from slow convergence, vanishing gradients, and poor handling of local physical features. This paper proposes a mixed-depth physics-informed neural network (<em><span>md</span></em>-PINN) for solving the complex PDEs, aiming to improve the efficiency of network structure and activation function. The contributions are two aspects: (1) the <em><span>md</span></em>-PINN includes the various mixed-depth blocks, each of which contains parallel connected deep sub-network and shallow sub-network. The deep sub-network captures complex physical features, ensuring a comprehensive understanding of the system; while the shallow sub-network focuses on the basic physical features, facilitating the stable training; (2) the <em><span>md</span></em>-PINN introduces a new <em><span>nest-tanh</span></em>(.) activation functions with nested mechanism in shallow sub-networks to enable efficient extraction of complex features using fewer hidden layers, reducing reliance on deep networks. By incorporating mixed-depth structures, <em><span>md</span></em>-PINN enables more efficient information sharing across different layer, leading to faster convergence and improved training efficiency. Theoretical analysis demonstrates that <em><span>md</span></em>-PINN avoids suboptimal convergence with appropriate initialization. The proposed approach is validated across multiple PDEs, including heat transfer scenarios with complex boundaries, bi-material solid mechanical problems, Allen-Cahn equation, fluid dynamics, and the higher order Kuramoto-Sivashinsky equation. Results show that <em><span>md</span></em>-PINN exhibits the superior capabilities in approximating and capturing intricate system features. These findings underscore the computational efficiency and potential of <em><span>md</span></em>-PINN in tackling real-world and complex problems.</div></div>","PeriodicalId":55222,"journal":{"name":"Computer Methods in Applied Mechanics and Engineering","volume":"447 ","pages":"Article 118356"},"PeriodicalIF":7.3000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Applied Mechanics and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045782525006280","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Physics-informed neural networks (PINNs) have become promising tools for solving complex partial differential equations (PDEs), but traditional PINNs suffered from slow convergence, vanishing gradients, and poor handling of local physical features. This paper proposes a mixed-depth physics-informed neural network (md-PINN) for solving the complex PDEs, aiming to improve the efficiency of network structure and activation function. The contributions are two aspects: (1) the md-PINN includes the various mixed-depth blocks, each of which contains parallel connected deep sub-network and shallow sub-network. The deep sub-network captures complex physical features, ensuring a comprehensive understanding of the system; while the shallow sub-network focuses on the basic physical features, facilitating the stable training; (2) the md-PINN introduces a new nest-tanh(.) activation functions with nested mechanism in shallow sub-networks to enable efficient extraction of complex features using fewer hidden layers, reducing reliance on deep networks. By incorporating mixed-depth structures, md-PINN enables more efficient information sharing across different layer, leading to faster convergence and improved training efficiency. Theoretical analysis demonstrates that md-PINN avoids suboptimal convergence with appropriate initialization. The proposed approach is validated across multiple PDEs, including heat transfer scenarios with complex boundaries, bi-material solid mechanical problems, Allen-Cahn equation, fluid dynamics, and the higher order Kuramoto-Sivashinsky equation. Results show that md-PINN exhibits the superior capabilities in approximating and capturing intricate system features. These findings underscore the computational efficiency and potential of md-PINN in tackling real-world and complex problems.
具有嵌套激活机制的混合深度物理信息神经网络求解偏微分方程
物理信息神经网络(pinn)已经成为求解复杂偏微分方程(PDEs)的有前途的工具,但传统的pinn存在收敛缓慢、梯度消失以及对局部物理特征处理不力的问题。本文提出了一种混合深度物理信息神经网络(md-PINN)来求解复杂偏微分方程,旨在提高网络结构和激活函数的效率。贡献有两个方面:(1)md-PINN包含各种混合深度块,每个块包含并行连接的深子网和浅子网。深层子网捕获复杂的物理特征,确保对系统的全面理解;浅层子网络侧重于基本物理特征,便于稳定训练;(2) md-PINN在浅层子网络中引入了一种新的具有嵌套机制的nest-tanh(.)激活函数,可以使用更少的隐藏层高效地提取复杂特征,减少对深层网络的依赖。通过结合混合深度结构,md-PINN可以实现更高效的跨层信息共享,从而加快收敛速度,提高训练效率。理论分析表明,通过适当的初始化,md-PINN可以避免次优收敛。该方法在多个偏微分方程中得到了验证,包括具有复杂边界的传热场景、双材料固体力学问题、Allen-Cahn方程、流体动力学和高阶Kuramoto-Sivashinsky方程。结果表明,md-PINN在逼近和捕获复杂系统特征方面表现出优越的能力。这些发现强调了md-PINN在解决现实世界和复杂问题方面的计算效率和潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.70
自引率
15.30%
发文量
719
审稿时长
44 days
期刊介绍: Computer Methods in Applied Mechanics and Engineering stands as a cornerstone in the realm of computational science and engineering. With a history spanning over five decades, the journal has been a key platform for disseminating papers on advanced mathematical modeling and numerical solutions. Interdisciplinary in nature, these contributions encompass mechanics, mathematics, computer science, and various scientific disciplines. The journal welcomes a broad range of computational methods addressing the simulation, analysis, and design of complex physical problems, making it a vital resource for researchers in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信