多偏微分方程突变遗忘自由语境算子学习的神经组合小波神经算子

IF 3.4 2区物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computer Physics Communications Pub Date : 2025-10-01 DOI:10.1016/j.cpc.2025.109882

Tapas Tripura , Souvik Chakraborty

{"title":"多偏微分方程突变遗忘自由语境算子学习的神经组合小波神经算子","authors":"Tapas Tripura , Souvik Chakraborty","doi":"10.1016/j.cpc.2025.109882","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning has witnessed substantial growth in recent years, leading to the development of advanced deep learning models crafted to address a wide range of real-world challenges spanning various domains, including the acceleration of scientific computing. Contemporary deep learning approaches to solving partial differential equations (PDEs) involve approximating either the function mapping of a specific problem or the solution operators of a pre-defined physical system. Consequently, solving multiple PDEs representing a variety of physical systems requires training of multiple deep learning models. The creation of physics-specific models from scratch for each new physical system remains a resource-intensive undertaking, demanding considerable (i) computational time, (ii) memory resources, (iii) energy, (iv) intensive physics-specific manual tuning, and (v) large problem-specific training datasets. A more generalized machine learning-enhanced computational approach would be to learn a single unified deep learning model (commonly defined as the foundation model) instead of training multiple solvers from scratch. Besides accelerating computational simulations, such unified models will address all the above challenges. In this study, we introduce the Neural Combinatorial Wavelet Neural Operator (NCWNO) as a foundational model for scientific computing. The NCWNO leverages a gated structure that employs local wavelet integral blocks to acquire shared features across multiple physical systems, complemented by a memory-based ensembling approach among these local wavelet experts. The proposed NCWNO offers two key advantages: (i) it can simultaneously learn solution operators for multiple parametric PDEs, and (ii) with pre-training, it can be fine-tuned to new parametric PDEs with reduced training datasets and time. The proposed NCWNO is the first kernel-based foundational operator learning algorithm distinguished by its (i) integral-kernel-based learning structure, (ii) robustness against catastrophic forgetting of old PDEs, and (iii) the facilitation of knowledge transfer across dissimilar physical systems. Through an extensive set of benchmark examples, we demonstrate that the NCWNO can outperform existing multiphysics and task-specific baseline operator learning frameworks.</div></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"318 ","pages":"Article 109882"},"PeriodicalIF":3.4000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Neural combinatorial wavelet neural operator for catastrophic forgetting free in-context operator learning of multiple partial differential equations\",\"authors\":\"Tapas Tripura , Souvik Chakraborty\",\"doi\":\"10.1016/j.cpc.2025.109882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning has witnessed substantial growth in recent years, leading to the development of advanced deep learning models crafted to address a wide range of real-world challenges spanning various domains, including the acceleration of scientific computing. Contemporary deep learning approaches to solving partial differential equations (PDEs) involve approximating either the function mapping of a specific problem or the solution operators of a pre-defined physical system. Consequently, solving multiple PDEs representing a variety of physical systems requires training of multiple deep learning models. The creation of physics-specific models from scratch for each new physical system remains a resource-intensive undertaking, demanding considerable (i) computational time, (ii) memory resources, (iii) energy, (iv) intensive physics-specific manual tuning, and (v) large problem-specific training datasets. A more generalized machine learning-enhanced computational approach would be to learn a single unified deep learning model (commonly defined as the foundation model) instead of training multiple solvers from scratch. Besides accelerating computational simulations, such unified models will address all the above challenges. In this study, we introduce the Neural Combinatorial Wavelet Neural Operator (NCWNO) as a foundational model for scientific computing. The NCWNO leverages a gated structure that employs local wavelet integral blocks to acquire shared features across multiple physical systems, complemented by a memory-based ensembling approach among these local wavelet experts. The proposed NCWNO offers two key advantages: (i) it can simultaneously learn solution operators for multiple parametric PDEs, and (ii) with pre-training, it can be fine-tuned to new parametric PDEs with reduced training datasets and time. The proposed NCWNO is the first kernel-based foundational operator learning algorithm distinguished by its (i) integral-kernel-based learning structure, (ii) robustness against catastrophic forgetting of old PDEs, and (iii) the facilitation of knowledge transfer across dissimilar physical systems. Through an extensive set of benchmark examples, we demonstrate that the NCWNO can outperform existing multiphysics and task-specific baseline operator learning frameworks.</div></div>\",\"PeriodicalId\":285,\"journal\":{\"name\":\"Computer Physics Communications\",\"volume\":\"318 \",\"pages\":\"Article 109882\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Physics Communications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010465525003844\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Physics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010465525003844","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

近年来，机器学习取得了长足的发展，导致了先进深度学习模型的发展，这些模型旨在解决跨越各个领域的各种现实挑战，包括科学计算的加速。当代解决偏微分方程（PDEs）的深度学习方法涉及近似特定问题的函数映射或预定义物理系统的解算子。因此，求解表示各种物理系统的多个偏微分方程需要训练多个深度学习模型。为每个新的物理系统从零开始创建特定于物理的模型仍然是一项资源密集型的工作，需要大量的(i)计算时间，（ii）内存资源，（iii）能量，（iv）密集的特定于物理的手动调优，以及(v)大型问题特定的训练数据集。一种更广义的机器学习增强计算方法是学习一个统一的深度学习模型（通常定义为基础模型），而不是从头开始训练多个求解器。除了加速计算模拟外，这种统一模型将解决上述所有挑战。在本研究中，我们引入神经组合小波神经算子（NCWNO）作为科学计算的基础模型。NCWNO利用门控结构，利用局部小波积分块获取多个物理系统的共享特征，并辅以这些局部小波专家之间基于记忆的集成方法。所提出的NCWNO具有两个关键优势：(i)它可以同时学习多个参数偏微分方程的解算子；（ii）通过预训练，它可以通过减少训练数据集和时间来微调到新的参数偏微分方程。提出的NCWNO是第一个基于核的基础算子学习算法，其特点是(i)基于积分核的学习结构，（ii）对旧偏微分方程的灾难性遗忘的鲁棒性，以及（iii）促进不同物理系统之间的知识转移。通过一组广泛的基准示例，我们证明NCWNO可以优于现有的多物理场和任务特定基线算子学习框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Neural combinatorial wavelet neural operator for catastrophic forgetting free in-context operator learning of multiple partial differential equations

Machine learning has witnessed substantial growth in recent years, leading to the development of advanced deep learning models crafted to address a wide range of real-world challenges spanning various domains, including the acceleration of scientific computing. Contemporary deep learning approaches to solving partial differential equations (PDEs) involve approximating either the function mapping of a specific problem or the solution operators of a pre-defined physical system. Consequently, solving multiple PDEs representing a variety of physical systems requires training of multiple deep learning models. The creation of physics-specific models from scratch for each new physical system remains a resource-intensive undertaking, demanding considerable (i) computational time, (ii) memory resources, (iii) energy, (iv) intensive physics-specific manual tuning, and (v) large problem-specific training datasets. A more generalized machine learning-enhanced computational approach would be to learn a single unified deep learning model (commonly defined as the foundation model) instead of training multiple solvers from scratch. Besides accelerating computational simulations, such unified models will address all the above challenges. In this study, we introduce the Neural Combinatorial Wavelet Neural Operator (NCWNO) as a foundational model for scientific computing. The NCWNO leverages a gated structure that employs local wavelet integral blocks to acquire shared features across multiple physical systems, complemented by a memory-based ensembling approach among these local wavelet experts. The proposed NCWNO offers two key advantages: (i) it can simultaneously learn solution operators for multiple parametric PDEs, and (ii) with pre-training, it can be fine-tuned to new parametric PDEs with reduced training datasets and time. The proposed NCWNO is the first kernel-based foundational operator learning algorithm distinguished by its (i) integral-kernel-based learning structure, (ii) robustness against catastrophic forgetting of old PDEs, and (iii) the facilitation of knowledge transfer across dissimilar physical systems. Through an extensive set of benchmark examples, we demonstrate that the NCWNO can outperform existing multiphysics and task-specific baseline operator learning frameworks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Physics Communications 物理-计算机：跨学科应用

CiteScore

12.10

自引率

3.20%

发文量

287

审稿时长

5.3 months

期刊介绍： The focus of CPC is on contemporary computational methods and techniques and their implementation, the effectiveness of which will normally be evidenced by the author(s) within the context of a substantive problem in physics. Within this setting CPC publishes two types of paper. Computer Programs in Physics (CPiP) These papers describe significant computer programs to be archived in the CPC Program Library which is held in the Mendeley Data repository. The submitted software must be covered by an approved open source licence. Papers and associated computer programs that address a problem of contemporary interest in physics that cannot be solved by current software are particularly encouraged. Computational Physics Papers (CP) These are research papers in, but are not limited to, the following themes across computational physics and related disciplines. mathematical and numerical methods and algorithms; computational models including those associated with the design, control and analysis of experiments; and algebraic computation. Each will normally include software implementation and performance details. The software implementation should, ideally, be available via GitHub, Zenodo or an institutional repository.In addition, research papers on the impact of advanced computer architecture and special purpose computers on computing in the physical sciences and software topics related to, and of importance in, the physical sciences may be considered.