Understanding the role of autoencoders for stiff dynamical systems using information theory

IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Energy and AI Pub Date : 2025-07-29 DOI:10.1016/j.egyai.2025.100567

Vijayamanikandan Vijayarangan , Harshavardhana A. Uranakara , Francisco E. Hernández–Pérez , Hong G. Im

{"title":"Understanding the role of autoencoders for stiff dynamical systems using information theory","authors":"Vijayamanikandan Vijayarangan , Harshavardhana A. Uranakara , Francisco E. Hernández–Pérez , Hong G. Im","doi":"10.1016/j.egyai.2025.100567","DOIUrl":null,"url":null,"abstract":"<div><div>Using information theory, this study provides insights into how the construction of latent space of autoencoder (AE) using deep neural network (DNN) training finds a smooth (non-stiff) low-dimensional manifold in the stiff dynamical system. Our recent study (Vijayarangan et al. 2023) reported that an AE combined with neural ODE (NODE) as a surrogate reduced order model (ROM) for the integration of stiff chemically reacting systems led to a significant reduction in the temporal stiffness, and the behavior was attributed to the identification of a slow invariant manifold by the nonlinear projection using the AE. The present work offers a fundamental understanding of the mechanism of formation of a non-stiff latent space and stiffness reduction by employing concepts from information theory and better mixing. The learning mechanisms of both the encoder and the decoder are explained by plotting the evolution of mutual information and identifying two different phases. Subsequently, the density distribution is plotted for the physical and latent variables, which shows the transformation of the <em>rare event</em> in the physical space to a <em>highly likely</em> (more probable) event in the latent space provided by the nonlinear autoencoder. Finally, the nonlinear transformation leading to density redistribution is explained using concepts from information theory and probability.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"21 ","pages":"Article 100567"},"PeriodicalIF":9.6000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546825000990","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Using information theory, this study provides insights into how the construction of latent space of autoencoder (AE) using deep neural network (DNN) training finds a smooth (non-stiff) low-dimensional manifold in the stiff dynamical system. Our recent study (Vijayarangan et al. 2023) reported that an AE combined with neural ODE (NODE) as a surrogate reduced order model (ROM) for the integration of stiff chemically reacting systems led to a significant reduction in the temporal stiffness, and the behavior was attributed to the identification of a slow invariant manifold by the nonlinear projection using the AE. The present work offers a fundamental understanding of the mechanism of formation of a non-stiff latent space and stiffness reduction by employing concepts from information theory and better mixing. The learning mechanisms of both the encoder and the decoder are explained by plotting the evolution of mutual information and identifying two different phases. Subsequently, the density distribution is plotted for the physical and latent variables, which shows the transformation of the rare event in the physical space to a highly likely (more probable) event in the latent space provided by the nonlinear autoencoder. Finally, the nonlinear transformation leading to density redistribution is explained using concepts from information theory and probability.

Abstract Image

查看原文本刊更多论文

利用信息论理解自编码器在刚性动力系统中的作用

本文运用信息论的方法，探讨了利用深度神经网络（DNN）训练构建自编码器（AE）的潜在空间如何在刚性动力系统中找到光滑（非刚性）低维流形。我们最近的研究（Vijayarangan et al. 2023）报道，将AE与神经ODE （NODE）相结合作为替代降阶模型（ROM），用于整合刚性化学反应系统，导致时间刚度显著降低，这种行为归因于使用AE通过非线性投影识别慢不变流形。本研究通过运用信息论和更好的混合概念，对非刚性潜空间的形成机制和刚度降低提供了一个基本的理解。编码器和解码器的学习机制是通过绘制互信息的演变和识别两个不同的阶段来解释的。随后，绘制了物理变量和潜变量的密度分布，显示了由非线性自编码器提供的物理空间中的罕见事件向潜空间中的高可能（更可能）事件的转换。最后，利用信息论和概率论的概念解释了导致密度重分布的非线性变换。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊