Transformations establishing equivalence across neural networks: When have two networks learned the same task?

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS

ACS Applied Bio Materials Pub Date : 2024-07-01 DOI:10.1063/5.0206406

Tom Bertalan, Felix Dietrich, Ioannis G Kevrekidis

{"title":"Transformations establishing equivalence across neural networks: When have two networks learned the same task?","authors":"Tom Bertalan, Felix Dietrich, Ioannis G Kevrekidis","doi":"10.1063/5.0206406","DOIUrl":null,"url":null,"abstract":"<p><p>Transformations are a key tool in the qualitative study of dynamical systems: transformations to a normal form, for example, underpin the study of instabilities and bifurcations. In this work, we test, and when possible establish, an equivalence between two different artificial neural networks by attempting to construct a data-driven transformation between them, using diffusion maps with a Mahalanobis-like metric. If the construction succeeds, the two networks can be thought of as belonging to the same equivalence class. We first discuss transformation functions between only the outputs of the two networks; we then also consider transformations that take into account outputs (activations) of a number of internal neurons from each network. Whitney's theorem dictates the number of (generic) measurements from one of the networks required to reconstruct each and every feature of the second network. The construction of the transformation function relies on a consistent, intrinsic representation of the network input space. We illustrate our algorithm by matching neural network pairs trained to learn (a) observations of scalar functions, (b) observations of two-dimensional vector fields, and (c) representations of images of a moving three-dimensional object (a rotating horse). We also demonstrate reconstruction of a network's input (and output) from minimal partial observations of intermediate neuron activations. The construction of equivalences across different network instantiations clearly relates to transfer learning and will also be valuable in establishing equivalence between different machine learning-based tools.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1063/5.0206406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}

引用次数: 0

Abstract

Transformations are a key tool in the qualitative study of dynamical systems: transformations to a normal form, for example, underpin the study of instabilities and bifurcations. In this work, we test, and when possible establish, an equivalence between two different artificial neural networks by attempting to construct a data-driven transformation between them, using diffusion maps with a Mahalanobis-like metric. If the construction succeeds, the two networks can be thought of as belonging to the same equivalence class. We first discuss transformation functions between only the outputs of the two networks; we then also consider transformations that take into account outputs (activations) of a number of internal neurons from each network. Whitney's theorem dictates the number of (generic) measurements from one of the networks required to reconstruct each and every feature of the second network. The construction of the transformation function relies on a consistent, intrinsic representation of the network input space. We illustrate our algorithm by matching neural network pairs trained to learn (a) observations of scalar functions, (b) observations of two-dimensional vector fields, and (c) representations of images of a moving three-dimensional object (a rotating horse). We also demonstrate reconstruction of a network's input (and output) from minimal partial observations of intermediate neuron activations. The construction of equivalences across different network instantiations clearly relates to transfer learning and will also be valuable in establishing equivalence between different machine learning-based tools.

查看原文本刊更多论文

建立神经网络间等效性的转换：两个网络何时学习了相同的任务？

变换是动态系统定性研究的重要工具：例如，向正常形式的变换是不稳定性和分岔研究的基础。在这项研究中，我们利用具有类似马哈拉诺比斯度量的扩散图，尝试在两个不同的人工神经网络之间构建数据驱动的变换，从而测试并在可能的情况下建立它们之间的等价性。如果构建成功，则可以认为这两个网络属于同一等价类。我们首先讨论的是两个网络输出之间的转换函数；然后，我们还考虑了将每个网络的一些内部神经元的输出（激活）考虑在内的转换。惠特尼定理规定了重建第二个网络的每个特征所需的其中一个网络的（一般）测量值的数量。转换函数的构建依赖于网络输入空间一致的内在表示。我们通过匹配经过训练的神经网络对来说明我们的算法：(a) 标量函数的观察结果；(b) 二维向量场的观察结果；(c) 运动的三维物体（一匹旋转的马）的图像表示。我们还演示了通过对中间神经元激活的最小部分观察重建网络的输入（和输出）。构建不同网络实例之间的等价关系显然与迁移学习有关，而且对于建立基于机器学习的不同工具之间的等价关系也很有价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACS Applied Bio Materials Chemistry-Chemistry (all)

CiteScore

9.40

自引率

2.10%

发文量

464