Generalization error bounds for iterative recovery algorithms unfolded as neural networks

IF 1.6 4区数学 Q2 MATHEMATICS, APPLIED

Information and Inference-A Journal of the Ima Pub Date : 2023-04-27 DOI:10.1093/imaiai/iaad023

Ekkehard Schnoor, Arash Behboodi, Holger Rauhut

{"title":"Generalization error bounds for iterative recovery algorithms unfolded as neural networks","authors":"Ekkehard Schnoor, Arash Behboodi, Holger Rauhut","doi":"10.1093/imaiai/iaad023","DOIUrl":null,"url":null,"abstract":"Abstract Motivated by the learned iterative soft thresholding algorithm (LISTA), we introduce a general class of neural networks suitable for sparse reconstruction from few linear measurements. By allowing a wide range of degrees of weight-sharing between the flayers, we enable a unified analysis for very different neural network types, ranging from recurrent ones to networks more similar to standard feedforward neural networks. Based on training samples, via empirical risk minimization, we aim at learning the optimal network parameters and thereby the optimal network that reconstructs signals from their low-dimensional linear measurements. We derive generalization bounds by analyzing the Rademacher complexity of hypothesis classes consisting of such deep networks, that also take into account the thresholding parameters. We obtain estimates of the sample complexity that essentially depend only linearly on the number of parameters and on the depth. We apply our main result to obtain specific generalization bounds for several practical examples, including different algorithms for (implicit) dictionary learning, and convolutional neural networks.","PeriodicalId":45437,"journal":{"name":"Information and Inference-A Journal of the Ima","volume":"263 1","pages":"0"},"PeriodicalIF":1.6000,"publicationDate":"2023-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Inference-A Journal of the Ima","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/imaiai/iaad023","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Motivated by the learned iterative soft thresholding algorithm (LISTA), we introduce a general class of neural networks suitable for sparse reconstruction from few linear measurements. By allowing a wide range of degrees of weight-sharing between the flayers, we enable a unified analysis for very different neural network types, ranging from recurrent ones to networks more similar to standard feedforward neural networks. Based on training samples, via empirical risk minimization, we aim at learning the optimal network parameters and thereby the optimal network that reconstructs signals from their low-dimensional linear measurements. We derive generalization bounds by analyzing the Rademacher complexity of hypothesis classes consisting of such deep networks, that also take into account the thresholding parameters. We obtain estimates of the sample complexity that essentially depend only linearly on the number of parameters and on the depth. We apply our main result to obtain specific generalization bounds for several practical examples, including different algorithms for (implicit) dictionary learning, and convolutional neural networks.

查看原文本刊更多论文

迭代恢复算法的泛化误差边界以神经网络的形式展开

摘要:在学习迭代软阈值算法(LISTA)的激励下，我们引入了一类适用于从少量线性测量稀疏重建的神经网络。通过允许在剥层器之间广泛程度的权重共享，我们能够对非常不同的神经网络类型进行统一分析，从循环网络到更类似于标准前馈神经网络的网络。基于训练样本，通过经验风险最小化，我们的目标是学习最优网络参数，从而获得从低维线性测量中重建信号的最优网络。我们通过分析由这种深度网络组成的假设类的Rademacher复杂度来推导泛化边界，并且考虑了阈值参数。我们得到的样本复杂度的估计基本上只线性地依赖于参数的数量和深度。我们将我们的主要结果应用于几个实际示例，包括(隐式)字典学习和卷积神经网络的不同算法，以获得特定的泛化界限。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information and Inference-A Journal of the Ima Multiple-

CiteScore

3.90

自引率

0.00%

发文量