An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity

Wei Liu, Xin Liu, Xiaojun Chen
{"title":"An Inexact Augmented Lagrangian Algorithm for Training Leaky ReLU Neural Network with Group Sparsity","authors":"Wei Liu, Xin Liu, Xiaojun Chen","doi":"10.48550/arXiv.2205.05428","DOIUrl":null,"url":null,"abstract":"The leaky ReLU network with a group sparse regularization term has been widely used in the recent years. However, training such a network yields a nonsmooth nonconvex optimization problem and there exists a lack of approaches to compute a stationary point deterministically. In this paper, we first resolve the multi-layer composite term in the original optimization problem by introducing auxiliary variables and additional constraints. We show the new model has a nonempty and bounded solution set and its feasible set satisfies the Mangasarian-Fromovitz constraint qualification. Moreover, we show the relationship between the new model and the original problem. Remarkably, we propose an inexact augmented Lagrangian algorithm for solving the new model and show the convergence of the algorithm to a KKT point. Numerical experiments demonstrate that our algorithm is more efficient for training sparse leaky ReLU neural networks than some well-known algorithms.","PeriodicalId":14794,"journal":{"name":"J. Mach. Learn. Res.","volume":"21 1","pages":"212:1-212:43"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Mach. Learn. Res.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2205.05428","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

The leaky ReLU network with a group sparse regularization term has been widely used in recent years. However, training such a network yields a nonsmooth nonconvex optimization problem, and approaches that compute a stationary point deterministically are lacking. In this paper, we first resolve the multi-layer composite term in the original optimization problem by introducing auxiliary variables and additional constraints. We show that the new model has a nonempty and bounded solution set and that its feasible set satisfies the Mangasarian-Fromovitz constraint qualification. Moreover, we establish the relationship between the new model and the original problem. We then propose an inexact augmented Lagrangian algorithm for solving the new model and show that the algorithm converges to a KKT point. Numerical experiments demonstrate that our algorithm is more efficient for training sparse leaky ReLU neural networks than several well-known algorithms.
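To make the setting concrete, the following is a minimal sketch of the kind of formulation the abstract describes. The squared loss, the layerwise grouping $\mathcal{G}_l$, and the ALM update rule below are standard placeholder choices, not necessarily the paper's exact ones. Training an $L$-layer leaky ReLU network with a group sparse penalty can be written as

\[
\min_{\{W_l,\,b_l\}_{l=1}^{L}} \ \frac{1}{N}\sum_{i=1}^{N} \bigl\|u_i^{L} - y_i\bigr\|_2^2 \;+\; \lambda \sum_{l=1}^{L}\,\sum_{g \in \mathcal{G}_l} \bigl\|[W_l]_g\bigr\|_2,
\]

where $\sigma(t)=\max(t,\alpha t)$ with $\alpha\in(0,1)$ acts entrywise and the layer outputs are the nested compositions $u_i^{0}=x_i$, $u_i^{l}=\sigma(W_l u_i^{l-1}+b_l)$ for $l=1,\dots,L$. Treating each $u_i^{l}$ as an auxiliary variable and each recursion as an explicit constraint removes the deep composition from the objective and yields a problem of the form $\min_x f(x)$ subject to $c(x)=0$, whose augmented Lagrangian is

\[
\mathcal{L}_\beta(x,\Lambda) \;=\; f(x) \;+\; \langle \Lambda,\, c(x)\rangle \;+\; \frac{\beta}{2}\,\|c(x)\|_2^2.
\]

A generic inexact ALM minimizes $\mathcal{L}_{\beta_k}(\cdot,\Lambda^k)$ only approximately at each outer iteration (to a tolerance $\varepsilon_k \to 0$), updates $\Lambda^{k+1}=\Lambda^k+\beta_k\,c(x^{k+1})$, and increases $\beta_k$ when the constraint violation has not decreased sufficiently. The benefit of the splitting is that the nonsmooth multi-layer composition becomes a set of explicit constraints; a feasible set satisfying the Mangasarian-Fromovitz constraint qualification is what makes the KKT conditions well-posed and a convergence guarantee to a KKT point possible.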