{"title":"Regularized Nesterov’s accelerated damped BFGS method for stochastic optimization","authors":"Siwakon Suppalap , Dawrawee Makmuang , Vipavee Damminsed , Rabian Wangkeeree","doi":"10.1016/j.cam.2025.116616","DOIUrl":null,"url":null,"abstract":"<div><div>A regularization term is introduced into the approximate Hessian update in the stochastic Broyden–Fletcher–Goldfarb–Shanno (BFGS) method for convex stochastic optimization problems to help avoid near-singularity issues. Additionally, Nesterov acceleration, with a momentum coefficient that dynamically adjusts between a constant value and zero based on the objective function, has been incorporated to enhance convergence speed. However, the inflexibility of the constant momentum coefficient still may lead to overshooting problems, and evaluating objective functions on large datasets is computationally costly. Moreover, this approach presents challenges in solving nonconvex optimization problems. To address these challenges, we propose a regularized stochastic BFGS method that integrates Nesterov acceleration with an adaptive momentum coefficient designed for solving nonconvex stochastic optimization problems. This coefficient adjusts flexibly between a decreasing value and zero based on selected dataset samples, helping to avoid overshooting problems and reduce computational costs. We demonstrated almost sure convergence to stationary points and analyze the complexity. Numerical results on convex and nonconvex classification problems using a support vector machine show that our method outperforms existing approaches.</div></div>","PeriodicalId":50226,"journal":{"name":"Journal of Computational and Applied Mathematics","volume":"467 ","pages":"Article 116616"},"PeriodicalIF":2.1000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational and Applied Mathematics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0377042725001311","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
A regularization term is introduced into the approximate Hessian update in the stochastic Broyden–Fletcher–Goldfarb–Shanno (BFGS) method for convex stochastic optimization problems to help avoid near-singularity issues. Additionally, Nesterov acceleration, with a momentum coefficient that dynamically adjusts between a constant value and zero based on the objective function, has been incorporated to enhance convergence speed. However, the inflexibility of the constant momentum coefficient still may lead to overshooting problems, and evaluating objective functions on large datasets is computationally costly. Moreover, this approach presents challenges in solving nonconvex optimization problems. To address these challenges, we propose a regularized stochastic BFGS method that integrates Nesterov acceleration with an adaptive momentum coefficient designed for solving nonconvex stochastic optimization problems. This coefficient adjusts flexibly between a decreasing value and zero based on selected dataset samples, helping to avoid overshooting problems and reduce computational costs. We demonstrated almost sure convergence to stationary points and analyze the complexity. Numerical results on convex and nonconvex classification problems using a support vector machine show that our method outperforms existing approaches.
期刊介绍:
The Journal of Computational and Applied Mathematics publishes original papers of high scientific value in all areas of computational and applied mathematics. The main interest of the Journal is in papers that describe and analyze new computational techniques for solving scientific or engineering problems. Also the improved analysis, including the effectiveness and applicability, of existing methods and algorithms is of importance. The computational efficiency (e.g. the convergence, stability, accuracy, ...) should be proved and illustrated by nontrivial numerical examples. Papers describing only variants of existing methods, without adding significant new computational properties are not of interest.
The audience consists of: applied mathematicians, numerical analysts, computational scientists and engineers.