{"title":"A Convergence Study of SGD-Type Methods for Stochastic Optimization","authors":"Tiannan Xiao, Guoguo Yang","doi":"10.4208/nmtma.oa-2022-0179","DOIUrl":null,"url":null,"abstract":"In this paper, we first reinvestigate the convergence of vanilla SGD method in the sense of $L^2$ under more general learning rates conditions and a more general convex assumption, which relieves the conditions on learning rates and do not need the problem to be strongly convex. Then, by taking advantage of the Lyapunov function technique, we present the convergence of the momentum SGD and Nesterov accelerated SGD methods for the convex and non-convex problem under $L$-smooth assumption that extends the bounded gradient limitation to a certain extent. The convergence of time averaged SGD was also analyzed.","PeriodicalId":51146,"journal":{"name":"Numerical Mathematics-Theory Methods and Applications","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2022-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Numerical Mathematics-Theory Methods and Applications","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.4208/nmtma.oa-2022-0179","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we first reinvestigate the convergence of vanilla SGD method in the sense of $L^2$ under more general learning rates conditions and a more general convex assumption, which relieves the conditions on learning rates and do not need the problem to be strongly convex. Then, by taking advantage of the Lyapunov function technique, we present the convergence of the momentum SGD and Nesterov accelerated SGD methods for the convex and non-convex problem under $L$-smooth assumption that extends the bounded gradient limitation to a certain extent. The convergence of time averaged SGD was also analyzed.
期刊介绍:
Numerical Mathematics: Theory, Methods and Applications (NM-TMA) publishes high-quality original research papers on the construction, analysis and application of numerical methods for solving scientific and engineering problems. Important research and expository papers devoted to the numerical solution of mathematical equations arising in all areas of science and technology are expected. The journal originates from the journal Numerical Mathematics: A Journal of Chinese Universities (English Edition). NM-TMA is a refereed international journal sponsored by Nanjing University and the Ministry of Education of China. As an international journal, NM-TMA is published in a timely fashion in printed and electronic forms.