Optimal methods for convex nested stochastic composite optimization

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Zhe Zhang, Guanghui Lan
{"title":"Optimal methods for convex nested stochastic composite optimization","authors":"Zhe Zhang, Guanghui Lan","doi":"10.1007/s10107-024-02090-3","DOIUrl":null,"url":null,"abstract":"<p>Recently, convex nested stochastic composite optimization (NSCO) has received considerable interest for its applications in reinforcement learning and risk-averse optimization. However, existing NSCO algorithms have worse stochastic oracle complexities, by orders of magnitude, than those for simpler stochastic optimization problems without nested structures. Additionally, these algorithms require all outer-layer functions to be smooth, a condition violated by some important applications. This raises a question regarding whether the nested composition make stochastic optimization more difficult in terms of oracle complexity. In this paper, we answer the question by developing order-optimal algorithms for convex NSCO problems constructed from an arbitrary composition of smooth, structured non-smooth, and general non-smooth layer functions. When all outer-layer functions are smooth, we propose a stochastic sequential dual (SSD) method to achieve an oracle complexity of <span>\\(\\mathcal {O}(1/\\epsilon ^2)\\)</span> (resp., <span>\\(\\mathcal {O}(1/\\epsilon )\\)</span>) when the problem is convex (resp., strongly convex). If any outer-layer function is non-smooth, we propose a non-smooth stochastic sequential dual (nSSD) method to achieve an <span>\\(\\mathcal {O}(1/\\epsilon ^2)\\)</span> oracle complexity. We provide a lower complexity bound to show the latter <span>\\(\\mathcal {O}(1/\\epsilon ^2)\\)</span> complexity to be unimprovable, even under a strongly convex setting. All these complexity results seem to be new in the literature, and they indicate that convex NSCO problems have the same order of oracle complexity as problems without nested composition, except in the strongly convex and outer non-smooth cases.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10107-024-02090-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, convex nested stochastic composite optimization (NSCO) has received considerable interest for its applications in reinforcement learning and risk-averse optimization. However, existing NSCO algorithms have worse stochastic oracle complexities, by orders of magnitude, than those for simpler stochastic optimization problems without nested structures. Additionally, these algorithms require all outer-layer functions to be smooth, a condition violated by some important applications. This raises a question regarding whether the nested composition make stochastic optimization more difficult in terms of oracle complexity. In this paper, we answer the question by developing order-optimal algorithms for convex NSCO problems constructed from an arbitrary composition of smooth, structured non-smooth, and general non-smooth layer functions. When all outer-layer functions are smooth, we propose a stochastic sequential dual (SSD) method to achieve an oracle complexity of \(\mathcal {O}(1/\epsilon ^2)\) (resp., \(\mathcal {O}(1/\epsilon )\)) when the problem is convex (resp., strongly convex). If any outer-layer function is non-smooth, we propose a non-smooth stochastic sequential dual (nSSD) method to achieve an \(\mathcal {O}(1/\epsilon ^2)\) oracle complexity. We provide a lower complexity bound to show the latter \(\mathcal {O}(1/\epsilon ^2)\) complexity to be unimprovable, even under a strongly convex setting. All these complexity results seem to be new in the literature, and they indicate that convex NSCO problems have the same order of oracle complexity as problems without nested composition, except in the strongly convex and outer non-smooth cases.

Abstract Image

凸嵌套随机复合优化的最优方法
最近,凸嵌套随机复合优化(NSCO)因其在强化学习和风险规避优化中的应用而受到广泛关注。然而,现有的 NSCO 算法的随机甲骨文复杂度比那些没有嵌套结构的简单随机优化问题的复杂度要差,差了几个数量级。此外,这些算法要求所有外层函数都是平滑的,而一些重要应用却违反了这一条件。这就提出了一个问题:嵌套结构是否会增加随机优化的甲骨文复杂度?在本文中,我们通过开发由光滑层函数、结构非光滑层函数和一般非光滑层函数任意组成的凸 NSCO 问题的阶优算法来回答这个问题。当所有外层函数都是平滑的时候,我们提出了一种随机顺序对偶(SSD)方法,当问题是凸的(或者说,强凸的)时,它的oracle复杂度为\(\mathcal {O}(1/\epsilon ^2)\)(或者说,\(\mathcal {O}(1/\epsilon )\))。如果任何外层函数是非光滑的,我们提出了一种非光滑随机顺序对偶(nSSD)方法,以实现 \(\mathcal {O}(1/\epsilon ^2)\) oracle 复杂性。我们提供了一个较低的复杂度约束,以证明后者的 \(\mathcal {O}(1/\epsilon ^2)\)复杂度是不可改进的,即使在强凸设置下也是如此。所有这些复杂度结果在文献中似乎都是新的,它们表明,除了强凸和外部非光滑情况外,凸NSCO问题与没有嵌套组合的问题具有相同的oracle复杂度阶数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信