Gradient-Based Framework for Bilevel Optimization of Black-Box Functions: Synergizing Model-Free Reinforcement Learning and Implicit Function Differentiation
{"title":"Gradient-Based Framework for Bilevel Optimization of Black-Box Functions: Synergizing Model-Free Reinforcement Learning and Implicit Function Differentiation","authors":"Thomas Banker, and , Ali Mesbah*, ","doi":"10.1021/acs.iecr.4c0358410.1021/acs.iecr.4c03584","DOIUrl":null,"url":null,"abstract":"<p >Bilevel optimization problems are challenging to solve due to the complex interplay between upper-level and lower-level decision variables. Classical solution methods generally simplify the bilevel problem to a single level problem, whereas more recent methods such as evolutionary algorithms and Bayesian optimization take a black-box view that can suffer from scalability to larger problems. While advantageous for handling high-dimensional and nonconvex optimization problems, the application of gradient-based solution methods to bilevel problems is impeded by the implicit relationship between the upper-level and lower-level decision variables. Additionally, lack of an equation-oriented relationship between decision variables and the upper-level objective can further impede differentiability. To this end, we present a gradient-based optimization framework that leverages implicit function theorem and model-free reinforcement learning (RL) to solve bilevel optimization problems wherein only zeroth-order observations of the upper-level objective are available. Implicit differentiation allows for differentiating the optimality conditions of the lower-level problem to enable calculation of gradients of the upper-level objective. Using policy gradient RL, gradient-based updates of the upper-level decisions can then be performed in a scalable manner for high-dimension problems. The proposed framework is applied to the bilevel problem of learning optimization-based control policies for uncertain systems. Simulation results on two benchmark problems illustrate the effectiveness of the framework for goal-oriented learning of model predictive control policies. Synergizing derivative-free optimization via model-free RL and gradient calculation via implicit function differentiation can create new avenues for scalable and efficient solution of bilevel problems with black-box upper-level objective as compared to black-box optimization methods that discard the problem structure.</p>","PeriodicalId":39,"journal":{"name":"Industrial & Engineering Chemistry Research","volume":"64 5","pages":"2831–2844 2831–2844"},"PeriodicalIF":3.9000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acs.iecr.4c03584","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial & Engineering Chemistry Research","FirstCategoryId":"5","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.iecr.4c03584","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Citations: 0
Abstract
Bilevel optimization problems are challenging to solve due to the complex interplay between upper-level and lower-level decision variables. Classical solution methods generally simplify the bilevel problem to a single-level problem, whereas more recent methods such as evolutionary algorithms and Bayesian optimization take a black-box view that can scale poorly to larger problems. Although gradient-based solution methods are advantageous for handling high-dimensional and nonconvex optimization problems, their application to bilevel problems is impeded by the implicit relationship between the upper-level and lower-level decision variables. Additionally, the lack of an equation-oriented relationship between the decision variables and the upper-level objective can further impede differentiability. To this end, we present a gradient-based optimization framework that leverages the implicit function theorem and model-free reinforcement learning (RL) to solve bilevel optimization problems wherein only zeroth-order observations of the upper-level objective are available. Implicit differentiation of the optimality conditions of the lower-level problem enables calculation of gradients of the upper-level objective. Using policy gradient RL, gradient-based updates of the upper-level decisions can then be performed in a scalable manner for high-dimensional problems. The proposed framework is applied to the bilevel problem of learning optimization-based control policies for uncertain systems. Simulation results on two benchmark problems illustrate the effectiveness of the framework for goal-oriented learning of model predictive control policies. Synergizing derivative-free optimization via model-free RL with gradient calculation via implicit function differentiation can create new avenues for the scalable and efficient solution of bilevel problems with black-box upper-level objectives, as compared to black-box optimization methods that discard the problem structure.
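As a concrete illustration of the mechanism described above, the following is a minimal sketch of how implicit differentiation recovers upper-level gradients in a generic bilevel problem. The notation ($F$ for the upper-level objective, $f$ for the lower-level objective, $\theta$ and $x$ for the upper- and lower-level decision variables, $\pi_{\phi}$ for a stochastic policy) is illustrative rather than drawn from the paper itself, and a smooth, unconstrained lower level is assumed; constrained lower-level problems would instead differentiate their KKT conditions. Writing the bilevel problem as

$$
\min_{\theta} \; F\big(\theta, x^{*}(\theta)\big)
\quad \text{s.t.} \quad
x^{*}(\theta) \in \arg\min_{x} f(\theta, x),
$$

the lower-level optimum satisfies the stationarity condition $\nabla_{x} f(\theta, x^{*}(\theta)) = 0$. Differentiating this condition with respect to $\theta$ and applying the implicit function theorem yields the sensitivity of the lower-level solution,

$$
\frac{\mathrm{d} x^{*}}{\mathrm{d} \theta}
= -\big[\nabla^{2}_{xx} f\big]^{-1} \nabla^{2}_{x\theta} f,
$$

so the total derivative of the upper-level objective follows from the chain rule:

$$
\frac{\mathrm{d} F}{\mathrm{d} \theta}
= \nabla_{\theta} F
+ \Big(\frac{\mathrm{d} x^{*}}{\mathrm{d} \theta}\Big)^{\!\top} \nabla_{x} F.
$$

When $F$ is available only through zeroth-order observations, a policy-gradient (score-function) estimator of the kind the abstract invokes replaces the unavailable derivative of $F$ with a sampled estimate under a stochastic policy $\pi_{\phi}$ over the upper-level decisions:

$$
\nabla_{\phi} \, \mathbb{E}_{\theta \sim \pi_{\phi}}\big[F(\theta)\big]
= \mathbb{E}_{\theta \sim \pi_{\phi}}\big[F(\theta) \, \nabla_{\phi} \log \pi_{\phi}(\theta)\big],
$$

which requires only function evaluations, making it compatible with black-box upper-level objectives while the implicit-gradient machinery handles the lower-level structure.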
Journal Introduction:
Industrial & Engineering Chemistry, with variations in title and format, has been published since 1909 by the American Chemical Society. Industrial & Engineering Chemistry Research is a weekly publication that reports industrial and academic research in the broad fields of applied chemistry and chemical engineering, with special focus on fundamentals, processes, and products.