Convergence Rates of Epsilon-Greedy Global Optimization Under Radial Basis Function Interpolation

Q1 Mathematics

Stochastic Systems, Pub Date: 2022-08-02, DOI: 10.1287/stsy.2022.0096

Jialin Li, I. Ryzhov


Citations: 4

Abstract


Convergence Rates of Epsilon-Greedy Global Optimization Under Radial Basis Function Interpolation

We study a global optimization problem where the objective function can be observed exactly at individual design points with no derivative information. We suppose that the design points are determined sequentially using an epsilon-greedy algorithm, that is, by sampling uniformly on the design space with a certain probability and otherwise sampling in a local neighborhood of the current estimate of the best solution. We study the rate at which the estimate converges to the global optimum and derive two types of bounds: an asymptotic pathwise rate and a concentration inequality measuring the likelihood that the asymptotic rate has not yet gone into effect. The order of the rate becomes faster when the width of the local search neighborhood is made to shrink over time at a suitably chosen speed.
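The sampling scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian kernel, the small ridge term, the unit-cube design space, the candidate-set search for the current estimate, the minimization convention, and the shrinking schedule `delta0 * n ** (-alpha)` are all assumptions made here for illustration, as are the function and parameter names.

```python
import numpy as np


def rbf_fit(X, y, shape=5.0, ridge=1e-8):
    """Solve for Gaussian-RBF weights so the surrogate (nearly) interpolates y at X.

    The tiny ridge is an assumption added for numerical stability only.
    """
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    K = np.exp(-shape * sq)
    return np.linalg.solve(K + ridge * np.eye(len(X)), y)


def rbf_predict(X, w, Z, shape=5.0):
    """Evaluate the fitted RBF surrogate at the query points Z."""
    sq = ((Z[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-shape * sq) @ w


def epsilon_greedy_minimize(f, dim, n_iter=80, n_init=10, eps=0.3,
                            delta0=0.25, alpha=0.5, n_cand=200, seed=0):
    """Epsilon-greedy search on [0, 1]^dim using exact, derivative-free evaluations of f."""
    rng = np.random.default_rng(seed)
    X = rng.uniform(0.0, 1.0, size=(n_init, dim))  # initial uniform design
    y = np.array([f(x) for x in X])
    for n in range(1, n_iter - n_init + 1):
        w = rbf_fit(X, y)
        # Current estimate of the best solution: surrogate minimizer over a
        # candidate set (observed points plus fresh uniform candidates).
        cand = np.vstack([X, rng.uniform(0.0, 1.0, size=(n_cand, dim))])
        est = cand[np.argmin(rbf_predict(X, w, cand))]
        if rng.random() < eps:
            # Exploration: sample uniformly on the whole design space.
            x_new = rng.uniform(0.0, 1.0, size=dim)
        else:
            # Exploitation: sample in a box around the estimate whose
            # half-width shrinks over time (assumed schedule).
            delta = delta0 * n ** (-alpha)
            lo = np.maximum(est - delta, 0.0)
            hi = np.minimum(est + delta, 1.0)
            x_new = rng.uniform(lo, hi)
        X = np.vstack([X, x_new])
        y = np.append(y, f(x_new))
    best = np.argmin(y)
    return X[best], y[best], X
```

For example, `epsilon_greedy_minimize(lambda x: float(np.sum((x - 0.3) ** 2)), dim=1)` returns the best observed point, its objective value, and the full sequence of design points. Setting `alpha=0` keeps the neighborhood width fixed, which is the baseline against which the abstract's claim about shrinking neighborhoods is stated.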


Source journal

Stochastic Systems Decision Sciences-Statistics, Probability and Uncertainty

CiteScore

3.70

Self-citation rate

0.00%

Number of articles published