{"title":"径向基函数插值下Epsilon-Greedy全局优化的收敛速度","authors":"Jialin Li, I. Ryzhov","doi":"10.1287/stsy.2022.0096","DOIUrl":null,"url":null,"abstract":"We study a global optimization problem where the objective function can be observed exactly at individual design points with no derivative information. We suppose that the design points are determined sequentially using an epsilon-greedy algorithm, that is, by sampling uniformly on the design space with a certain probability and otherwise sampling in a local neighborhood of the current estimate of the best solution. We study the rate at which the estimate converges to the global optimum and derive two types of bounds: an asymptotic pathwise rate and a concentration inequality measuring the likelihood that the asymptotic rate has not yet gone into effect. The order of the rate becomes faster when the width of the local search neighborhood is made to shrink over time at a suitably chosen speed.","PeriodicalId":36337,"journal":{"name":"Stochastic Systems","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Convergence Rates of Epsilon-Greedy Global Optimization Under Radial Basis Function Interpolation\",\"authors\":\"Jialin Li, I. Ryzhov\",\"doi\":\"10.1287/stsy.2022.0096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We study a global optimization problem where the objective function can be observed exactly at individual design points with no derivative information. We suppose that the design points are determined sequentially using an epsilon-greedy algorithm, that is, by sampling uniformly on the design space with a certain probability and otherwise sampling in a local neighborhood of the current estimate of the best solution. We study the rate at which the estimate converges to the global optimum and derive two types of bounds: an asymptotic pathwise rate and a concentration inequality measuring the likelihood that the asymptotic rate has not yet gone into effect. The order of the rate becomes faster when the width of the local search neighborhood is made to shrink over time at a suitably chosen speed.\",\"PeriodicalId\":36337,\"journal\":{\"name\":\"Stochastic Systems\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1287/stsy.2022.0096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/stsy.2022.0096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
Convergence Rates of Epsilon-Greedy Global Optimization Under Radial Basis Function Interpolation
We study a global optimization problem where the objective function can be observed exactly at individual design points with no derivative information. We suppose that the design points are determined sequentially using an epsilon-greedy algorithm, that is, by sampling uniformly on the design space with a certain probability and otherwise sampling in a local neighborhood of the current estimate of the best solution. We study the rate at which the estimate converges to the global optimum and derive two types of bounds: an asymptotic pathwise rate and a concentration inequality measuring the likelihood that the asymptotic rate has not yet gone into effect. The order of the rate becomes faster when the width of the local search neighborhood is made to shrink over time at a suitably chosen speed.