{"title":"Model-free Optimization: The Exploration-Exploitation Paradigm","authors":"Mariya Raphel, Revati Gunjal, S. Wagh, N. Singh","doi":"10.1109/ICC56513.2022.10093277","DOIUrl":null,"url":null,"abstract":"The trade-off between exploration and exploitation has been crucial in the field of optimization where the cost function is expensive and takes time to converge. The use of Exploration-Exploitation concept in the acquisition function helps to find the optimal values of the objective function completely model-free. Steady-state input-output map of a dynamical system gives an input-output relation that can be utilised to replace the optimizer of the objective function with the control signal. Hence, solving an expensive optimization problem of a control application gradient-free and without injecting perturbation signal. Initial sample size, sampling technique, and the type of acquisition function influences the rate of convergence of the objective function to its optimum. As the sample size increases the function value converges to its optimum faster with less computation time. The use of Expected Improvement as an acquisition function converges the function value closer to its optimum value and gives better-approximated results as compared to other acquisition functions. Model-free optimization using exploration and exploitation can be used widely in data-driven based control application to compute black-box functions solely based on input and output measurements making it computationally less burden.","PeriodicalId":101654,"journal":{"name":"2022 Eighth Indian Control Conference (ICC)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Eighth Indian Control Conference (ICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICC56513.2022.10093277","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The trade-off between exploration and exploitation has been crucial in the field of optimization where the cost function is expensive and takes time to converge. The use of Exploration-Exploitation concept in the acquisition function helps to find the optimal values of the objective function completely model-free. Steady-state input-output map of a dynamical system gives an input-output relation that can be utilised to replace the optimizer of the objective function with the control signal. Hence, solving an expensive optimization problem of a control application gradient-free and without injecting perturbation signal. Initial sample size, sampling technique, and the type of acquisition function influences the rate of convergence of the objective function to its optimum. As the sample size increases the function value converges to its optimum faster with less computation time. The use of Expected Improvement as an acquisition function converges the function value closer to its optimum value and gives better-approximated results as compared to other acquisition functions. Model-free optimization using exploration and exploitation can be used widely in data-driven based control application to compute black-box functions solely based on input and output measurements making it computationally less burden.