{"title":"Improving Bayesian optimization via hierarchical variation modeling for combinatorial experiments given limited runs guided by process knowledge","authors":"An-Tsun Wei , Shu Liu , Steven Lenhert , Hui Wang","doi":"10.1016/j.knosys.2024.112596","DOIUrl":null,"url":null,"abstract":"<div><div>Active learning based on Bayesian optimization (BO) is a popular black-box combinatorial search method, particularly effective for autonomous experimentation. However, existing BO methods did not consider the joint variation caused by the process degradation over time and input-dependent variation. The challenge is more significant when the affordable experimental runs are very limited. State-of-the-art approaches did not address allocating limited experimental runs that can jointly cover (1) representative inputs over large search space for identifying the best combination, (2) replicates reflecting the true input-dependent testing variation, and (3) process variations that increase over time due to process degradation. This paper proposed Empirical Bayesian Hierarchical Variation Modeling in Bayesian Optimization (EHVBO) guided by the process knowledge to maximize the exploration of potential combinations in sequential experiments given limited experimental runs. The method first mitigates the process degradation effect through generalized linear modeling of grouped variations, guided by the knowledge of the re-calibration cycle of process conditions. Then, EHVBO introduces an empirical Bayesian hierarchical model to reduce the replicates for learning the input-dependent variation, leveraging the process knowledge of the common structure shared across different testing combinations. This way can reduce the necessary replicates for each input condition. Furthermore, the paper developed a heuristics-based strategy incorporated in EHVBO to improve search efficiency by selectively refining the search space over pivotal regions and excluding less-promising regions. A case study based on real experimental data demonstrates that the proposed method outperforms testing results from various optimization models.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124012309","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Active learning based on Bayesian optimization (BO) is a popular black-box combinatorial search method, particularly effective for autonomous experimentation. However, existing BO methods did not consider the joint variation caused by the process degradation over time and input-dependent variation. The challenge is more significant when the affordable experimental runs are very limited. State-of-the-art approaches did not address allocating limited experimental runs that can jointly cover (1) representative inputs over large search space for identifying the best combination, (2) replicates reflecting the true input-dependent testing variation, and (3) process variations that increase over time due to process degradation. This paper proposed Empirical Bayesian Hierarchical Variation Modeling in Bayesian Optimization (EHVBO) guided by the process knowledge to maximize the exploration of potential combinations in sequential experiments given limited experimental runs. The method first mitigates the process degradation effect through generalized linear modeling of grouped variations, guided by the knowledge of the re-calibration cycle of process conditions. Then, EHVBO introduces an empirical Bayesian hierarchical model to reduce the replicates for learning the input-dependent variation, leveraging the process knowledge of the common structure shared across different testing combinations. This way can reduce the necessary replicates for each input condition. Furthermore, the paper developed a heuristics-based strategy incorporated in EHVBO to improve search efficiency by selectively refining the search space over pivotal regions and excluding less-promising regions. A case study based on real experimental data demonstrates that the proposed method outperforms testing results from various optimization models.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.