Improving Bayesian optimization via hierarchical variation modeling for combinatorial experiments given limited runs guided by process knowledge

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2024-10-05 DOI:10.1016/j.knosys.2024.112596

An-Tsun Wei , Shu Liu , Steven Lenhert , Hui Wang

{"title":"Improving Bayesian optimization via hierarchical variation modeling for combinatorial experiments given limited runs guided by process knowledge","authors":"An-Tsun Wei , Shu Liu , Steven Lenhert , Hui Wang","doi":"10.1016/j.knosys.2024.112596","DOIUrl":null,"url":null,"abstract":"<div><div>Active learning based on Bayesian optimization (BO) is a popular black-box combinatorial search method, particularly effective for autonomous experimentation. However, existing BO methods did not consider the joint variation caused by the process degradation over time and input-dependent variation. The challenge is more significant when the affordable experimental runs are very limited. State-of-the-art approaches did not address allocating limited experimental runs that can jointly cover (1) representative inputs over large search space for identifying the best combination, (2) replicates reflecting the true input-dependent testing variation, and (3) process variations that increase over time due to process degradation. This paper proposed Empirical Bayesian Hierarchical Variation Modeling in Bayesian Optimization (EHVBO) guided by the process knowledge to maximize the exploration of potential combinations in sequential experiments given limited experimental runs. The method first mitigates the process degradation effect through generalized linear modeling of grouped variations, guided by the knowledge of the re-calibration cycle of process conditions. Then, EHVBO introduces an empirical Bayesian hierarchical model to reduce the replicates for learning the input-dependent variation, leveraging the process knowledge of the common structure shared across different testing combinations. This way can reduce the necessary replicates for each input condition. Furthermore, the paper developed a heuristics-based strategy incorporated in EHVBO to improve search efficiency by selectively refining the search space over pivotal regions and excluding less-promising regions. A case study based on real experimental data demonstrates that the proposed method outperforms testing results from various optimization models.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124012309","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Active learning based on Bayesian optimization (BO) is a popular black-box combinatorial search method, particularly effective for autonomous experimentation. However, existing BO methods did not consider the joint variation caused by the process degradation over time and input-dependent variation. The challenge is more significant when the affordable experimental runs are very limited. State-of-the-art approaches did not address allocating limited experimental runs that can jointly cover (1) representative inputs over large search space for identifying the best combination, (2) replicates reflecting the true input-dependent testing variation, and (3) process variations that increase over time due to process degradation. This paper proposed Empirical Bayesian Hierarchical Variation Modeling in Bayesian Optimization (EHVBO) guided by the process knowledge to maximize the exploration of potential combinations in sequential experiments given limited experimental runs. The method first mitigates the process degradation effect through generalized linear modeling of grouped variations, guided by the knowledge of the re-calibration cycle of process conditions. Then, EHVBO introduces an empirical Bayesian hierarchical model to reduce the replicates for learning the input-dependent variation, leveraging the process knowledge of the common structure shared across different testing combinations. This way can reduce the necessary replicates for each input condition. Furthermore, the paper developed a heuristics-based strategy incorporated in EHVBO to improve search efficiency by selectively refining the search space over pivotal regions and excluding less-promising regions. A case study based on real experimental data demonstrates that the proposed method outperforms testing results from various optimization models.

查看原文本刊更多论文

通过分层变异建模改进贝叶斯优化，用于在过程知识指导下进行有限运行的组合实验

基于贝叶斯优化（BO）的主动学习是一种流行的黑盒组合搜索方法，对于自主实验尤为有效。然而，现有的贝叶斯优化方法并没有考虑过程随时间退化和输入相关性变化所引起的联合变化。当可负担的实验次数非常有限时，这一挑战就更为严峻。最先进的方法没有考虑到分配有限的实验运行，以共同涵盖：（1）在大搜索空间中具有代表性的输入，以确定最佳组合；（2）反映真实的与输入相关的测试变化的重复；以及（3）由于工艺退化而随时间增加的工艺变化。本文提出了贝叶斯优化中的经验贝叶斯分层变异建模（EHVBO），以工艺知识为指导，在有限的实验运行中最大限度地探索连续实验中的潜在组合。该方法首先在工艺条件重新校准周期知识的指导下，通过分组变化的广义线性建模来减轻工艺退化效应。然后，EHVBO 引入经验贝叶斯分层模型，利用不同测试组合共享的共同结构的工艺知识，减少学习输入相关变异的重复次数。这种方法可以减少每个输入条件所需的重复次数。此外，论文还开发了一种基于启发式的策略，将其纳入 EHVBO，通过有选择地细化关键区域的搜索空间并排除前景较差的区域来提高搜索效率。基于真实实验数据的案例研究表明，所提出的方法优于各种优化模型的测试结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.