Bayesian optimization for interval selection in PLS models

IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS
Nicolás Hernández , Yoonsun Choi , Tom Fearn
{"title":"Bayesian optimization for interval selection in PLS models","authors":"Nicolás Hernández ,&nbsp;Yoonsun Choi ,&nbsp;Tom Fearn","doi":"10.1016/j.chemolab.2025.105541","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a novel Bayesian optimization framework for interval selection in Partial Least Squares (PLS) regression. Unlike traditional iPLS variants that rely on fixed or grid-based intervals, our approach adaptively searches over the discrete space of interval positions of a pre-defined width using a Gaussian Process surrogate model and an acquisition function. This enables the selection of one or more informative spectral regions without exhaustive enumeration or manual tuning. Through synthetic and real-world spectroscopic datasets, we demonstrate that the proposed method consistently identifies chemically relevant intervals, reduces model complexity, and improves predictive accuracy compared to full-spectrum PLS and stepwise interval selection techniques. A Monte Carlo study further confirms the robustness and convergence of the algorithm across varying signal complexities and uncertainty levels. This flexible, data-efficient approach offers an interpretable and computationally scalable alternative for chemometric applications.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105541"},"PeriodicalIF":3.8000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743925002266","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

We propose a novel Bayesian optimization framework for interval selection in Partial Least Squares (PLS) regression. Unlike traditional iPLS variants that rely on fixed or grid-based intervals, our approach adaptively searches over the discrete space of interval positions of a pre-defined width using a Gaussian Process surrogate model and an acquisition function. This enables the selection of one or more informative spectral regions without exhaustive enumeration or manual tuning. Through synthetic and real-world spectroscopic datasets, we demonstrate that the proposed method consistently identifies chemically relevant intervals, reduces model complexity, and improves predictive accuracy compared to full-spectrum PLS and stepwise interval selection techniques. A Monte Carlo study further confirms the robustness and convergence of the algorithm across varying signal complexities and uncertainty levels. This flexible, data-efficient approach offers an interpretable and computationally scalable alternative for chemometric applications.
PLS模型中区间选择的贝叶斯优化
我们提出了一种新的贝叶斯优化框架,用于偏最小二乘(PLS)回归的区间选择。与依赖于固定或基于网格的间隔的传统iPLS变体不同,我们的方法使用高斯过程代理模型和获取函数自适应地搜索预定义宽度的间隔位置的离散空间。这使得一个或多个信息光谱区域的选择没有详尽的枚举或手动调谐。通过合成和真实光谱数据集,我们证明了与全光谱PLS和逐步区间选择技术相比,所提出的方法一致地识别化学相关区间,降低了模型复杂性,提高了预测精度。蒙特卡罗研究进一步证实了该算法在不同信号复杂性和不确定性水平下的鲁棒性和收敛性。这种灵活、数据高效的方法为化学计量学应用提供了一种可解释和计算可扩展的替代方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.50
自引率
7.70%
发文量
169
审稿时长
3.4 months
期刊介绍: Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines. Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data. The journal deals with the following topics: 1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.) 2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered. 3) Development of new software that provides novel tools or truly advances the use of chemometrical methods. 4) Well characterized data sets to test performance for the new methods and software. The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信