局部线性多元校正的局部自适应融合回归(LAFR):在大数据集上的应用。

IF 2.2 3区 化学 Q2 INSTRUMENTS & INSTRUMENTATION
Applied Spectroscopy Pub Date : 2025-05-01 Epub Date: 2025-01-23 DOI:10.1177/00037028241308538
Robert Spiers, John H Kalivas
{"title":"局部线性多元校正的局部自适应融合回归(LAFR):在大数据集上的应用。","authors":"Robert Spiers, John H Kalivas","doi":"10.1177/00037028241308538","DOIUrl":null,"url":null,"abstract":"<p><p>Impeding linear calibration models from accurately predicting target sample analyte amounts are the target sample-wise deviations in measurement profiles (e.g., spectra) relative to calibration samples. Target sample measurement shifts are due to uncontrollable factors, compositely termed matrix effects, such as temperature, instrument drift, and sample composition divergences relative to analyte and other species amounts altering inter and intramolecular interactions. One approach to circumvent the matrix effect matching problem is to use local modeling where a library with thousands of samples and respective reference analyte values is mined for unique calibration sets matched to each target sample, including analyte amounts between calibration and target samples. Current local modeling methods suffer because it is wrongly assumed similar measurements between calibration and target samples translate to a complete locally matched calibration set. Measurements can be similar, but the underlying matrix effects (and analyte amount) can be drastically different. The presented procedure named local adaptive fusion regression (LAFR) solves this matrix effect matching problem with crucial local modeling paradigm shifts. Expertise with LAFR is unnecessary because input hyperparameters are self-optimized. The capabilities of LAFR to form highly dense localized linear calibration sets matched to target samples spectrally and analyte amounts are verified using a well-studied nonlinear benchmark near-infrared (NIR) meat dataset, a NIR sugarcane dataset covering four major process steps with multiple subgroups within, and a NIR soil database of 98 910 samples spanning the contiguous USA. While LAFR is tested on NIR datasets, it is applicable to other measurement systems affected by matrix effects in a broad sense.</p>","PeriodicalId":8253,"journal":{"name":"Applied Spectroscopy","volume":" ","pages":"797-807"},"PeriodicalIF":2.2000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Local Adaptive Fusion Regression (LAFR) for Local Linear Multivariate Calibration: Application to Large Datasets.\",\"authors\":\"Robert Spiers, John H Kalivas\",\"doi\":\"10.1177/00037028241308538\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Impeding linear calibration models from accurately predicting target sample analyte amounts are the target sample-wise deviations in measurement profiles (e.g., spectra) relative to calibration samples. Target sample measurement shifts are due to uncontrollable factors, compositely termed matrix effects, such as temperature, instrument drift, and sample composition divergences relative to analyte and other species amounts altering inter and intramolecular interactions. One approach to circumvent the matrix effect matching problem is to use local modeling where a library with thousands of samples and respective reference analyte values is mined for unique calibration sets matched to each target sample, including analyte amounts between calibration and target samples. Current local modeling methods suffer because it is wrongly assumed similar measurements between calibration and target samples translate to a complete locally matched calibration set. Measurements can be similar, but the underlying matrix effects (and analyte amount) can be drastically different. The presented procedure named local adaptive fusion regression (LAFR) solves this matrix effect matching problem with crucial local modeling paradigm shifts. Expertise with LAFR is unnecessary because input hyperparameters are self-optimized. The capabilities of LAFR to form highly dense localized linear calibration sets matched to target samples spectrally and analyte amounts are verified using a well-studied nonlinear benchmark near-infrared (NIR) meat dataset, a NIR sugarcane dataset covering four major process steps with multiple subgroups within, and a NIR soil database of 98 910 samples spanning the contiguous USA. While LAFR is tested on NIR datasets, it is applicable to other measurement systems affected by matrix effects in a broad sense.</p>\",\"PeriodicalId\":8253,\"journal\":{\"name\":\"Applied Spectroscopy\",\"volume\":\" \",\"pages\":\"797-807\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1177/00037028241308538\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1177/00037028241308538","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

摘要

阻碍线性校准模型准确预测目标样品分析物量的是相对于校准样品的测量剖面(例如,光谱)中目标样品的偏差。目标样品测量位移是由于不可控因素造成的,这些因素综合称为基质效应,如温度、仪器漂移、相对于分析物和其他物种数量的样品组成差异,改变了分子间和分子内的相互作用。规避矩阵效应匹配问题的一种方法是使用局部建模,其中挖掘包含数千个样本和各自参考分析物值的库,以获得与每个目标样本匹配的唯一校准集,包括校准样本和目标样本之间的分析物量。当前的局部建模方法受到影响,因为它错误地假设校准和目标样本之间的相似测量值转化为完整的局部匹配校准集。测量结果可能是相似的,但潜在的基质效应(和分析物的数量)可能完全不同。提出的局部自适应融合回归(LAFR)方法解决了具有关键局部建模范式转换的矩阵效应匹配问题。LAFR的专业知识是不必要的,因为输入超参数是自优化的。LAFR形成与目标样品光谱和分析物数量相匹配的高密度局部线性校准集的能力通过一个经过充分研究的非线性基准近红外(NIR)肉类数据集、一个覆盖四个主要过程步骤的近红外甘蔗数据集以及一个覆盖美国连续地区98910个样品的近红外土壤数据库进行了验证。虽然LAFR是在近红外数据集上测试的,但它在广义上适用于受矩阵效应影响的其他测量系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Local Adaptive Fusion Regression (LAFR) for Local Linear Multivariate Calibration: Application to Large Datasets.

Impeding linear calibration models from accurately predicting target sample analyte amounts are the target sample-wise deviations in measurement profiles (e.g., spectra) relative to calibration samples. Target sample measurement shifts are due to uncontrollable factors, compositely termed matrix effects, such as temperature, instrument drift, and sample composition divergences relative to analyte and other species amounts altering inter and intramolecular interactions. One approach to circumvent the matrix effect matching problem is to use local modeling where a library with thousands of samples and respective reference analyte values is mined for unique calibration sets matched to each target sample, including analyte amounts between calibration and target samples. Current local modeling methods suffer because it is wrongly assumed similar measurements between calibration and target samples translate to a complete locally matched calibration set. Measurements can be similar, but the underlying matrix effects (and analyte amount) can be drastically different. The presented procedure named local adaptive fusion regression (LAFR) solves this matrix effect matching problem with crucial local modeling paradigm shifts. Expertise with LAFR is unnecessary because input hyperparameters are self-optimized. The capabilities of LAFR to form highly dense localized linear calibration sets matched to target samples spectrally and analyte amounts are verified using a well-studied nonlinear benchmark near-infrared (NIR) meat dataset, a NIR sugarcane dataset covering four major process steps with multiple subgroups within, and a NIR soil database of 98 910 samples spanning the contiguous USA. While LAFR is tested on NIR datasets, it is applicable to other measurement systems affected by matrix effects in a broad sense.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Spectroscopy
Applied Spectroscopy 工程技术-光谱学
CiteScore
6.60
自引率
5.70%
发文量
139
审稿时长
3.5 months
期刊介绍: Applied Spectroscopy is one of the world''s leading spectroscopy journals, publishing high-quality peer-reviewed articles, both fundamental and applied, covering all aspects of spectroscopy. Established in 1951, the journal is owned by the Society for Applied Spectroscopy and is published monthly. The journal is dedicated to fulfilling the mission of the Society to “…advance and disseminate knowledge and information concerning the art and science of spectroscopy and other allied sciences.”
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信