增强数据点重要性:多变量校准中变量的分层重要性

IF 5.7 2区 化学 Q1 CHEMISTRY, ANALYTICAL
Somaye Vali Zade , Klaus Neymeyr , Mathias Sawall , Hamid Abdollahi
{"title":"增强数据点重要性:多变量校准中变量的分层重要性","authors":"Somaye Vali Zade ,&nbsp;Klaus Neymeyr ,&nbsp;Mathias Sawall ,&nbsp;Hamid Abdollahi","doi":"10.1016/j.aca.2024.343357","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.</div></div><div><h3>Results</h3><div>EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.</div></div><div><h3>Significance</h3><div>The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.</div></div>","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"1332 ","pages":"Article 343357"},"PeriodicalIF":5.7000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced data point importance: Layered significance of variables in multivariate calibration\",\"authors\":\"Somaye Vali Zade ,&nbsp;Klaus Neymeyr ,&nbsp;Mathias Sawall ,&nbsp;Hamid Abdollahi\",\"doi\":\"10.1016/j.aca.2024.343357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.</div></div><div><h3>Results</h3><div>EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.</div></div><div><h3>Significance</h3><div>The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.</div></div>\",\"PeriodicalId\":240,\"journal\":{\"name\":\"Analytica Chimica Acta\",\"volume\":\"1332 \",\"pages\":\"Article 343357\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytica Chimica Acta\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0003267024011589\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003267024011589","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

背景介绍增强数据点重要性(EDPI)方法是一种评估多元校准中数据点重要性的系统方法。因子分解方法可以评估变量对维持抽象空间中数据结构模式的影响。基本数据点在这些模式中起着关键作用,数据点重要性(DPI)方法旨在评估基本数据点的重要性。所有其他点的评分均为零。在本文中,DPI 扩展到包括内部点,以便在没有基本点的情况下评估这些点的重要性。结果将 EDPI 方法应用于近红外和拉曼光谱数据集,包括玉米和酒精混合物以及模拟数据,对重要变量进行排序和选择。EDPI 有效地识别了有助于保持数据结构的变量,并突出了具有不同选择性的关键光谱区域。在酒精数据集中,EDPI 通过关注非分析物光谱重叠的特定区域,揭示了重要的物理化学观点。它的性能与投影变量重要性(VIP)方法类似,但选择的变量更少。重要意义使用偏最小二乘法校准近红外和拉曼光谱数据集所获得的实验结果,与传统的投影变量重要性(VIP)变量选择方法相比,凸显了所提出的 EDPI 策略的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Enhanced data point importance: Layered significance of variables in multivariate calibration

Enhanced data point importance: Layered significance of variables in multivariate calibration

Enhanced data point importance: Layered significance of variables in multivariate calibration

Background

The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.

Results

EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.

Significance

The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytica Chimica Acta
Analytica Chimica Acta 化学-分析化学
CiteScore
10.40
自引率
6.50%
发文量
1081
审稿时长
38 days
期刊介绍: Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review. Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信