{"title":"增强数据点重要性:多变量校准中变量的分层重要性","authors":"Somaye Vali Zade , Klaus Neymeyr , Mathias Sawall , Hamid Abdollahi","doi":"10.1016/j.aca.2024.343357","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.</div></div><div><h3>Results</h3><div>EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.</div></div><div><h3>Significance</h3><div>The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.</div></div>","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"1332 ","pages":"Article 343357"},"PeriodicalIF":5.7000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced data point importance: Layered significance of variables in multivariate calibration\",\"authors\":\"Somaye Vali Zade , Klaus Neymeyr , Mathias Sawall , Hamid Abdollahi\",\"doi\":\"10.1016/j.aca.2024.343357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.</div></div><div><h3>Results</h3><div>EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.</div></div><div><h3>Significance</h3><div>The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.</div></div>\",\"PeriodicalId\":240,\"journal\":{\"name\":\"Analytica Chimica Acta\",\"volume\":\"1332 \",\"pages\":\"Article 343357\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytica Chimica Acta\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0003267024011589\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003267024011589","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Enhanced data point importance: Layered significance of variables in multivariate calibration
Background
The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.
Results
EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.
Significance
The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.
期刊介绍:
Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.