Integration of advanced chemometric and machine learning techniques: Stacking model and XGBoost dynamic correction for aniline detection with an unmodified carbon paste electrode
{"title":"Integration of advanced chemometric and machine learning techniques: Stacking model and XGBoost dynamic correction for aniline detection with an unmodified carbon paste electrode","authors":"Meryem NINI, Mohamed NOHAIR","doi":"10.1016/j.ijleo.2025.172369","DOIUrl":null,"url":null,"abstract":"<div><div>This study proposes a novel Chemometric approach for the precise and reliable quantification of aniline using an unmodified carbon paste electrode (CPE), even in the presence of interfering species such as phenol. The proposed methodology integrates partial least squares (PLS), random forest (RF) and ridge regression in a stacked model framework, combined with particle swarm optimization (PSO) for potential selection. To enhance the prediction accuracy, a dynamic correction based on XGBoost was applied, which effectively minimized residual errors and improved model robustness. The model developed demonstrated high performance on the training (R<sup>2</sup> = 0.9993, RMSE = 0.6260 and MAE = 0.4773), test (R<sup>2</sup> = 0.9979, RMSE = 0.7616 and MAE = 0.6839) and cross-validation datasets (R<sup>2</sup> = 0.9988, RMSE = 0.8312 and MAE = 0.5467), thus confirming its stability and reliability. The high coefficient of determination (R2), as well as the low root mean square error (RMSE) and mean absolute deviation (MAE) values, underscore the model's robust predictive capabilities and its capacity to accommodate intricate electrochemical interactions. Moreover, the calculated recovery rates and tolerance limits signify high precision and consistency across varying concentration levels. Utilizing an unmodified EPC, this approach proffers a cost-effective, sensitive, and selective strategy for aniline quantification, rendering it especially well-suited for environmental and industrial monitoring. The model's strong predictive capabilities underscore its potential for real-world contaminant detection, offering a reliable solution for complex analytical environments.</div></div>","PeriodicalId":19513,"journal":{"name":"Optik","volume":"330 ","pages":"Article 172369"},"PeriodicalIF":3.1000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optik","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0030402625001573","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a novel Chemometric approach for the precise and reliable quantification of aniline using an unmodified carbon paste electrode (CPE), even in the presence of interfering species such as phenol. The proposed methodology integrates partial least squares (PLS), random forest (RF) and ridge regression in a stacked model framework, combined with particle swarm optimization (PSO) for potential selection. To enhance the prediction accuracy, a dynamic correction based on XGBoost was applied, which effectively minimized residual errors and improved model robustness. The model developed demonstrated high performance on the training (R2 = 0.9993, RMSE = 0.6260 and MAE = 0.4773), test (R2 = 0.9979, RMSE = 0.7616 and MAE = 0.6839) and cross-validation datasets (R2 = 0.9988, RMSE = 0.8312 and MAE = 0.5467), thus confirming its stability and reliability. The high coefficient of determination (R2), as well as the low root mean square error (RMSE) and mean absolute deviation (MAE) values, underscore the model's robust predictive capabilities and its capacity to accommodate intricate electrochemical interactions. Moreover, the calculated recovery rates and tolerance limits signify high precision and consistency across varying concentration levels. Utilizing an unmodified EPC, this approach proffers a cost-effective, sensitive, and selective strategy for aniline quantification, rendering it especially well-suited for environmental and industrial monitoring. The model's strong predictive capabilities underscore its potential for real-world contaminant detection, offering a reliable solution for complex analytical environments.
期刊介绍:
Optik publishes articles on all subjects related to light and electron optics and offers a survey on the state of research and technical development within the following fields:
Optics:
-Optics design, geometrical and beam optics, wave optics-
Optical and micro-optical components, diffractive optics, devices and systems-
Photoelectric and optoelectronic devices-
Optical properties of materials, nonlinear optics, wave propagation and transmission in homogeneous and inhomogeneous materials-
Information optics, image formation and processing, holographic techniques, microscopes and spectrometer techniques, and image analysis-
Optical testing and measuring techniques-
Optical communication and computing-
Physiological optics-
As well as other related topics.