Chemometrics and Intelligent Laboratory Systems最新文献_第2页

Advancing QSAR models in drug discovery for best practices, theoretical foundations, and applications in targeting nuclear factor-κB inhibitors- A bright future in pharmaceutical chemistry 推进QSAR模型在药物发现中的最佳实践，理论基础，以及针对核因子κ b抑制剂的应用-药物化学的光明前景

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-10-03 DOI: 10.1016/j.chemolab.2025.105544

Nour-El-Houda Hammoudi , Oussama Lalaoui , Widad Sobhi , Alessandro Erto , Luca Micoli , Byong-Hun Jeon , Yacine Benguerba , Walid Elfalleh , Mohamed A.M. Ali , Nasir A. Ibrahim , Hichem Tahraoui , Abdeltif Amrane

{"title":"Advancing QSAR models in drug discovery for best practices, theoretical foundations, and applications in targeting nuclear factor-κB inhibitors- A bright future in pharmaceutical chemistry","authors":"Nour-El-Houda Hammoudi , Oussama Lalaoui , Widad Sobhi , Alessandro Erto , Luca Micoli , Byong-Hun Jeon , Yacine Benguerba , Walid Elfalleh , Mohamed A.M. Ali , Nasir A. Ibrahim , Hichem Tahraoui , Abdeltif Amrane","doi":"10.1016/j.chemolab.2025.105544","DOIUrl":"10.1016/j.chemolab.2025.105544","url":null,"abstract":"<div><div>Developing robust and valuable quantitative structure-activity relationship (QSAR) models has become increasingly significant in modern drug design. These models play a crucial role by enabling the determination of molecular properties of compounds and predicting their bioactivities for therapeutic targets. QSAR models utilize various machine learning methods, such as support vector machines (SVM), multiple linear regression (MLR), and artificial neural networks (ANNs). These widely applicable methods have substantial implications for developing more precise medicines. The effectiveness of QSAR research dramatically relies on how each process step is conducted and how the analysis is carried out. This paper discusses the essential steps in developing and validating QSAR models using machine learning. A case study is presented to provide a clear example, focusing on 121 compounds acting as potent nuclear factor-κB inhibitors (NF-κB). The study compares multiple predictive QSAR models based primarily on linear and non-linear regression techniques.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105544"},"PeriodicalIF":3.8,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145262447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An automated preprocessing framework for near infrared spectroscopic data 近红外光谱数据的自动预处理框架

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-28 DOI: 10.1016/j.chemolab.2025.105542

Xiaojing Chen , Zhonghao Xie , Roma Tauler , Yong He , Pengcheng Nie , Yankun Peng , Liang Shu , Shujat Ali , Guangzao Huang , Wen Shi , Xi Chen , Leiming Yuan

{"title":"An automated preprocessing framework for near infrared spectroscopic data","authors":"Xiaojing Chen , Zhonghao Xie , Roma Tauler , Yong He , Pengcheng Nie , Yankun Peng , Liang Shu , Shujat Ali , Guangzao Huang , Wen Shi , Xi Chen , Leiming Yuan","doi":"10.1016/j.chemolab.2025.105542","DOIUrl":"10.1016/j.chemolab.2025.105542","url":null,"abstract":"<div><div>Preprocessing plays a vital role in the analysis of Near-infrared spectroscopy (NIRS) data as it aims to remove unintended artifacts. This process involves a series of steps, each with a specific focus on a particular artifact. However, due to the diverse range of NIRS applications, selecting the optimal combination of preprocessing methods remains a challenge. To address this issue, we propose an automated preprocessing framework that can quickly identify the optimal preprocessing strategy. The framework initially constructs a workflow consisting of multiple types of preprocessing methods. Then, a genetic algorithm (GA) technique is used to optimize the best pipeline, avoiding exhaustive searches. In addition, we impose a penalty for the loss function of the GA process to obtain a parsimonious solution. Results on three real-world datasets demonstrate that our approach outperforms several state-of-the-art ensemble preprocessing methods in terms of prediction error. Compared to the raw data, the optimal preprocessing method can improve model performance by at least 48%. Furthermore, our framework enables the identification of the most effective preprocessing methods included in the best pipeline. The source code for our approach is available on GitHub and can be easily integrated with other existing preprocessing techniques.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105542"},"PeriodicalIF":3.8,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145217363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Conformalized outlier detection for mass spectrometry data 质谱数据的规范化离群值检测

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-23 DOI: 10.1016/j.chemolab.2025.105539

Yangha Chung , Johan Lim , Xinlei Wang , Soohyun Ahn

引用次数: 0

Self-attention based Difference Long Short-Term Memory Network for Industrial Data-driven Modeling 基于自注意的差分长短期记忆网络用于工业数据驱动建模

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-20 DOI: 10.1016/j.chemolab.2025.105535

Xiaoqing Zheng, Bo Peng, Anke Xue, Ming Ge, Yaguang Kong, Aipeng Jiang

{"title":"Self-attention based Difference Long Short-Term Memory Network for Industrial Data-driven Modeling","authors":"Xiaoqing Zheng, Bo Peng, Anke Xue, Ming Ge, Yaguang Kong, Aipeng Jiang","doi":"10.1016/j.chemolab.2025.105535","DOIUrl":"10.1016/j.chemolab.2025.105535","url":null,"abstract":"<div><div>In modern industry, soft sensors provide real-time predictions of quality variables that are difficult to measure directly with physical sensors. However, in industrial processes, changes in material properties, catalyst deactivation, and other factors often lead to shifts in data distribution. Existing soft sensor models often overlook the impact of these distribution changes on performance. To address the issue of performance degradation due to changes in data distribution, this paper proposes a self-attention based Difference Long Short-Term Memory (SA-DLSTM) network for soft sensor modeling. By employing self-attention, industrial raw data is refined to facilitate the extraction of nonlinear features, thereby reducing the difficulty in modeling. A Difference Channel is designed to perform correlation analysis and select significant features from the raw data, followed by extracting the difference information that can reveal changes in the data distribution. The SA-DLSTM soft sensor model is established and validated on two benchmark industrial datasets: Debutanizer Column and Sulfur Recovery Unit. Comparisons with benchmark models, and state-of-the-art models show that SA-DLSTM achieves the best performance across all evaluation metrics, demonstrating the effectiveness of the proposed model.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105535"},"PeriodicalIF":3.8,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145109706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial neural network-assisted study on thermohydrodynamic behavior of tetrahybrid nanofluids in a porous stretching cylinder 人工神经网络辅助下多孔拉伸圆柱体中四杂化纳米流体热流体动力学行为的研究

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-19 DOI: 10.1016/j.chemolab.2025.105537

Pooja Devi, Bhuvaneshvar Kumar

引用次数: 0

Federated learning with local–global collaboration for predicting acute coronary syndrome 局部-全局联合学习预测急性冠脉综合征

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-18 DOI: 10.1016/j.chemolab.2025.105515

Yonggong Ren , Jia Shang , Meiwei Zhang , Xiaolu Xu , Zhaohong Geng

{"title":"Federated learning with local–global collaboration for predicting acute coronary syndrome","authors":"Yonggong Ren , Jia Shang , Meiwei Zhang , Xiaolu Xu , Zhaohong Geng","doi":"10.1016/j.chemolab.2025.105515","DOIUrl":"10.1016/j.chemolab.2025.105515","url":null,"abstract":"<div><div>Acute Coronary Syndrome (ACS) is a prevalent cardiovascular disease characterized by high incidence and mortality rates. Numerous studies have focused on utilizing artificial intelligence and machine learning algorithms to assess and predict the risk of ACS in patients. However, due to the sensitivity and privacy of medical data, training machine learning models on a centralized server that aggregates ACS data from various institutions poses certain risks. For the first time, this study validates the effectiveness of utilizing federated learning to collaboratively analyze medical data for predicting ACS. A federated learning-based ACS prediction model, i.e., FedLG, which incorporates local–global collaboration for mutual correction, is presented accordingly. On the client side, a regularization term is added to the loss function to reduce deviations caused by heterogeneous data, helping the global model remain accurate and representative. On the server side, gradient normalization is applied to balance contributions from clients with different update frequencies, resulting in a more stable and reliable global model. Comprehensive experiments on the ACS dataset from a tertiary hospital in China show that FedLG consistently outperforms models trained on individual clients, as well as three other federated baselines, across seven evaluation metrics under both IID and non-IID settings. Temporal hold-out validation further indicates that FedLG maintains better generalizability than other baselines. In addition, analysis of feature importance shows that FedLG identifies lipid-related biomarkers, which aligns with clinical knowledge, enhancing the interpretability of the results. The source code of FedLG is freely available at <span><span>https://github.com/bioinformatics-xu/FedLG</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105515"},"PeriodicalIF":3.8,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145119320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Moisture content prediction in durian husk biomass via near infrared spectroscopy coupled with aquaphotomics and explainable machine learning 利用近红外光谱结合水光组学和可解释机器学习预测榴莲果皮生物量中的水分含量

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-18 DOI: 10.1016/j.chemolab.2025.105538

Zenisha Shrestha , Bijendra Shrestha , Panmanas Sirisomboon , Umed Kumar Pun , Tri Ratna Bajracharya , Bim Prasad Shrestha , Pimpen Pornchaloempong

{"title":"Moisture content prediction in durian husk biomass via near infrared spectroscopy coupled with aquaphotomics and explainable machine learning","authors":"Zenisha Shrestha , Bijendra Shrestha , Panmanas Sirisomboon , Umed Kumar Pun , Tri Ratna Bajracharya , Bim Prasad Shrestha , Pimpen Pornchaloempong","doi":"10.1016/j.chemolab.2025.105538","DOIUrl":"10.1016/j.chemolab.2025.105538","url":null,"abstract":"<div><div>Accurate determination of moisture content is essential for energy efficiency and biomass management for fuel materials such as durian husk. Traditional methods of determining biomass moisture content are time-consuming and require specialized expertise, posing challenges for continuous monitoring. To address this limitation, this study applies Near Infrared Spectroscopy (NIRS) combined with machine learning models to rapidly and accurately assess moisture content. Both linear Partial Least Squares Regression (PLSR) and non-linear approaches were used, including Support Vector Machines (SVM), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGB). The application of preprocessing techniques, notably the Savitzky-Golay second derivative (SD) and Standard Normal Variate (SNV), significantly augmented the predictive performance, highlighting the importance of data preprocessing in spectral analysis. Synthetic spectral augmentation using Gaussian noise revealed that while SVM and ANN exhibited near-perfect performance, SVM demonstrated quantifiable reliability. This study also demonstrates SVM as the most sensitive and reliable method for detecting and quantifying moisture content in durian husk. This research contributes novel insights to biomass analysis, highlighting the benefits of integrating NIRS and feasibility of explainable machine learning techniques to identify water related spectral parameters to advance aquaphotomics, thereby advancing rapid and accurate biomass characterization.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105538"},"PeriodicalIF":3.8,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145119319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Application of DebNet combined with fluorescence spectroscopy for rapid multi-pesticide residue classification DebNet结合荧光光谱技术在多种农药残留快速分类中的应用

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-17 DOI: 10.1016/j.chemolab.2025.105540

Libo Deng , Huitian Du , Jing Sun , Hongli Xu , Zhuo Chen , Hangwen Qu , Guangfen Wei , Pingjian Wang , Zhuhui Qiao , Zhonghai Lin

{"title":"Application of DebNet combined with fluorescence spectroscopy for rapid multi-pesticide residue classification","authors":"Libo Deng , Huitian Du , Jing Sun , Hongli Xu , Zhuo Chen , Hangwen Qu , Guangfen Wei , Pingjian Wang , Zhuhui Qiao , Zhonghai Lin","doi":"10.1016/j.chemolab.2025.105540","DOIUrl":"10.1016/j.chemolab.2025.105540","url":null,"abstract":"<div><div>The illegal use of pesticides has led to severe residual pollution, posing serious threats to both human health and the environment. This situation underscores the urgent need for rapid and highly accurate classification methods for multi-pesticide residue detection. Although fluorescence spectroscopy remains a mainstream technique in this field, its classification performance is often limited by spectral overlap and background noise. To address these challenges, this study proposes DebNet, a deep learning model based on one-dimensional fluorescence spectral data. DebNet integrates one-dimensional convolutional neural networks (1D-CNN), long short-term memory (LSTM) networks, and self-attention mechanisms to collaboratively mitigate spectral interference. Experimental results demonstrate that DebNet achieves a classification accuracy of 99.83 % on preprocessed data, with a training time of approximately 5 min. It enables fast and accurate classification of four high-risk pesticides, including cyromazine, captan, metolachlor and thiamethoxam. Overall, the proposed method offers a lightweight and effective solution for real-time monitoring of pesticide residues in agricultural environments. Its robustness under spectral overlap conditions makes it particularly suitable for on-site applications requiring rapid and accurate classification.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105540"},"PeriodicalIF":3.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145119318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Simple methods for uncertainty estimation in neural networks applied to spectral data processing: A case study on mango dry matter prediction 光谱数据处理中神经网络不确定性估计的简单方法——以芒果干物质预测为例

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-16 DOI: 10.1016/j.chemolab.2025.105532

Metz Maxime , Khadija Lamdibih , Jean-Michel Roger , David Esteve , Ryad Bendoula , Florent Abdelghafour

{"title":"Simple methods for uncertainty estimation in neural networks applied to spectral data processing: A case study on mango dry matter prediction","authors":"Metz Maxime , Khadija Lamdibih , Jean-Michel Roger , David Esteve , Ryad Bendoula , Florent Abdelghafour","doi":"10.1016/j.chemolab.2025.105532","DOIUrl":"10.1016/j.chemolab.2025.105532","url":null,"abstract":"<div><div>The growing complexity of real-world chemometric applications, particularly in spectroscopy, has exposed the limitations of traditional linear models in capturing non-linear patterns in spectral data. Deep learning models offer a powerful alternative but remain underutilised in chemometrics due to concerns about interpretability and trust, particularly in high-risk applications where uncertainty estimation is critical. This study investigates and compares three uncertainty estimation techniques suitable for neural networks: Monte Carlo Dropout (MC dropout), model averaging, and Stochastic Weight Averaging-Gaussian (SWAG). These methods are evaluated using a spectral deep learning architecture. The analysis focuses on identifying key hyper-parameters affecting both predictive performance and uncertainty calibration. Results show that while MC Dropout offers a good balance between accuracy and uncertainty estimation at low computational cost, model averaging provides robust performance but at the expense of greater training time and storage. SWAG emerges as a middle-ground method requiring careful tuning. Importantly, a trade-off between predictive accuracy and uncertainty calibration is observed, underscoring the need to consider uncertainty as an integral part of model evaluation. These findings highlight the relevance of deep learning uncertainty estimation in chemometrics and open new directions for optimising data acquisition, model calibration, and model selection based on both prediction confidence and performance.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105532"},"PeriodicalIF":3.8,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145099511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiple features fusion and mixup with conditional decoder for 多特征融合和混合与条件解码器

IF 3.8 2区化学

Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-09-12 DOI: 10.1016/j.chemolab.2025.105534

Youpeng Fan , Yongchun Fang

{"title":"Multiple features fusion and mixup with conditional decoder for","authors":"Youpeng Fan , Yongchun Fang","doi":"10.1016/j.chemolab.2025.105534","DOIUrl":"10.1016/j.chemolab.2025.105534","url":null,"abstract":"<div><div>In recent years, the combination of vibration spectral data and data-driven methods has dominated the development and application of close spectral recognition. Nevertheless, in practical applications, open spectral categories (i.e., novel/unknown spectral categories) may be encountered, as collecting comprehend-sive categories is time-consuming and requires professional expertise. The intuitive solution is to obscure features of different categories, but relevant exploratory experiments yield unsatisfactory open-set performance, which may be attributed to sparse spectral features and high inter-class similarity. To remedy this issue, we innovatively propose an end-to-end scheme combining <strong>M</strong>ultiple <strong>F</strong>eatures <strong>F</strong>usion and <strong>M</strong>ixup with <strong>C</strong>onditional <strong>D</strong>ecoder (MFFMCD) in this paper. In particular, to enhance feature representation, MFFMCD adopts two auxiliary feature extraction modules and fuses different branch features. Additionally, to cope with high inter-class similarity, the enhanced features are obscured within a mini-batch and restored to corresponding class samples through a conditional decoder to mimic the feature distribution of unknown classes. Experiments on three publicly available spectral datasets show that the proposed MFFMCD significantly outperforms existing methods. In the end, extensive ablation studies are conducted to investigate the effectiveness, correctness, and robustness of our proposal.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105534"},"PeriodicalIF":3.8,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145060693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0