Mateus P. Schneider , Cristina Malegori , Adriano de A. Gomes , Paolo Oliveri
{"title":"Enhancing one-class classification performance through variable selection: A review based on advanced literature search approaches","authors":"Mateus P. Schneider , Cristina Malegori , Adriano de A. Gomes , Paolo Oliveri","doi":"10.1016/j.chemolab.2025.105491","DOIUrl":null,"url":null,"abstract":"<div><div>Variable selection is a key step in improving One-Class Classification (OCC), especially when applied to high-dimensional datasets common in chemometrics and anomaly detection tasks. This systematic literature review explores how different strategies—filter, wrapper, embedded, and hybrid methods—have been employed to enhance OCC models' accuracy, interpretability, and robustness. A comprehensive search was conducted using Scopus, complemented by AI-powered tools such as Elicit and Litmaps, and visual analytics platforms including VOSviewer and Bibliometrix. The review highlights methodological trends across both chemometric and machine learning domains, revealing a predominance of embedded approaches and a growing interest in hybrid strategies. Embedded methods, particularly LASSO, Elastic Net, and autoencoder-based architectures, were favored for their scalability and model integration. Approximately 69 % of the reviewed studies adopted a rigorous OCC approach—relying solely on target class data—demonstrating a preference for bias-resistant modeling. Additionally, bibliometric analysis revealed a disciplinary division, with chemometric studies emphasizing analytical applications and model interpretability, while computer science-driven studies prioritized automation and scalability. The findings emphasize the need for flexible, domain-adapted variable selection pipelines capable of handling class imbalance and high dimensionality. This work also introduces a reproducible framework combining traditional and AI-assisted literature review tools to support future systematic analyses. The review concludes by identifying emerging trends and suggesting future research directions in OCC and variable selection, with a focus on hybrid modeling, domain adaptability, and performance benchmarking across application fields.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"265 ","pages":"Article 105491"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743925001765","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Variable selection is a key step in improving One-Class Classification (OCC), especially when applied to high-dimensional datasets common in chemometrics and anomaly detection tasks. This systematic literature review explores how different strategies—filter, wrapper, embedded, and hybrid methods—have been employed to enhance OCC models' accuracy, interpretability, and robustness. A comprehensive search was conducted using Scopus, complemented by AI-powered tools such as Elicit and Litmaps, and visual analytics platforms including VOSviewer and Bibliometrix. The review highlights methodological trends across both chemometric and machine learning domains, revealing a predominance of embedded approaches and a growing interest in hybrid strategies. Embedded methods, particularly LASSO, Elastic Net, and autoencoder-based architectures, were favored for their scalability and model integration. Approximately 69 % of the reviewed studies adopted a rigorous OCC approach—relying solely on target class data—demonstrating a preference for bias-resistant modeling. Additionally, bibliometric analysis revealed a disciplinary division, with chemometric studies emphasizing analytical applications and model interpretability, while computer science-driven studies prioritized automation and scalability. The findings emphasize the need for flexible, domain-adapted variable selection pipelines capable of handling class imbalance and high dimensionality. This work also introduces a reproducible framework combining traditional and AI-assisted literature review tools to support future systematic analyses. The review concludes by identifying emerging trends and suggesting future research directions in OCC and variable selection, with a focus on hybrid modeling, domain adaptability, and performance benchmarking across application fields.
期刊介绍:
Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines.
Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data.
The journal deals with the following topics:
1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.)
2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered.
3) Development of new software that provides novel tools or truly advances the use of chemometrical methods.
4) Well characterized data sets to test performance for the new methods and software.
The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.