{"title":"Quantum hybrid feature selector","authors":"Vadim Lopatkin, Asel Sagingalieva, Luca Lusnig, Tatjana Protasevich, Bernadette Behnke, Alexey Melnikov","doi":"10.1140/epjqt/s40507-026-00491-1","DOIUrl":null,"url":null,"abstract":"<div><p>Unsupervised feature selection is essential for high-dimensional machine learning tasks, as it improves model quality and efficiency while providing interpretable insights into datasets. However, existing methods often struggle to simultaneously achieve robustness, reliability, interpretability, and computational efficiency. Furthermore, feature selection in quantum machine learning remains largely unexplored despite the potential of quantum approaches to capture complex feature interactions. In this work, we propose the Quantum Hybrid Feature Selector, a novel unsupervised feature selection framework that combines quantum-enhanced feature extraction with interpretable scoring algorithms. Our main contributions are threefold: we introduce a quantum hybrid autoencoder pipeline that couples feature extraction with feature ranking in the original input space, we define three scoring mechanisms: SHAP-based scoring, correlation-matrix scoring, and weight-based analysis, that relate latent features to original features without requiring external predictive models, and we provide comprehensive empirical evaluation on synthetic and real-world benchmarks with statistical validation. On Madelon-style synthetic datasets, our quantum SCM achieves up to 23.5% improvement in Mean Informative Rank and 9.7% improvement in Informative Ratio compared to classical alternatives, with all improvements statistically significant at <span>\\(p < 0.001\\)</span>. On the Communities and Crime dataset with injected noise features, both quantum and classical SCM variants achieve near-perfect noise elimination, demonstrating effective denoising capability on real-world data. We also show that expert input is essential for evaluating feature selectors, as different methods emphasize distinct aspects of the data despite similar aggregate metrics. Finally, we evaluate QHFS under realistic hardware noise conditions using device-informed simulations based on IBM superconducting quantum processors, demonstrating graceful performance degradation and systematic improvement with increased shot budgets, which supports the practical viability of our approach on near-term quantum devices.</p></div>","PeriodicalId":547,"journal":{"name":"EPJ Quantum Technology","volume":"13 1","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1140/epjqt/s40507-026-00491-1.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EPJ Quantum Technology","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.1140/epjqt/s40507-026-00491-1","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/4/27 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Unsupervised feature selection is essential for high-dimensional machine learning tasks, as it improves model quality and efficiency while providing interpretable insights into datasets. However, existing methods often struggle to simultaneously achieve robustness, reliability, interpretability, and computational efficiency. Furthermore, feature selection in quantum machine learning remains largely unexplored despite the potential of quantum approaches to capture complex feature interactions. In this work, we propose the Quantum Hybrid Feature Selector, a novel unsupervised feature selection framework that combines quantum-enhanced feature extraction with interpretable scoring algorithms. Our main contributions are threefold: we introduce a quantum hybrid autoencoder pipeline that couples feature extraction with feature ranking in the original input space, we define three scoring mechanisms: SHAP-based scoring, correlation-matrix scoring, and weight-based analysis, that relate latent features to original features without requiring external predictive models, and we provide comprehensive empirical evaluation on synthetic and real-world benchmarks with statistical validation. On Madelon-style synthetic datasets, our quantum SCM achieves up to 23.5% improvement in Mean Informative Rank and 9.7% improvement in Informative Ratio compared to classical alternatives, with all improvements statistically significant at \(p < 0.001\). On the Communities and Crime dataset with injected noise features, both quantum and classical SCM variants achieve near-perfect noise elimination, demonstrating effective denoising capability on real-world data. We also show that expert input is essential for evaluating feature selectors, as different methods emphasize distinct aspects of the data despite similar aggregate metrics. Finally, we evaluate QHFS under realistic hardware noise conditions using device-informed simulations based on IBM superconducting quantum processors, demonstrating graceful performance degradation and systematic improvement with increased shot budgets, which supports the practical viability of our approach on near-term quantum devices.
无监督特征选择对于高维机器学习任务至关重要,因为它可以提高模型质量和效率,同时提供对数据集的可解释见解。然而,现有的方法往往难以同时实现鲁棒性、可靠性、可解释性和计算效率。此外,尽管量子方法具有捕获复杂特征相互作用的潜力,但量子机器学习中的特征选择在很大程度上仍未被探索。在这项工作中,我们提出了量子混合特征选择器,这是一种新的无监督特征选择框架,将量子增强特征提取与可解释评分算法相结合。我们的主要贡献有三个方面:我们引入了一个量子混合自编码器管道,将特征提取与原始输入空间中的特征排序结合在一起;我们定义了三种评分机制:基于shap的评分、相关矩阵评分和基于权重的分析,这些机制将潜在特征与原始特征联系起来,而不需要外部预测模型;我们通过统计验证对合成基准和现实基准进行了全面的经验评估。在madelon风格的合成数据集上,我们的量子SCM达到了23.5% improvement in Mean Informative Rank and 9.7% improvement in Informative Ratio compared to classical alternatives, with all improvements statistically significant at \(p < 0.001\). On the Communities and Crime dataset with injected noise features, both quantum and classical SCM variants achieve near-perfect noise elimination, demonstrating effective denoising capability on real-world data. We also show that expert input is essential for evaluating feature selectors, as different methods emphasize distinct aspects of the data despite similar aggregate metrics. Finally, we evaluate QHFS under realistic hardware noise conditions using device-informed simulations based on IBM superconducting quantum processors, demonstrating graceful performance degradation and systematic improvement with increased shot budgets, which supports the practical viability of our approach on near-term quantum devices.
期刊介绍:
Driven by advances in technology and experimental capability, the last decade has seen the emergence of quantum technology: a new praxis for controlling the quantum world. It is now possible to engineer complex, multi-component systems that merge the once distinct fields of quantum optics and condensed matter physics.
EPJ Quantum Technology covers theoretical and experimental advances in subjects including but not limited to the following:
Quantum measurement, metrology and lithography
Quantum complex systems, networks and cellular automata
Quantum electromechanical systems
Quantum optomechanical systems
Quantum machines, engineering and nanorobotics
Quantum control theory
Quantum information, communication and computation
Quantum thermodynamics
Quantum metamaterials
The effect of Casimir forces on micro- and nano-electromechanical systems
Quantum biology
Quantum sensing
Hybrid quantum systems
Quantum simulations.