M Priyadharshini, B Deevena Raju, A Faritha Banu, P Jagdish Kumar, V Murugesh, Oleg Rybin
{"title":"利用蛋白质组学数据预测多发性骨髓瘤药物敏感性的量子机器学习框架。","authors":"M Priyadharshini, B Deevena Raju, A Faritha Banu, P Jagdish Kumar, V Murugesh, Oleg Rybin","doi":"10.1038/s41598-025-06544-2","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we introduce QProteoML, a new quantum machine learning (QML) framework for predicting drug sensitivity in Multiple Myeloma (MM) using high-dimensional proteomic data. MM, an extremely heterogeneous condition, displays often mixed responses to treatment, with a large number of patients showing drug resistance to proteasome inhibitors and immune modulatory agents. However, the methods previously used for genomic and proteomic data analysis techniques are plagued by issues of high dimensionality, imbalanced class distribution and feature redundancy, which work against the accurate predictability and generalizability of such methods. These are compounded by the so-called \"curse of dimensionality\", with dimensions far outnumbering samples, hence classical model overfitting. In this work, we present QProteoML as an integration of quantum techniques purposefully developed to deal with high-dimensional, imbalanced and redundant data. The framework integrates a combination of Quantum Support Vector Machine (QSVM), Quantum Principal Component Analysis (qPCA), Quantum Annealing (QA) for feature selection and Quantum Generative Adversarial Networks (QGANs) for data augmentation. These quantum algorithms exploit certain quantum phenomena (superposition and entanglement) to perform modelling of nonlinear relationships, dimensionality reduction, and class-imbalance issues. QSVM employs quantum kernels to map data into a higher-dimensional Hilbert space, so that the model can detect complex patterns in MM drug resistance. qPCA reduces dimensionality without loss of important variance, and thus improves computation efficiency. In addition, Quantum Annealing successfully extracts the most informative biomarkers with low redundancy. QProteoML was experimentally tested by comparing accuracy, F1 score and AUC ROC between classical machine learning models such as Support Vector Machine (SVM), Random Forest (RF), Logistic Regression (LR), and K-Nearest Neighbors (KNN). Our results demonstrate that QProteoML performs better than classical models, particularly in identifying the drug resistant minority class of patients. Additionally, the model is interpretable and stresses important biomarkers of drug sensitivity in MM. This research opens the possibility of quantum machine learning in personalised medicine for Multiple Myeloma. It demonstrates that quantum algorithms can perform complex biological data suggesting more reliable and accurate drug sensitivity predictions. Future research will be directed toward clinical validation of the given system with larger and more diverse cohorts of MM patients; the integration of quantum hardware for practical applications.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"26553"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12284003/pdf/","citationCount":"0","resultStr":"{\"title\":\"A quantum machine learning framework for predicting drug sensitivity in multiple myeloma using proteomic data.\",\"authors\":\"M Priyadharshini, B Deevena Raju, A Faritha Banu, P Jagdish Kumar, V Murugesh, Oleg Rybin\",\"doi\":\"10.1038/s41598-025-06544-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In this paper, we introduce QProteoML, a new quantum machine learning (QML) framework for predicting drug sensitivity in Multiple Myeloma (MM) using high-dimensional proteomic data. MM, an extremely heterogeneous condition, displays often mixed responses to treatment, with a large number of patients showing drug resistance to proteasome inhibitors and immune modulatory agents. However, the methods previously used for genomic and proteomic data analysis techniques are plagued by issues of high dimensionality, imbalanced class distribution and feature redundancy, which work against the accurate predictability and generalizability of such methods. These are compounded by the so-called \\\"curse of dimensionality\\\", with dimensions far outnumbering samples, hence classical model overfitting. In this work, we present QProteoML as an integration of quantum techniques purposefully developed to deal with high-dimensional, imbalanced and redundant data. The framework integrates a combination of Quantum Support Vector Machine (QSVM), Quantum Principal Component Analysis (qPCA), Quantum Annealing (QA) for feature selection and Quantum Generative Adversarial Networks (QGANs) for data augmentation. These quantum algorithms exploit certain quantum phenomena (superposition and entanglement) to perform modelling of nonlinear relationships, dimensionality reduction, and class-imbalance issues. QSVM employs quantum kernels to map data into a higher-dimensional Hilbert space, so that the model can detect complex patterns in MM drug resistance. qPCA reduces dimensionality without loss of important variance, and thus improves computation efficiency. In addition, Quantum Annealing successfully extracts the most informative biomarkers with low redundancy. QProteoML was experimentally tested by comparing accuracy, F1 score and AUC ROC between classical machine learning models such as Support Vector Machine (SVM), Random Forest (RF), Logistic Regression (LR), and K-Nearest Neighbors (KNN). Our results demonstrate that QProteoML performs better than classical models, particularly in identifying the drug resistant minority class of patients. Additionally, the model is interpretable and stresses important biomarkers of drug sensitivity in MM. This research opens the possibility of quantum machine learning in personalised medicine for Multiple Myeloma. It demonstrates that quantum algorithms can perform complex biological data suggesting more reliable and accurate drug sensitivity predictions. Future research will be directed toward clinical validation of the given system with larger and more diverse cohorts of MM patients; the integration of quantum hardware for practical applications.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"26553\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12284003/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-06544-2\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-06544-2","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A quantum machine learning framework for predicting drug sensitivity in multiple myeloma using proteomic data.
In this paper, we introduce QProteoML, a new quantum machine learning (QML) framework for predicting drug sensitivity in Multiple Myeloma (MM) using high-dimensional proteomic data. MM, an extremely heterogeneous condition, displays often mixed responses to treatment, with a large number of patients showing drug resistance to proteasome inhibitors and immune modulatory agents. However, the methods previously used for genomic and proteomic data analysis techniques are plagued by issues of high dimensionality, imbalanced class distribution and feature redundancy, which work against the accurate predictability and generalizability of such methods. These are compounded by the so-called "curse of dimensionality", with dimensions far outnumbering samples, hence classical model overfitting. In this work, we present QProteoML as an integration of quantum techniques purposefully developed to deal with high-dimensional, imbalanced and redundant data. The framework integrates a combination of Quantum Support Vector Machine (QSVM), Quantum Principal Component Analysis (qPCA), Quantum Annealing (QA) for feature selection and Quantum Generative Adversarial Networks (QGANs) for data augmentation. These quantum algorithms exploit certain quantum phenomena (superposition and entanglement) to perform modelling of nonlinear relationships, dimensionality reduction, and class-imbalance issues. QSVM employs quantum kernels to map data into a higher-dimensional Hilbert space, so that the model can detect complex patterns in MM drug resistance. qPCA reduces dimensionality without loss of important variance, and thus improves computation efficiency. In addition, Quantum Annealing successfully extracts the most informative biomarkers with low redundancy. QProteoML was experimentally tested by comparing accuracy, F1 score and AUC ROC between classical machine learning models such as Support Vector Machine (SVM), Random Forest (RF), Logistic Regression (LR), and K-Nearest Neighbors (KNN). Our results demonstrate that QProteoML performs better than classical models, particularly in identifying the drug resistant minority class of patients. Additionally, the model is interpretable and stresses important biomarkers of drug sensitivity in MM. This research opens the possibility of quantum machine learning in personalised medicine for Multiple Myeloma. It demonstrates that quantum algorithms can perform complex biological data suggesting more reliable and accurate drug sensitivity predictions. Future research will be directed toward clinical validation of the given system with larger and more diverse cohorts of MM patients; the integration of quantum hardware for practical applications.
期刊介绍:
We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections.
Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021).
•Engineering
Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live.
•Physical sciences
Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics.
•Earth and environmental sciences
Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems.
•Biological sciences
Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants.
•Health sciences
The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.