A Comprehensive Review of Various Machine Learning and Deep Learning Models for Anti-Cancer Drug Response Prediction: Comparative Analysis With Existing State of the Art Methods
IF 12.1 2区 工程技术Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Davinder Paul Singh, Pawandeep Kour, Tathagat Banerjee, Debabrata Swain
{"title":"A Comprehensive Review of Various Machine Learning and Deep Learning Models for Anti-Cancer Drug Response Prediction: Comparative Analysis With Existing State of the Art Methods","authors":"Davinder Paul Singh, Pawandeep Kour, Tathagat Banerjee, Debabrata Swain","doi":"10.1007/s11831-025-10255-2","DOIUrl":null,"url":null,"abstract":"<div><p>The optimal treatment selection for cancer patients is extensive, and pharmacogenetic prediction is done using genetic cohort, chemical structure, and target information. Though previous studies sought to characterise pharmacological reactions, there were limits in categorization. Due to the development of various solutions, existing feature selection techniques such as statistical combinations suffer from drawbacks such as local optima, lack of heuristics, and so on. This further leads to a low convergence rate which affects the classification rate. To address this, the current study describes a hybrid approach that is based on machine learning and deep learning, as well as a comparison of the localization heuristic-based Harris Hawk intelligence method and Gravitational Optimization methods with Machine Learning (ML) and Deep- Learning (DL) algorithms. The study suggests the use of Conditional Generative Adversarial Network (CGAN) to obtain better feature selection with less volatility in order to improve data quality and minimise intrinsic variation. In this study, the possible associations between cell lines and drugs are deduced using the CCLE- Cancer-Cell Line Encyclopaedia and Genomics of Drug Sensitivity in Cancer- GDSC datasets, and the study proposes a hybrid Bi-Residual Dense Attention Network for cell line categorization. The proposed method shows better prediction performance based on precision, accuracy, F1-score, Area under curve (AUC), Area under the receiver operating characteristic curve (AUROC), specificity and recall. For the GDSC dataset, the BRDAN-HH framework achieved an accuracy of 0.9675, recall of 0.9795, specificity of 0.975, precision of 0.9785, F1-score of 0.9799, AUC of 0.97, and AUROC of 0.9705. Similarly, for the CCLE dataset, it demonstrated robust performance with an accuracy of 0.9655, recall of 0.986094, specificity of 0.975, precision of 0.975, F1-score of 0.986, AUC of 0.966, and AUROC of 0.9758. The results highlight the efficacy of the BRDAN-HH framework in delivering superior classification metrics, making it a valuable tool for analysing large-scale biomedical datasets.</p></div>","PeriodicalId":55473,"journal":{"name":"Archives of Computational Methods in Engineering","volume":"32 6","pages":"3733 - 3757"},"PeriodicalIF":12.1000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Computational Methods in Engineering","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s11831-025-10255-2","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The optimal treatment selection for cancer patients is extensive, and pharmacogenetic prediction is done using genetic cohort, chemical structure, and target information. Though previous studies sought to characterise pharmacological reactions, there were limits in categorization. Due to the development of various solutions, existing feature selection techniques such as statistical combinations suffer from drawbacks such as local optima, lack of heuristics, and so on. This further leads to a low convergence rate which affects the classification rate. To address this, the current study describes a hybrid approach that is based on machine learning and deep learning, as well as a comparison of the localization heuristic-based Harris Hawk intelligence method and Gravitational Optimization methods with Machine Learning (ML) and Deep- Learning (DL) algorithms. The study suggests the use of Conditional Generative Adversarial Network (CGAN) to obtain better feature selection with less volatility in order to improve data quality and minimise intrinsic variation. In this study, the possible associations between cell lines and drugs are deduced using the CCLE- Cancer-Cell Line Encyclopaedia and Genomics of Drug Sensitivity in Cancer- GDSC datasets, and the study proposes a hybrid Bi-Residual Dense Attention Network for cell line categorization. The proposed method shows better prediction performance based on precision, accuracy, F1-score, Area under curve (AUC), Area under the receiver operating characteristic curve (AUROC), specificity and recall. For the GDSC dataset, the BRDAN-HH framework achieved an accuracy of 0.9675, recall of 0.9795, specificity of 0.975, precision of 0.9785, F1-score of 0.9799, AUC of 0.97, and AUROC of 0.9705. Similarly, for the CCLE dataset, it demonstrated robust performance with an accuracy of 0.9655, recall of 0.986094, specificity of 0.975, precision of 0.975, F1-score of 0.986, AUC of 0.966, and AUROC of 0.9758. The results highlight the efficacy of the BRDAN-HH framework in delivering superior classification metrics, making it a valuable tool for analysing large-scale biomedical datasets.
癌症患者的最佳治疗选择是广泛的,药物遗传学预测是利用遗传队列、化学结构和靶点信息来完成的。虽然以前的研究试图描述药理反应的特征,但在分类上存在局限性。由于各种解决方案的发展,现有的特征选择技术(如统计组合)存在局部最优、缺乏启发式等缺点。这进一步导致较低的收敛速度,从而影响分类速度。为了解决这个问题,目前的研究描述了一种基于机器学习和深度学习的混合方法,并将基于定位启发式的Harris Hawk智能方法和重力优化方法与机器学习(ML)和深度学习(DL)算法进行了比较。该研究建议使用条件生成对抗网络(CGAN)来获得更好的特征选择和更少的波动性,以提高数据质量和最小化内在变化。本研究利用CCLE- Cancer- cell Line Encyclopaedia和Genomics of Drug Sensitivity In Cancer- GDSC数据集推导了细胞系和药物之间可能存在的关联,并提出了一种用于细胞系分类的混合bi -残差密集注意网络。该方法在精密度、准确度、f1评分、曲线下面积(AUC)、受试者工作特征曲线下面积(AUROC)、特异性和召回率方面均表现出较好的预测效果。对于GDSC数据集,BRDAN-HH框架的准确率为0.9675,召回率为0.9795,特异性为0.975,精密度为0.9785,f1评分为0.9799,AUC为0.97,AUROC为0.9705。同样,对于CCLE数据集,它表现出稳健的性能,准确率为0.9655,召回率为0.986094,特异性为0.975,精密度为0.975,f1评分为0.986,AUC为0.966,AUROC为0.9758。结果突出了BRDAN-HH框架在提供卓越分类指标方面的功效,使其成为分析大规模生物医学数据集的有价值工具。
期刊介绍:
Archives of Computational Methods in Engineering
Aim and Scope:
Archives of Computational Methods in Engineering serves as an active forum for disseminating research and advanced practices in computational engineering, particularly focusing on mechanics and related fields. The journal emphasizes extended state-of-the-art reviews in selected areas, a unique feature of its publication.
Review Format:
Reviews published in the journal offer:
A survey of current literature
Critical exposition of topics in their full complexity
By organizing the information in this manner, readers can quickly grasp the focus, coverage, and unique features of the Archives of Computational Methods in Engineering.