用于抗癌药物反应预测的各种机器学习和深度学习模型的综合综述:与现有最先进方法的比较分析

IF 12.1 2区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Davinder Paul Singh, Pawandeep Kour, Tathagat Banerjee, Debabrata Swain
{"title":"用于抗癌药物反应预测的各种机器学习和深度学习模型的综合综述:与现有最先进方法的比较分析","authors":"Davinder Paul Singh,&nbsp;Pawandeep Kour,&nbsp;Tathagat Banerjee,&nbsp;Debabrata Swain","doi":"10.1007/s11831-025-10255-2","DOIUrl":null,"url":null,"abstract":"<div><p>The optimal treatment selection for cancer patients is extensive, and pharmacogenetic prediction is done using genetic cohort, chemical structure, and target information. Though previous studies sought to characterise pharmacological reactions, there were limits in categorization. Due to the development of various solutions, existing feature selection techniques such as statistical combinations suffer from drawbacks such as local optima, lack of heuristics, and so on. This further leads to a low convergence rate which affects the classification rate. To address this, the current study describes a hybrid approach that is based on machine learning and deep learning, as well as a comparison of the localization heuristic-based Harris Hawk intelligence method and Gravitational Optimization methods with Machine Learning (ML) and Deep- Learning (DL) algorithms. The study suggests the use of Conditional Generative Adversarial Network (CGAN) to obtain better feature selection with less volatility in order to improve data quality and minimise intrinsic variation. In this study, the possible associations between cell lines and drugs are deduced using the CCLE- Cancer-Cell Line Encyclopaedia and Genomics of Drug Sensitivity in Cancer- GDSC datasets, and the study proposes a hybrid Bi-Residual Dense Attention Network for cell line categorization. The proposed method shows better prediction performance based on precision, accuracy, F1-score, Area under curve (AUC), Area under the receiver operating characteristic curve (AUROC), specificity and recall. For the GDSC dataset, the BRDAN-HH framework achieved an accuracy of 0.9675, recall of 0.9795, specificity of 0.975, precision of 0.9785, F1-score of 0.9799, AUC of 0.97, and AUROC of 0.9705. Similarly, for the CCLE dataset, it demonstrated robust performance with an accuracy of 0.9655, recall of 0.986094, specificity of 0.975, precision of 0.975, F1-score of 0.986, AUC of 0.966, and AUROC of 0.9758. The results highlight the efficacy of the BRDAN-HH framework in delivering superior classification metrics, making it a valuable tool for analysing large-scale biomedical datasets.</p></div>","PeriodicalId":55473,"journal":{"name":"Archives of Computational Methods in Engineering","volume":"32 6","pages":"3733 - 3757"},"PeriodicalIF":12.1000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comprehensive Review of Various Machine Learning and Deep Learning Models for Anti-Cancer Drug Response Prediction: Comparative Analysis With Existing State of the Art Methods\",\"authors\":\"Davinder Paul Singh,&nbsp;Pawandeep Kour,&nbsp;Tathagat Banerjee,&nbsp;Debabrata Swain\",\"doi\":\"10.1007/s11831-025-10255-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The optimal treatment selection for cancer patients is extensive, and pharmacogenetic prediction is done using genetic cohort, chemical structure, and target information. Though previous studies sought to characterise pharmacological reactions, there were limits in categorization. Due to the development of various solutions, existing feature selection techniques such as statistical combinations suffer from drawbacks such as local optima, lack of heuristics, and so on. This further leads to a low convergence rate which affects the classification rate. To address this, the current study describes a hybrid approach that is based on machine learning and deep learning, as well as a comparison of the localization heuristic-based Harris Hawk intelligence method and Gravitational Optimization methods with Machine Learning (ML) and Deep- Learning (DL) algorithms. The study suggests the use of Conditional Generative Adversarial Network (CGAN) to obtain better feature selection with less volatility in order to improve data quality and minimise intrinsic variation. In this study, the possible associations between cell lines and drugs are deduced using the CCLE- Cancer-Cell Line Encyclopaedia and Genomics of Drug Sensitivity in Cancer- GDSC datasets, and the study proposes a hybrid Bi-Residual Dense Attention Network for cell line categorization. The proposed method shows better prediction performance based on precision, accuracy, F1-score, Area under curve (AUC), Area under the receiver operating characteristic curve (AUROC), specificity and recall. For the GDSC dataset, the BRDAN-HH framework achieved an accuracy of 0.9675, recall of 0.9795, specificity of 0.975, precision of 0.9785, F1-score of 0.9799, AUC of 0.97, and AUROC of 0.9705. Similarly, for the CCLE dataset, it demonstrated robust performance with an accuracy of 0.9655, recall of 0.986094, specificity of 0.975, precision of 0.975, F1-score of 0.986, AUC of 0.966, and AUROC of 0.9758. The results highlight the efficacy of the BRDAN-HH framework in delivering superior classification metrics, making it a valuable tool for analysing large-scale biomedical datasets.</p></div>\",\"PeriodicalId\":55473,\"journal\":{\"name\":\"Archives of Computational Methods in Engineering\",\"volume\":\"32 6\",\"pages\":\"3733 - 3757\"},\"PeriodicalIF\":12.1000,\"publicationDate\":\"2025-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archives of Computational Methods in Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s11831-025-10255-2\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of Computational Methods in Engineering","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s11831-025-10255-2","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

癌症患者的最佳治疗选择是广泛的,药物遗传学预测是利用遗传队列、化学结构和靶点信息来完成的。虽然以前的研究试图描述药理反应的特征,但在分类上存在局限性。由于各种解决方案的发展,现有的特征选择技术(如统计组合)存在局部最优、缺乏启发式等缺点。这进一步导致较低的收敛速度,从而影响分类速度。为了解决这个问题,目前的研究描述了一种基于机器学习和深度学习的混合方法,并将基于定位启发式的Harris Hawk智能方法和重力优化方法与机器学习(ML)和深度学习(DL)算法进行了比较。该研究建议使用条件生成对抗网络(CGAN)来获得更好的特征选择和更少的波动性,以提高数据质量和最小化内在变化。本研究利用CCLE- Cancer- cell Line Encyclopaedia和Genomics of Drug Sensitivity In Cancer- GDSC数据集推导了细胞系和药物之间可能存在的关联,并提出了一种用于细胞系分类的混合bi -残差密集注意网络。该方法在精密度、准确度、f1评分、曲线下面积(AUC)、受试者工作特征曲线下面积(AUROC)、特异性和召回率方面均表现出较好的预测效果。对于GDSC数据集,BRDAN-HH框架的准确率为0.9675,召回率为0.9795,特异性为0.975,精密度为0.9785,f1评分为0.9799,AUC为0.97,AUROC为0.9705。同样,对于CCLE数据集,它表现出稳健的性能,准确率为0.9655,召回率为0.986094,特异性为0.975,精密度为0.975,f1评分为0.986,AUC为0.966,AUROC为0.9758。结果突出了BRDAN-HH框架在提供卓越分类指标方面的功效,使其成为分析大规模生物医学数据集的有价值工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

A Comprehensive Review of Various Machine Learning and Deep Learning Models for Anti-Cancer Drug Response Prediction: Comparative Analysis With Existing State of the Art Methods

A Comprehensive Review of Various Machine Learning and Deep Learning Models for Anti-Cancer Drug Response Prediction: Comparative Analysis With Existing State of the Art Methods

The optimal treatment selection for cancer patients is extensive, and pharmacogenetic prediction is done using genetic cohort, chemical structure, and target information. Though previous studies sought to characterise pharmacological reactions, there were limits in categorization. Due to the development of various solutions, existing feature selection techniques such as statistical combinations suffer from drawbacks such as local optima, lack of heuristics, and so on. This further leads to a low convergence rate which affects the classification rate. To address this, the current study describes a hybrid approach that is based on machine learning and deep learning, as well as a comparison of the localization heuristic-based Harris Hawk intelligence method and Gravitational Optimization methods with Machine Learning (ML) and Deep- Learning (DL) algorithms. The study suggests the use of Conditional Generative Adversarial Network (CGAN) to obtain better feature selection with less volatility in order to improve data quality and minimise intrinsic variation. In this study, the possible associations between cell lines and drugs are deduced using the CCLE- Cancer-Cell Line Encyclopaedia and Genomics of Drug Sensitivity in Cancer- GDSC datasets, and the study proposes a hybrid Bi-Residual Dense Attention Network for cell line categorization. The proposed method shows better prediction performance based on precision, accuracy, F1-score, Area under curve (AUC), Area under the receiver operating characteristic curve (AUROC), specificity and recall. For the GDSC dataset, the BRDAN-HH framework achieved an accuracy of 0.9675, recall of 0.9795, specificity of 0.975, precision of 0.9785, F1-score of 0.9799, AUC of 0.97, and AUROC of 0.9705. Similarly, for the CCLE dataset, it demonstrated robust performance with an accuracy of 0.9655, recall of 0.986094, specificity of 0.975, precision of 0.975, F1-score of 0.986, AUC of 0.966, and AUROC of 0.9758. The results highlight the efficacy of the BRDAN-HH framework in delivering superior classification metrics, making it a valuable tool for analysing large-scale biomedical datasets.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
19.80
自引率
4.10%
发文量
153
审稿时长
>12 weeks
期刊介绍: Archives of Computational Methods in Engineering Aim and Scope: Archives of Computational Methods in Engineering serves as an active forum for disseminating research and advanced practices in computational engineering, particularly focusing on mechanics and related fields. The journal emphasizes extended state-of-the-art reviews in selected areas, a unique feature of its publication. Review Format: Reviews published in the journal offer: A survey of current literature Critical exposition of topics in their full complexity By organizing the information in this manner, readers can quickly grasp the focus, coverage, and unique features of the Archives of Computational Methods in Engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信