mTOR抑制的集成计算机建模：从脊分类器到无描述符的深度神经网络

Computer methods and programs in biomedicine update Pub Date : 2025-01-01 DOI:10.1016/j.cmpbup.2025.100208

Seyed Alireza Khanghahi , Hadi Kamkar , Seyedehsamaneh Shojaeilangari , Abdollah Allahverdi , Parviz Abdolmaleki

{"title":"mTOR抑制的集成计算机建模：从脊分类器到无描述符的深度神经网络","authors":"Seyed Alireza Khanghahi , Hadi Kamkar , Seyedehsamaneh Shojaeilangari , Abdollah Allahverdi , Parviz Abdolmaleki","doi":"10.1016/j.cmpbup.2025.100208","DOIUrl":null,"url":null,"abstract":"<div><div>Inhibiting the mammalian Target of Rapamycin (mTOR) represents a promising strategy in cancer therapy due to its crucial role in cell growth, survival, and metabolism. Using a variety of quantitative structure-activity relationship (QSAR) models, we present a comprehensive comparison of deep learning (DL) and classical machine learning (ML) techniques for modeling mTOR inhibitor activity. Unlike prior studies that focused on specific algorithms or limited descriptors, we benchmark a wide range of models, from traditional models like Random Forest, logistic regression, and SVM to modern algorithms like CNNs, GRUs, and LSTMs, on both descriptor-based features obtained by Dragon and descriptor-free inputs, including raw SMILES string, and Morgan fingerprints. This comprehensive analysis provides a robust foundation for using the best QSAR models specific to mTOR inhibition. Our findings revealed that while the Random Forest classifier achieved the highest accuracy among all models (0.9290 accuracy, 0.8940 F1-score, 0.9737 AUC), DL methods also demonstrated strong predictive capabilities, with nearly all models attaining an accuracy above 0.90. Among the DL models, CNN-QSAR using Morgan fingerprints achieved the highest accuracy (0.9271), F1-score (0.8950), and AUC (0.9696), demonstrating its effectiveness in capturing structural characteristics. The GRU-QSAR and LSTM-QSAR models, which utilized tokenized SMILES, achieved accuracies of 0.9002 and 0.9021, F1-scores of 0.8595 and 0.8603, and AUCs of 0.9270 and 0.9529, respectively, leveraging their ability to process sequential data.</div></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":"8 ","pages":"Article 100208"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrative in Silico modeling for mTOR inhibition: From ridge classifiers to descriptor-free deep neural networks\",\"authors\":\"Seyed Alireza Khanghahi , Hadi Kamkar , Seyedehsamaneh Shojaeilangari , Abdollah Allahverdi , Parviz Abdolmaleki\",\"doi\":\"10.1016/j.cmpbup.2025.100208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Inhibiting the mammalian Target of Rapamycin (mTOR) represents a promising strategy in cancer therapy due to its crucial role in cell growth, survival, and metabolism. Using a variety of quantitative structure-activity relationship (QSAR) models, we present a comprehensive comparison of deep learning (DL) and classical machine learning (ML) techniques for modeling mTOR inhibitor activity. Unlike prior studies that focused on specific algorithms or limited descriptors, we benchmark a wide range of models, from traditional models like Random Forest, logistic regression, and SVM to modern algorithms like CNNs, GRUs, and LSTMs, on both descriptor-based features obtained by Dragon and descriptor-free inputs, including raw SMILES string, and Morgan fingerprints. This comprehensive analysis provides a robust foundation for using the best QSAR models specific to mTOR inhibition. Our findings revealed that while the Random Forest classifier achieved the highest accuracy among all models (0.9290 accuracy, 0.8940 F1-score, 0.9737 AUC), DL methods also demonstrated strong predictive capabilities, with nearly all models attaining an accuracy above 0.90. Among the DL models, CNN-QSAR using Morgan fingerprints achieved the highest accuracy (0.9271), F1-score (0.8950), and AUC (0.9696), demonstrating its effectiveness in capturing structural characteristics. The GRU-QSAR and LSTM-QSAR models, which utilized tokenized SMILES, achieved accuracies of 0.9002 and 0.9021, F1-scores of 0.8595 and 0.8603, and AUCs of 0.9270 and 0.9529, respectively, leveraging their ability to process sequential data.</div></div>\",\"PeriodicalId\":72670,\"journal\":{\"name\":\"Computer methods and programs in biomedicine update\",\"volume\":\"8 \",\"pages\":\"Article 100208\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine update\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666990025000333\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine update","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666990025000333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

抑制哺乳动物雷帕霉素靶点（mTOR）是一种很有前景的癌症治疗策略，因为它在细胞生长、存活和代谢中起着至关重要的作用。利用各种定量结构-活性关系（QSAR）模型，我们全面比较了深度学习（DL）和经典机器学习（ML）技术对mTOR抑制剂活性的建模。与之前专注于特定算法或有限描述符的研究不同，我们对广泛的模型进行了基准测试，从随机森林、逻辑回归和SVM等传统模型到cnn、gru和LSTMs等现代算法，包括Dragon获得的基于描述符的特征和无描述符的输入，包括原始SMILES字符串和Morgan指纹。这一综合分析为使用mTOR抑制的最佳QSAR模型提供了坚实的基础。我们的研究结果表明，虽然随机森林分类器在所有模型中准确率最高（0.9290准确率，0.8940 f1分数，0.9737 AUC），但DL方法也表现出很强的预测能力，几乎所有模型的准确率都在0.90以上。在深度学习模型中，采用Morgan指纹的CNN-QSAR准确率最高（0.9271），f1得分最高（0.8950），AUC最高（0.9696），显示了其捕获结构特征的有效性。使用标记化SMILES的GRU-QSAR和LSTM-QSAR模型，利用其处理序列数据的能力，准确率分别为0.9002和0.9021，f1得分分别为0.8595和0.8603，auc分别为0.9270和0.9529。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Integrative in Silico modeling for mTOR inhibition: From ridge classifiers to descriptor-free deep neural networks

Inhibiting the mammalian Target of Rapamycin (mTOR) represents a promising strategy in cancer therapy due to its crucial role in cell growth, survival, and metabolism. Using a variety of quantitative structure-activity relationship (QSAR) models, we present a comprehensive comparison of deep learning (DL) and classical machine learning (ML) techniques for modeling mTOR inhibitor activity. Unlike prior studies that focused on specific algorithms or limited descriptors, we benchmark a wide range of models, from traditional models like Random Forest, logistic regression, and SVM to modern algorithms like CNNs, GRUs, and LSTMs, on both descriptor-based features obtained by Dragon and descriptor-free inputs, including raw SMILES string, and Morgan fingerprints. This comprehensive analysis provides a robust foundation for using the best QSAR models specific to mTOR inhibition. Our findings revealed that while the Random Forest classifier achieved the highest accuracy among all models (0.9290 accuracy, 0.8940 F1-score, 0.9737 AUC), DL methods also demonstrated strong predictive capabilities, with nearly all models attaining an accuracy above 0.90. Among the DL models, CNN-QSAR using Morgan fingerprints achieved the highest accuracy (0.9271), F1-score (0.8950), and AUC (0.9696), demonstrating its effectiveness in capturing structural characteristics. The GRU-QSAR and LSTM-QSAR models, which utilized tokenized SMILES, achieved accuracies of 0.9002 and 0.9021, F1-scores of 0.8595 and 0.8603, and AUCs of 0.9270 and 0.9529, respectively, leveraging their ability to process sequential data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer methods and programs in biomedicine update Health Informatics

CiteScore

5.90

自引率

0.00%

发文量

审稿时长

10 weeks