灵芝中潜在抗肿瘤成分的预测:利用机器学习和分子对接的综合方法

IF 3.7 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS
Qi Yang , Lihao Yao , Fang Jia , Guiyuan Pang , Meiyu Huang , Chengxiang Liu , Hua Luo , Lili Fan
{"title":"灵芝中潜在抗肿瘤成分的预测:利用机器学习和分子对接的综合方法","authors":"Qi Yang ,&nbsp;Lihao Yao ,&nbsp;Fang Jia ,&nbsp;Guiyuan Pang ,&nbsp;Meiyu Huang ,&nbsp;Chengxiang Liu ,&nbsp;Hua Luo ,&nbsp;Lili Fan","doi":"10.1016/j.chemolab.2024.105271","DOIUrl":null,"url":null,"abstract":"<div><div>The objective of this study is to develop a reliable predictive model for antitumour activity and to identify potential antitumour components in <em>Ganoderma lucidum</em>. Four machine learning models, including Random Forest, were employed to train predictive models for antitumour activity, utilising Morgan fingerprints as molecular descriptors. The most effective model was then employed to predict the chemical composition of <em>Ganoderma lucidum</em>, identifying the four most probable compounds for molecular docking with known TNF-α-related targets. The findings of the study indicate that a Support Vector Machine (SVM) model exhibits an accuracy, F1 score, AUC, and sensitivity of 0.7638, 0.7638, 0.8332, and 0.7621, respectively. The model demonstrated an 80 % accuracy rate in predicting the antitumour activity of 10 FDA-approved drugs. Besides, the model identified 11 components in <em>Ganoderma lucidum</em>, including 3-nitroanisole, with a probability of antitumour activity exceeding 0.5, indicating their potential as antitumour agents. The results of the molecular docking procedure indicated that the four most promising antitumour compounds derived from <em>Ganoderma lucidum</em> exhibited a favourable binding affinity with the TNF-α target. In conclusion, this study incorporated a machine learning prediction step prior to molecular docking, thereby enhancing the reliability of the latter. Furthermore, it identified previously unreported compounds in <em>Ganoderma lucidum</em> with potential antitumour activity, such as 3-nitroanisole.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"255 ","pages":"Article 105271"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of potential antitumor components in Ganoderma lucidum: A combined approach using machine learning and molecular docking\",\"authors\":\"Qi Yang ,&nbsp;Lihao Yao ,&nbsp;Fang Jia ,&nbsp;Guiyuan Pang ,&nbsp;Meiyu Huang ,&nbsp;Chengxiang Liu ,&nbsp;Hua Luo ,&nbsp;Lili Fan\",\"doi\":\"10.1016/j.chemolab.2024.105271\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The objective of this study is to develop a reliable predictive model for antitumour activity and to identify potential antitumour components in <em>Ganoderma lucidum</em>. Four machine learning models, including Random Forest, were employed to train predictive models for antitumour activity, utilising Morgan fingerprints as molecular descriptors. The most effective model was then employed to predict the chemical composition of <em>Ganoderma lucidum</em>, identifying the four most probable compounds for molecular docking with known TNF-α-related targets. The findings of the study indicate that a Support Vector Machine (SVM) model exhibits an accuracy, F1 score, AUC, and sensitivity of 0.7638, 0.7638, 0.8332, and 0.7621, respectively. The model demonstrated an 80 % accuracy rate in predicting the antitumour activity of 10 FDA-approved drugs. Besides, the model identified 11 components in <em>Ganoderma lucidum</em>, including 3-nitroanisole, with a probability of antitumour activity exceeding 0.5, indicating their potential as antitumour agents. The results of the molecular docking procedure indicated that the four most promising antitumour compounds derived from <em>Ganoderma lucidum</em> exhibited a favourable binding affinity with the TNF-α target. In conclusion, this study incorporated a machine learning prediction step prior to molecular docking, thereby enhancing the reliability of the latter. Furthermore, it identified previously unreported compounds in <em>Ganoderma lucidum</em> with potential antitumour activity, such as 3-nitroanisole.</div></div>\",\"PeriodicalId\":9774,\"journal\":{\"name\":\"Chemometrics and Intelligent Laboratory Systems\",\"volume\":\"255 \",\"pages\":\"Article 105271\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemometrics and Intelligent Laboratory Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169743924002119\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743924002119","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

本研究的目的是开发一种可靠的抗肿瘤活性预测模型,并确定灵芝中潜在的抗肿瘤成分。利用摩根指纹作为分子描述符,采用包括随机森林在内的四种机器学习模型来训练抗肿瘤活性预测模型。然后利用最有效的模型预测灵芝的化学成分,确定了四种最有可能与已知 TNF-α 相关靶点进行分子对接的化合物。研究结果表明,支持向量机(SVM)模型的准确度、F1得分、AUC和灵敏度分别为0.7638、0.7638、0.8332和0.7621。该模型在预测美国 FDA 批准的 10 种药物的抗肿瘤活性方面显示出 80% 的准确率。此外,该模型还发现灵芝中包括3-硝基苯甲醚在内的11种成分的抗肿瘤活性概率超过0.5,表明它们具有作为抗肿瘤药物的潜力。分子对接程序的结果表明,从灵芝中提取的四种最有潜力的抗肿瘤化合物与 TNF-α 靶点具有良好的结合亲和力。总之,本研究在分子对接之前加入了机器学习预测步骤,从而提高了后者的可靠性。此外,它还发现了灵芝中以前未报道过的具有潜在抗肿瘤活性的化合物,如3-硝基苯甲醚。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Prediction of potential antitumor components in Ganoderma lucidum: A combined approach using machine learning and molecular docking
The objective of this study is to develop a reliable predictive model for antitumour activity and to identify potential antitumour components in Ganoderma lucidum. Four machine learning models, including Random Forest, were employed to train predictive models for antitumour activity, utilising Morgan fingerprints as molecular descriptors. The most effective model was then employed to predict the chemical composition of Ganoderma lucidum, identifying the four most probable compounds for molecular docking with known TNF-α-related targets. The findings of the study indicate that a Support Vector Machine (SVM) model exhibits an accuracy, F1 score, AUC, and sensitivity of 0.7638, 0.7638, 0.8332, and 0.7621, respectively. The model demonstrated an 80 % accuracy rate in predicting the antitumour activity of 10 FDA-approved drugs. Besides, the model identified 11 components in Ganoderma lucidum, including 3-nitroanisole, with a probability of antitumour activity exceeding 0.5, indicating their potential as antitumour agents. The results of the molecular docking procedure indicated that the four most promising antitumour compounds derived from Ganoderma lucidum exhibited a favourable binding affinity with the TNF-α target. In conclusion, this study incorporated a machine learning prediction step prior to molecular docking, thereby enhancing the reliability of the latter. Furthermore, it identified previously unreported compounds in Ganoderma lucidum with potential antitumour activity, such as 3-nitroanisole.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.50
自引率
7.70%
发文量
169
审稿时长
3.4 months
期刊介绍: Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines. Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data. The journal deals with the following topics: 1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.) 2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered. 3) Development of new software that provides novel tools or truly advances the use of chemometrical methods. 4) Well characterized data sets to test performance for the new methods and software. The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信