Integrating Graph Convolution and Attention Mechanism for Kinase Inhibition Prediction.

IF 4.2 2区 化学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Hamza Zahid, Kil To Chong, Hilal Tayara
{"title":"Integrating Graph Convolution and Attention Mechanism for Kinase Inhibition Prediction.","authors":"Hamza Zahid, Kil To Chong, Hilal Tayara","doi":"10.3390/molecules30132871","DOIUrl":null,"url":null,"abstract":"<p><p>Kinase is an enzyme responsible for cell signaling and other complex processes. Mutations or changes in kinase can cause cancer and other diseases in humans, including leukemia, neuroblastomas, glioblastomas, and more. Considering these concerns, inhibiting overexpressed or dysregulated kinases through small drug molecules is very important. In the past, many machine learning and deep learning approaches have been used to inhibit unregulated kinase enzymes. In this work, we employ a Graph Neural Network (GNN) to predict the inhibition activities of kinases. A separate Graph Convolution Network (GCN) and combined Graph Convolution and Graph Attention Network (GCN_GAT) are developed and trained on two large datasets (Kinase Datasets 1 and 2) consisting of small drug molecules against the targeted kinase using 10-fold cross-validation. Furthermore, a wide range of molecules are used as independent datasets on which the performance of the models is evaluated. On both independent kinase datasets, our model combining GCN and GAT provides the best evaluation and outperforms previous models in terms of accuracy, Matthews Correlation Coefficient (MCC), sensitivity, specificity, and precision. On the independent Kinase Dataset 1, the values of accuracy, MCC, sensitivity, specificity, and precision are 0.96, 0.89, 0.90, 0.98, and 0.91, respectively. Similarly, the performance of our model combining GCN and GAT on the independent Kinase Dataset 2 is 0.97, 0.90, 0.91, 0.99, and 0.92 in terms of accuracy, MCC, sensitivity, specificity, and precision, respectively.</p>","PeriodicalId":19041,"journal":{"name":"Molecules","volume":"30 13","pages":""},"PeriodicalIF":4.2000,"publicationDate":"2025-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12251378/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecules","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.3390/molecules30132871","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Kinase is an enzyme responsible for cell signaling and other complex processes. Mutations or changes in kinase can cause cancer and other diseases in humans, including leukemia, neuroblastomas, glioblastomas, and more. Considering these concerns, inhibiting overexpressed or dysregulated kinases through small drug molecules is very important. In the past, many machine learning and deep learning approaches have been used to inhibit unregulated kinase enzymes. In this work, we employ a Graph Neural Network (GNN) to predict the inhibition activities of kinases. A separate Graph Convolution Network (GCN) and combined Graph Convolution and Graph Attention Network (GCN_GAT) are developed and trained on two large datasets (Kinase Datasets 1 and 2) consisting of small drug molecules against the targeted kinase using 10-fold cross-validation. Furthermore, a wide range of molecules are used as independent datasets on which the performance of the models is evaluated. On both independent kinase datasets, our model combining GCN and GAT provides the best evaluation and outperforms previous models in terms of accuracy, Matthews Correlation Coefficient (MCC), sensitivity, specificity, and precision. On the independent Kinase Dataset 1, the values of accuracy, MCC, sensitivity, specificity, and precision are 0.96, 0.89, 0.90, 0.98, and 0.91, respectively. Similarly, the performance of our model combining GCN and GAT on the independent Kinase Dataset 2 is 0.97, 0.90, 0.91, 0.99, and 0.92 in terms of accuracy, MCC, sensitivity, specificity, and precision, respectively.

结合图卷积和注意机制预测激酶抑制。
激酶是一种负责细胞信号传导和其他复杂过程的酶。激酶的突变或改变会导致人类癌症和其他疾病,包括白血病、神经母细胞瘤、胶质母细胞瘤等。考虑到这些问题,通过小药物分子抑制过表达或失调的激酶是非常重要的。过去,许多机器学习和深度学习方法已被用于抑制不调节的激酶。在这项工作中,我们采用图神经网络(GNN)来预测激酶的抑制活性。一个单独的图卷积网络(GCN)和组合的图卷积和图注意网络(GCN_GAT)被开发出来,并在两个大型数据集(激酶数据集1和2)上进行训练,这些数据集由针对靶向激酶的小药物分子组成,使用10倍交叉验证。此外,广泛的分子被用作独立的数据集来评估模型的性能。在两个独立的激酶数据集上,我们的模型结合GCN和GAT提供了最好的评估,并在准确性,马修斯相关系数(MCC),灵敏度,特异性和精度方面优于先前的模型。在独立的激酶数据集1上,准确性、MCC、敏感性、特异性和精密度分别为0.96、0.89、0.90、0.98和0.91。同样,我们的模型结合GCN和GAT在独立的激酶数据集2上的性能在准确性、MCC、灵敏度、特异性和精度方面分别为0.97、0.90、0.91、0.99和0.92。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecules
Molecules 化学-有机化学
CiteScore
7.40
自引率
8.70%
发文量
7524
审稿时长
1.4 months
期刊介绍: Molecules (ISSN 1420-3049, CODEN: MOLEFW) is an open access journal of synthetic organic chemistry and natural product chemistry. All articles are peer-reviewed and published continously upon acceptance. Molecules is published by MDPI, Basel, Switzerland. Our aim is to encourage chemists to publish as much as possible their experimental detail, particularly synthetic procedures and characterization information. There is no restriction on the length of the experimental section. In addition, availability of compound samples is published and considered as important information. Authors are encouraged to register or deposit their chemical samples through the non-profit international organization Molecular Diversity Preservation International (MDPI). Molecules has been launched in 1996 to preserve and exploit molecular diversity of both, chemical information and chemical substances.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信