Generating novel molecule for target protein (SARS-CoV-2) using drug-target interaction based on graph neural network.

IF 2 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Amit Ranjan, Shivansh Shukla, Deepanjan Datta, Rajiv Misra
{"title":"Generating novel molecule for target protein (SARS-CoV-2) using drug-target interaction based on graph neural network.","authors":"Amit Ranjan,&nbsp;Shivansh Shukla,&nbsp;Deepanjan Datta,&nbsp;Rajiv Misra","doi":"10.1007/s13721-021-00351-1","DOIUrl":null,"url":null,"abstract":"<p><p>The transmittable spread of viral coronavirus (SARS-CoV-2) has resulted in a significant rise in global mortality. Due to lack of effective treatment, our aim is to generate a highly potent active molecule that can bind with the protein structure of SARS-CoV-2. Different machine learning and deep learning approaches have been proposed for molecule generation; however, most of these approaches represent the drug molecule and protein structure in 1D sequence, ignoring the fact that molecules are by nature in 3D structure, and because of this many critical properties are lost. In this work, a framework is proposed that takes account of both tertiary and sequential representations of molecules and proteins using Gated Graph Neural Network (GGNN), Knowledge graph, and Early Fusion approach. The generated molecules from GGNN are screened using Knowledge Graph to reduce the search space by discarding the non-binding molecules before being fed into the Early Fusion model. Further, the binding affinity score of the generated molecule is predicted using the early fusion approach. Experimental result shows that our framework generates valid and unique molecules with high accuracy while preserving the chemical properties. The use of a knowledge graph claims that the entire generated dataset of molecules was reduced by roughly 96% while retaining more than 85% of good binding desirable molecules and the rejection of more than 99% of fruitless molecules. Additionally, the framework was tested with two of the SARS-CoV-2 viral proteins: RNA-dependent-RNA polymerase (RdRp) and 3C-like protease (3CLpro).</p>","PeriodicalId":44876,"journal":{"name":"Network Modeling and Analysis in Health Informatics and Bioinformatics","volume":" ","pages":"6"},"PeriodicalIF":2.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8683294/pdf/","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Network Modeling and Analysis in Health Informatics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13721-021-00351-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/12/18 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 6

Abstract

The transmittable spread of viral coronavirus (SARS-CoV-2) has resulted in a significant rise in global mortality. Due to lack of effective treatment, our aim is to generate a highly potent active molecule that can bind with the protein structure of SARS-CoV-2. Different machine learning and deep learning approaches have been proposed for molecule generation; however, most of these approaches represent the drug molecule and protein structure in 1D sequence, ignoring the fact that molecules are by nature in 3D structure, and because of this many critical properties are lost. In this work, a framework is proposed that takes account of both tertiary and sequential representations of molecules and proteins using Gated Graph Neural Network (GGNN), Knowledge graph, and Early Fusion approach. The generated molecules from GGNN are screened using Knowledge Graph to reduce the search space by discarding the non-binding molecules before being fed into the Early Fusion model. Further, the binding affinity score of the generated molecule is predicted using the early fusion approach. Experimental result shows that our framework generates valid and unique molecules with high accuracy while preserving the chemical properties. The use of a knowledge graph claims that the entire generated dataset of molecules was reduced by roughly 96% while retaining more than 85% of good binding desirable molecules and the rejection of more than 99% of fruitless molecules. Additionally, the framework was tested with two of the SARS-CoV-2 viral proteins: RNA-dependent-RNA polymerase (RdRp) and 3C-like protease (3CLpro).

Abstract Image

Abstract Image

Abstract Image

基于图神经网络的药物-靶标相互作用生成靶蛋白(SARS-CoV-2)新分子
冠状病毒(SARS-CoV-2)的可传播传播已导致全球死亡率大幅上升。由于缺乏有效的治疗方法,我们的目标是产生一种能与SARS-CoV-2蛋白结构结合的高效活性分子。不同的机器学习和深度学习方法已经被提出用于分子生成;然而,这些方法大多以一维序列表示药物分子和蛋白质结构,忽略了分子本质上是三维结构的事实,因此失去了许多关键性质。在这项工作中,提出了一个框架,该框架使用门控图神经网络(GGNN)、知识图和早期融合方法考虑了分子和蛋白质的三级和顺序表示。利用知识图谱对GGNN生成的分子进行筛选,通过丢弃非结合分子来减少搜索空间,然后将其输入Early Fusion模型。此外,使用早期融合方法预测生成的分子的结合亲和力评分。实验结果表明,我们的框架在保持化学性质的同时,能以较高的精度生成有效的、独特的分子。知识图的使用声称整个生成的分子数据集减少了大约96%,同时保留了85%以上的良好结合的理想分子,并拒绝了99%以上的无效分子。此外,该框架用两种SARS-CoV-2病毒蛋白进行了测试:rna依赖性rna聚合酶(RdRp)和3c样蛋白酶(3CLpro)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.40
自引率
4.30%
发文量
43
期刊介绍: NetMAHIB publishes original research articles and reviews reporting how graph theory, statistics, linear algebra and machine learning techniques can be effectively used for modelling and analysis in health informatics and bioinformatics. It aims at creating a synergy between these disciplines by providing a forum for disseminating the latest developments and research findings; hence, results can be shared with readers across institutions, governments, researchers, students, and the industry. The journal emphasizes fundamental contributions on new methodologies, discoveries and techniques that have general applicability and which form the basis for network based modelling, knowledge discovery, knowledge sharing and decision support to the benefit of patients, healthcare professionals and society in traditional and advanced emerging settings, including eHealth and mHealth .
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信