Enhancing Drug-Target Binding Affinity Prediction through Deep Learning and Protein Secondary Structure Integration

IF 2.9 3区生物学 Q3 BIOCHEMICAL RESEARCH METHODS

Current Bioinformatics Pub Date : 2024-02-07 DOI:10.2174/0115748936285519240110070209

Runhua Zhang, Baozhong Zhu, Tengsheng Jiang, Zhiming Cui, Hongjie Wu

{"title":"Enhancing Drug-Target Binding Affinity Prediction through Deep Learning and Protein Secondary Structure Integration","authors":"Runhua Zhang, Baozhong Zhu, Tengsheng Jiang, Zhiming Cui, Hongjie Wu","doi":"10.2174/0115748936285519240110070209","DOIUrl":null,"url":null,"abstract":"Background: Conventional approaches to drug discovery are often characterized by lengthy and costly processes. To expedite the discovery of new drugs, the integration of artificial intelligence (AI) in predicting drug-target binding affinity (DTA) has emerged as a crucial approach. Despite the proliferation of deep learning methods for DTA prediction, many of these methods primarily concentrate on the amino acid sequence of proteins. Yet, the interactions between drug compounds and targets occur within distinct segments within the protein structures, whereas the primary sequence primarily captures global protein features. Consequently, it falls short of fully elucidating the intricate relationship between drugs and their respective targets. Objective: This study aims to employ advanced deep-learning techniques to forecast DTA while incorporating information about the secondary structure of proteins. Methods: In our research, both the primary sequence of protein and the secondary structure of protein were leveraged for protein representation. While the primary sequence played the role of the overarching feature, the secondary structure was employed as the localized feature. Convolutional neural networks and graph neural networks were utilized to independently model the intricate features of target proteins and drug compounds. This approach enhanced our ability to capture drugtarget interactions more effectively Results: We have introduced a novel method for predicting DTA. In comparison to DeepDTA, our approach demonstrates significant enhancements, achieving a 3.9% increase in the Concordance Index (CI) and a remarkable 34% reduction in Mean Squared Error (MSE) when evaluated on the KIBA dataset. Conclusion: In conclusion, our results unequivocally demonstrate that augmenting DTA prediction with the inclusion of the protein's secondary structure as a localized feature yields significantly improved accuracy compared to relying solely on the primary structure.","PeriodicalId":10801,"journal":{"name":"Current Bioinformatics","volume":"24 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/0115748936285519240110070209","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Conventional approaches to drug discovery are often characterized by lengthy and costly processes. To expedite the discovery of new drugs, the integration of artificial intelligence (AI) in predicting drug-target binding affinity (DTA) has emerged as a crucial approach. Despite the proliferation of deep learning methods for DTA prediction, many of these methods primarily concentrate on the amino acid sequence of proteins. Yet, the interactions between drug compounds and targets occur within distinct segments within the protein structures, whereas the primary sequence primarily captures global protein features. Consequently, it falls short of fully elucidating the intricate relationship between drugs and their respective targets. Objective: This study aims to employ advanced deep-learning techniques to forecast DTA while incorporating information about the secondary structure of proteins. Methods: In our research, both the primary sequence of protein and the secondary structure of protein were leveraged for protein representation. While the primary sequence played the role of the overarching feature, the secondary structure was employed as the localized feature. Convolutional neural networks and graph neural networks were utilized to independently model the intricate features of target proteins and drug compounds. This approach enhanced our ability to capture drugtarget interactions more effectively Results: We have introduced a novel method for predicting DTA. In comparison to DeepDTA, our approach demonstrates significant enhancements, achieving a 3.9% increase in the Concordance Index (CI) and a remarkable 34% reduction in Mean Squared Error (MSE) when evaluated on the KIBA dataset. Conclusion: In conclusion, our results unequivocally demonstrate that augmenting DTA prediction with the inclusion of the protein's secondary structure as a localized feature yields significantly improved accuracy compared to relying solely on the primary structure.

查看原文本刊更多论文

通过深度学习和蛋白质二级结构整合加强药物与靶点的结合亲和力预测

背景：传统的药物发现方法通常具有过程冗长、成本高昂的特点。为了加快新药的发现，人工智能（AI）在预测药物与靶点结合亲和力（DTA）方面的整合已成为一种重要方法。尽管用于 DTA 预测的深度学习方法层出不穷，但其中许多方法主要集中于蛋白质的氨基酸序列。然而，药物化合物与靶点之间的相互作用发生在蛋白质结构的不同片段中，而主序列主要捕捉的是蛋白质的整体特征。因此，这种方法无法完全阐明药物与各自靶标之间错综复杂的关系。研究目的本研究旨在采用先进的深度学习技术预测 DTA，同时纳入蛋白质二级结构的相关信息。研究方法在我们的研究中，蛋白质的一级序列和二级结构都被用来表示蛋白质。一级序列是总体特征，二级结构则是局部特征。我们利用卷积神经网络和图神经网络对目标蛋白质和药物化合物的复杂特征进行独立建模。这种方法提高了我们更有效地捕捉药物与目标相互作用的能力：我们推出了一种预测 DTA 的新方法。与 DeepDTA 相比，我们的方法有了显著提高，在 KIBA 数据集上进行评估时，一致性指数 (CI) 提高了 3.9%，平均平方误差 (MSE) 显著降低了 34%。结论总之，我们的研究结果清楚地表明，通过将蛋白质的二级结构作为局部特征来增强 DTA 预测，与仅仅依赖一级结构相比，准确率有了显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Bioinformatics 生物-生化研究方法

CiteScore

6.60

自引率

2.50%

发文量

审稿时长

>12 weeks

期刊介绍： Current Bioinformatics aims to publish all the latest and outstanding developments in bioinformatics. Each issue contains a series of timely, in-depth/mini-reviews, research papers and guest edited thematic issues written by leaders in the field, covering a wide range of the integration of biology with computer and information science. The journal focuses on advances in computational molecular/structural biology, encompassing areas such as computing in biomedicine and genomics, computational proteomics and systems biology, and metabolic pathway engineering. Developments in these fields have direct implications on key issues related to health care, medicine, genetic disorders, development of agricultural products, renewable energy, environmental protection, etc.