MMSG-DTA: A Multimodal, Multiscale Model Based on Sequence and Graph Modalities for Drug-Target Affinity Prediction.

IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL
Jiahao Xu, Lei Ci, Bo Zhu, Guanhua Zhang, Linhua Jiang, Shixin Ye-Lehmann, Wei Long
{"title":"MMSG-DTA: A Multimodal, Multiscale Model Based on Sequence and Graph Modalities for Drug-Target Affinity Prediction.","authors":"Jiahao Xu, Lei Ci, Bo Zhu, Guanhua Zhang, Linhua Jiang, Shixin Ye-Lehmann, Wei Long","doi":"10.1021/acs.jcim.4c01828","DOIUrl":null,"url":null,"abstract":"<p><p>Drug-Target Affinity (DTA) prediction is a cornerstone of drug discovery and development, providing critical insights into the intricate interactions between candidate drugs and their biological targets. Despite its importance, existing methodologies often face significant limitations in capturing comprehensive global features from molecular graphs, which are essential for accurately characterizing drug properties. Furthermore, protein feature extraction is predominantly restricted to 1D amino acid sequences, which fail to adequately represent the spatial structures and complex functional regions of proteins. These shortcomings impede the development of models capable of fully elucidating the mechanisms underlying drug-target interactions. To overcome these challenges, we propose a multimodal, multiscale model based on Sequence and Graph Modalities for Drug-Target Affinity (MMSG-DTA) Prediction. The model combines graph neural networks with Transformers to effectively capture both local node-level features and global structural features of molecular graphs. Additionally, a graph-based modality is employed to improve the extraction of protein features from amino acid sequences. To further enhance the model's performance, an attention-based feature fusion module is incorporated to integrate diverse feature types, thereby strengthening its representation capacity and robustness. We evaluated MMSG-DTA on three public benchmark data sets─Davis, KIBA, and Metz─and the experimental results demonstrate that the proposed model outperforms several state-of-the-art methods in DTA prediction. These findings highlight the effectiveness of MMSG-DTA in advancing the accuracy and robustness of drug-target interaction modeling.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"981-996"},"PeriodicalIF":5.6000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01828","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Drug-Target Affinity (DTA) prediction is a cornerstone of drug discovery and development, providing critical insights into the intricate interactions between candidate drugs and their biological targets. Despite its importance, existing methodologies often face significant limitations in capturing comprehensive global features from molecular graphs, which are essential for accurately characterizing drug properties. Furthermore, protein feature extraction is predominantly restricted to 1D amino acid sequences, which fail to adequately represent the spatial structures and complex functional regions of proteins. These shortcomings impede the development of models capable of fully elucidating the mechanisms underlying drug-target interactions. To overcome these challenges, we propose a multimodal, multiscale model based on Sequence and Graph Modalities for Drug-Target Affinity (MMSG-DTA) Prediction. The model combines graph neural networks with Transformers to effectively capture both local node-level features and global structural features of molecular graphs. Additionally, a graph-based modality is employed to improve the extraction of protein features from amino acid sequences. To further enhance the model's performance, an attention-based feature fusion module is incorporated to integrate diverse feature types, thereby strengthening its representation capacity and robustness. We evaluated MMSG-DTA on three public benchmark data sets─Davis, KIBA, and Metz─and the experimental results demonstrate that the proposed model outperforms several state-of-the-art methods in DTA prediction. These findings highlight the effectiveness of MMSG-DTA in advancing the accuracy and robustness of drug-target interaction modeling.

MMSG-DTA:基于序列和图模式的药物-靶点亲和力预测的多模态、多尺度模型。
药物靶标亲和力(DTA)预测是药物发现和开发的基石,为候选药物与其生物靶标之间复杂的相互作用提供了重要的见解。尽管它很重要,但现有的方法在从分子图中捕获全面的全局特征方面往往面临重大限制,这对于准确表征药物性质至关重要。此外,蛋白质特征提取主要局限于一维氨基酸序列,无法充分表征蛋白质的空间结构和复杂功能区域。这些缺点阻碍了能够充分阐明药物-靶标相互作用机制的模型的发展。为了克服这些挑战,我们提出了一个基于序列和图模式的多模态、多尺度的药物-靶标亲和力(MMSG-DTA)预测模型。该模型将图神经网络与变形器相结合,有效地捕获了分子图的局部节点级特征和全局结构特征。此外,采用基于图的模式来改进从氨基酸序列中提取蛋白质特征。为了进一步提高模型的性能,我们引入了基于注意力的特征融合模块,将不同的特征类型融合在一起,增强了模型的表征能力和鲁棒性。我们在三个公共基准数据集──Davis、KIBA和Metz──上评估了MMSG-DTA,实验结果表明,所提出的模型在DTA预测方面优于几种最先进的方法。这些发现突出了MMSG-DTA在提高药物-靶标相互作用建模的准确性和鲁棒性方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信