Improving generalizability of drug-target binding prediction by pre-trained multi-view molecular representations.

Xike Ouyang, Yannuo Feng, Chen Cui, Yunhe Li, Li Zhang, Han Wang
{"title":"Improving generalizability of drug-target binding prediction by pre-trained multi-view molecular representations.","authors":"Xike Ouyang, Yannuo Feng, Chen Cui, Yunhe Li, Li Zhang, Han Wang","doi":"10.1093/bioinformatics/btaf002","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Most drugs start on their journey inside the body by binding the right target proteins. This is the reason that numerous efforts have been devoted to predicting the drug-target binding during drug development. However, the inherent diversity among molecular properties, coupled with limited training data availability, poses challenges to the accuracy and generalizability of these methods beyond their training domain.</p><p><strong>Results: </strong>In this work, we proposed a neural networks construction for high accurate and generalizable drug-target binding prediction, named Pre-trained Multi-view Molecular Representations (PMMR). The method uses pre-trained models to transfer representations of target proteins and drugs to the domain of drug-target binding prediction, mitigating the issue of poor generalizability stemming from limited data. Then, two typical representations of drug molecules, Graphs and SMILES strings, are learned respectively by a Graph Neural Network and a Transformer to achieve complementarity between local and global features. PMMR was evaluated on drug-target affinity and interaction benchmark datasets, and it derived preponderant performance contrast to peer methods, especially generalizability in cold-start scenarios. Furthermore, our state-of-the-art method was indicated to have the potential for drug discovery by a case study of cyclin-dependent kinase 2.</p><p><strong>Availability and implementation: </strong>https://github.com/NENUBioCompute/PMMR.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751634/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Most drugs start on their journey inside the body by binding the right target proteins. This is the reason that numerous efforts have been devoted to predicting the drug-target binding during drug development. However, the inherent diversity among molecular properties, coupled with limited training data availability, poses challenges to the accuracy and generalizability of these methods beyond their training domain.

Results: In this work, we proposed a neural networks construction for high accurate and generalizable drug-target binding prediction, named Pre-trained Multi-view Molecular Representations (PMMR). The method uses pre-trained models to transfer representations of target proteins and drugs to the domain of drug-target binding prediction, mitigating the issue of poor generalizability stemming from limited data. Then, two typical representations of drug molecules, Graphs and SMILES strings, are learned respectively by a Graph Neural Network and a Transformer to achieve complementarity between local and global features. PMMR was evaluated on drug-target affinity and interaction benchmark datasets, and it derived preponderant performance contrast to peer methods, especially generalizability in cold-start scenarios. Furthermore, our state-of-the-art method was indicated to have the potential for drug discovery by a case study of cyclin-dependent kinase 2.

Availability and implementation: https://github.com/NENUBioCompute/PMMR.

利用预训练的多视点分子表征提高药物与靶点结合预测的通用性。
动机:大多数药物通过结合正确的靶蛋白开始其在体内的旅程。这就是为什么在药物开发过程中对药物-靶点结合进行预测的原因。然而,分子性质的固有多样性,加上训练数据的有限性,对这些方法的准确性和泛化性提出了挑战。结果:在这项工作中,我们提出了一种用于高精度和可推广的药物靶点结合预测的神经网络结构,称为预训练多视图分子表征(PMMR)。该方法使用预先训练的模型将靶蛋白和药物的表示转移到药物-靶标结合预测领域,减轻了由于数据有限而导致的较差泛化性的问题。然后,利用图神经网络(Graph Neural Network, GNN)和Transformer分别学习药物分子的两种典型表征Graph和SMILES字符串,实现局部特征和全局特征的互补。PMMR在药物靶标亲和力和相互作用基准数据集上进行了评估,与同类方法相比,它的性能优于同类方法,特别是在冷启动场景下的通用性。此外,通过周期蛋白依赖性激酶2 (CDK2)的案例研究表明,我们最先进的方法具有药物发现的潜力。可用性和实施:https://github.com/NENUBioCompute/PMMR.Supplementary信息:补充数据可在生物信息学网站在线获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信