MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge

IF 6.8 1区 医学 Q1 CHEMISTRY, MEDICINAL
Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu, Xiangxiang Zeng, Dongsheng Cao
{"title":"MPCD: A Multitask Graph Transformer for Molecular Property Prediction by Integrating Common and Domain Knowledge","authors":"Xixi Yang, Yanjing Duan, Zhixiang Cheng, Kun Li, Yuansheng Liu, Xiangxiang Zeng, Dongsheng Cao","doi":"10.1021/acs.jmedchem.4c02193","DOIUrl":null,"url":null,"abstract":"Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.","PeriodicalId":46,"journal":{"name":"Journal of Medicinal Chemistry","volume":"27 1","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.jmedchem.4c02193","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular property prediction with deep learning often employs self-supervised learning techniques to learn common knowledge through masked atom prediction. However, the common knowledge gained by masked atom prediction dramatically differs from the graph-level optimization objective of downstream tasks, which results in suboptimal problems. Particularly for properties with limited data, the failure to consider domain knowledge results in a direct search in an immense common space, rendering it infeasible to identify the global optimum. To address this, we propose MPCD, which enhances pretraining transferability by aligning the optimization objectives between pretraining and fine-tuning with domain knowledge. MPCD also leverages multitask learning to improve data utilization and model robustness. Technically, MPCD employs a relation-aware self-attention mechanism to capture molecules’ local and global structures comprehensively. Extensive validation demonstrates that MPCD outperforms state-of-the-art methods for absorption, distribution, metabolism, excretion, and toxicity (ADMET) and physicochemical prediction across various data sizes.

Abstract Image

MPCD:一种整合公共知识和领域知识的分子性质预测多任务图转换器
基于深度学习的分子性质预测通常采用自监督学习技术,通过掩模原子预测学习共性知识。然而,通过屏蔽原子预测获得的共同知识与下游任务的图级优化目标存在显著差异,从而导致次优问题。特别是对于数据有限的属性,不考虑领域知识会导致在一个巨大的公共空间中直接搜索,从而无法识别全局最优。为了解决这个问题,我们提出了MPCD,它通过将预训练和微调之间的优化目标与领域知识对齐来增强预训练的可转移性。MPCD还利用多任务学习来提高数据利用率和模型鲁棒性。从技术上讲,MPCD采用了一种关系感知的自关注机制来全面捕获分子的局部和全局结构。广泛的验证表明,MPCD在各种数据规模的吸收、分布、代谢、排泄和毒性(ADMET)和物理化学预测方面优于最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Medicinal Chemistry
Journal of Medicinal Chemistry 医学-医药化学
CiteScore
4.00
自引率
11.00%
发文量
804
审稿时长
1.9 months
期刊介绍: The Journal of Medicinal Chemistry is a prestigious biweekly peer-reviewed publication that focuses on the multifaceted field of medicinal chemistry. Since its inception in 1959 as the Journal of Medicinal and Pharmaceutical Chemistry, it has evolved to become a cornerstone in the dissemination of research findings related to the design, synthesis, and development of therapeutic agents. The Journal of Medicinal Chemistry is recognized for its significant impact in the scientific community, as evidenced by its 2022 impact factor of 7.3. This metric reflects the journal's influence and the importance of its content in shaping the future of drug discovery and development. The journal serves as a vital resource for chemists, pharmacologists, and other researchers interested in the molecular mechanisms of drug action and the optimization of therapeutic compounds.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信