Enhancing molecular property prediction with auxiliary learning and task-specific adaptation

IF 7.1 2区化学 Q1 CHEMISTRY, MULTIDISCIPLINARY

Journal of Cheminformatics Pub Date : 2024-07-24 DOI:10.1186/s13321-024-00880-7

Vishal Dey, Xia Ning

{"title":"Enhancing molecular property prediction with auxiliary learning and task-specific adaptation","authors":"Vishal Dey, Xia Ning","doi":"10.1186/s13321-024-00880-7","DOIUrl":null,"url":null,"abstract":"<div>Pretrained Graph Neural Networks have been widely adopted for various molecular property prediction tasks. Despite their ability to encode structural and relational features of molecules, traditional fine-tuning of such pretrained GNNs on the target task can lead to poor generalization. To address this, we explore the adaptation of pretrained GNNs to the target task by jointly training them with multiple auxiliary tasks. This could enable the GNNs to learn both general and task-specific features, which may benefit the target task. However, a major challenge is to determine the relatedness of auxiliary tasks with the target task. To address this, we investigate multiple strategies to measure the relevance of auxiliary tasks and integrate such tasks by adaptively combining task gradients or by learning task weights via bi-level optimization. Additionally, we propose a novel gradient surgery-based approach, Rotation of Conflicting Gradients (\$\\mathop {\\texttt{RCGrad}}\\limits\$), that learns to align conflicting auxiliary task gradients through rotation. Our experiments with state-of-the-art pretrained GNNs demonstrate the efficacy of our proposed methods, with improvements of up to 7.7% over fine-tuning. This suggests that incorporating auxiliary tasks along with target task fine-tuning can be an effective way to improve the generalizability of pretrained GNNs for molecular property prediction.Scientific contributionWe introduce a novel framework for adapting pretrained GNNs to molecular tasks using auxiliary learning to address the critical issue of negative transfer. Leveraging novel gradient surgery techniques such as \$\\mathop {\\texttt{RCGrad}}\\limits\$, the proposed adaptation framework represents a significant departure from the dominant pretraining fine-tuning approach for molecular GNNs. Our contributions are significant for drug discovery research, especially for tasks with limited data, filling a notable gap in the efficient adaptation of pretrained models for molecular GNNs.</div>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"16 1","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-024-00880-7","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-024-00880-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Pretrained Graph Neural Networks have been widely adopted for various molecular property prediction tasks. Despite their ability to encode structural and relational features of molecules, traditional fine-tuning of such pretrained GNNs on the target task can lead to poor generalization. To address this, we explore the adaptation of pretrained GNNs to the target task by jointly training them with multiple auxiliary tasks. This could enable the GNNs to learn both general and task-specific features, which may benefit the target task. However, a major challenge is to determine the relatedness of auxiliary tasks with the target task. To address this, we investigate multiple strategies to measure the relevance of auxiliary tasks and integrate such tasks by adaptively combining task gradients or by learning task weights via bi-level optimization. Additionally, we propose a novel gradient surgery-based approach, Rotation of Conflicting Gradients ($\mathop {\texttt{RCGrad}}\limits$), that learns to align conflicting auxiliary task gradients through rotation. Our experiments with state-of-the-art pretrained GNNs demonstrate the efficacy of our proposed methods, with improvements of up to 7.7% over fine-tuning. This suggests that incorporating auxiliary tasks along with target task fine-tuning can be an effective way to improve the generalizability of pretrained GNNs for molecular property prediction.

Scientific contribution

We introduce a novel framework for adapting pretrained GNNs to molecular tasks using auxiliary learning to address the critical issue of negative transfer. Leveraging novel gradient surgery techniques such as $\mathop {\texttt{RCGrad}}\limits$, the proposed adaptation framework represents a significant departure from the dominant pretraining fine-tuning approach for molecular GNNs. Our contributions are significant for drug discovery research, especially for tasks with limited data, filling a notable gap in the efficient adaptation of pretrained models for molecular GNNs.

查看原文本刊更多论文

利用辅助学习和特定任务适应性加强分子特性预测

预训练的图神经网络已被广泛用于各种分子特性预测任务。尽管预训练图神经网络能够编码分子的结构和关系特征，但根据目标任务对其进行传统的微调可能会导致泛化效果不佳。为了解决这个问题，我们探索了通过多个辅助任务联合训练预训练 GNN 来使其适应目标任务。这可以使 GNN 同时学习通用特征和特定任务特征，从而有利于目标任务。然而，如何确定辅助任务与目标任务的相关性是一大挑战。为了解决这个问题，我们研究了多种策略来衡量辅助任务的相关性，并通过自适应结合任务梯度或通过双层优化学习任务权重来整合这些任务。此外，我们还提出了一种基于梯度手术的新方法--"冲突梯度旋转"（Rotation of Conflicting Gradients），该方法可通过旋转来调整相互冲突的辅助任务梯度。我们用最先进的预训练 GNN 进行的实验证明了我们提出的方法的有效性，与微调相比，改进幅度高达 7.7%。这表明，将辅助任务与目标任务微调结合起来，可以有效提高预训练 GNN 在分子性质预测方面的通用性。科学贡献我们引入了一个新框架，利用辅助学习使预训练的 GNN 适应分子任务，以解决负迁移的关键问题。利用$$\mathop {texttt{RCGrad}}\limits$$ 等新颖的梯度手术技术，所提出的适应框架与分子 GNNs 的主流预训练微调方法大相径庭。我们的贡献对于药物发现研究意义重大，尤其是对于数据有限的任务，填补了分子 GNN 预训练模型高效适配方面的显著空白。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS

CiteScore

14.10

自引率

7.00%

发文量

审稿时长

3 months

期刊介绍： Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.