GraphGIM: rethinking molecular graph contrastive learning via geometry image modeling.

IF 4.4 1区 生物学 Q1 BIOLOGY
Chaoyi Li, Hongxin Xiang, Wenjie Du, Tengfei Ma, Haowen Chen, Xiangxiang Zeng, Lei Xu
{"title":"GraphGIM: rethinking molecular graph contrastive learning via geometry image modeling.","authors":"Chaoyi Li, Hongxin Xiang, Wenjie Du, Tengfei Ma, Haowen Chen, Xiangxiang Zeng, Lei Xu","doi":"10.1186/s12915-025-02249-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Learning molecular representations is crucial for accurate drug discovery. Using graphs to represent molecules is a popular solution, and many researchers have used contrastive learning to improve the generalization of molecular graph representations.</p><p><strong>Results: </strong>In this work, we revisit existing graph-based contrastive methods and find that these methods have limited diversity in the constructed sample pairs, resulting in insufficient performance gains. To alleviate the above challenge, we propose a novel molecular graph contrastive learning method via geometry image modeling, called GraphGIM, which enhances the diversity between sample pairs. GraphGIM is pre-trained on 2 million 2D graphs and multi-view 3D geometry images through contrastive learning. Furthermore, we find that as the convolutional layers process the image becomes deeper, the information of feature maps gradually changes from global molecular-level information (molecular scaffolds) to local atomic-level information (molecular atoms and functional groups), which provides chemical information at different scales. Therefore, we propose two variants of GraphGIM, called GraphGIM-M and GraphGIM-P, which fuse feature maps of different scales in the image using a weighted strategy and a prompt-based strategy, respectively.</p><p><strong>Conclusions: </strong>Extensive experiments show that GraphGIM and its two variants outperform state-of-the-art graph contrastive learning methods on eight molecular property prediction benchmarks from MoleculeNet and achieve competitive results with state-of-the-art methods. The code is available at https://github.com/cyli029/GraphGIM .</p>","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":"23 1","pages":"189"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12219247/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-025-02249-0","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Learning molecular representations is crucial for accurate drug discovery. Using graphs to represent molecules is a popular solution, and many researchers have used contrastive learning to improve the generalization of molecular graph representations.

Results: In this work, we revisit existing graph-based contrastive methods and find that these methods have limited diversity in the constructed sample pairs, resulting in insufficient performance gains. To alleviate the above challenge, we propose a novel molecular graph contrastive learning method via geometry image modeling, called GraphGIM, which enhances the diversity between sample pairs. GraphGIM is pre-trained on 2 million 2D graphs and multi-view 3D geometry images through contrastive learning. Furthermore, we find that as the convolutional layers process the image becomes deeper, the information of feature maps gradually changes from global molecular-level information (molecular scaffolds) to local atomic-level information (molecular atoms and functional groups), which provides chemical information at different scales. Therefore, we propose two variants of GraphGIM, called GraphGIM-M and GraphGIM-P, which fuse feature maps of different scales in the image using a weighted strategy and a prompt-based strategy, respectively.

Conclusions: Extensive experiments show that GraphGIM and its two variants outperform state-of-the-art graph contrastive learning methods on eight molecular property prediction benchmarks from MoleculeNet and achieve competitive results with state-of-the-art methods. The code is available at https://github.com/cyli029/GraphGIM .

GraphGIM:通过几何图像建模重新思考分子图对比学习。
背景:学习分子表征对于准确的药物发现至关重要。使用图来表示分子是一种流行的解决方案,许多研究人员已经使用对比学习来提高分子图表示的泛化。结果:在这项工作中,我们重新审视了现有的基于图的对比方法,发现这些方法在构建的样本对中多样性有限,导致性能提升不足。为了缓解上述挑战,我们提出了一种新的基于几何图像建模的分子图对比学习方法GraphGIM,该方法增强了样本对之间的多样性。GraphGIM通过对比学习对200万张2D图和多视图3D几何图像进行预训练。此外,我们发现随着卷积层对图像处理的深入,特征图的信息逐渐从全局分子水平信息(分子支架)转变为局部原子水平信息(分子原子和官能团),提供了不同尺度的化学信息。因此,我们提出了GraphGIM的两种变体GraphGIM- m和GraphGIM- p,它们分别使用加权策略和基于提示的策略融合图像中不同尺度的特征映射。结论:大量实验表明,GraphGIM及其两个变体在来自MoleculeNet的8个分子性质预测基准上优于最先进的图对比学习方法,并取得了与最先进的方法相竞争的结果。代码可在https://github.com/cyli029/GraphGIM上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Biology
BMC Biology 生物-生物学
CiteScore
7.80
自引率
1.90%
发文量
260
审稿时长
3 months
期刊介绍: BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信