Revealing the Impact of Aggregations in the Graph‐Based Molecular Machine Learning: Electrostatic Interaction Versus Pooling Methods

IF 2.9 4区 工程技术 Q1 MULTIDISCIPLINARY SCIENCES
Sanghoon Lee, Hyun Woo Kim
{"title":"Revealing the Impact of Aggregations in the Graph‐Based Molecular Machine Learning: Electrostatic Interaction Versus Pooling Methods","authors":"Sanghoon Lee, Hyun Woo Kim","doi":"10.1002/adts.202500133","DOIUrl":null,"url":null,"abstract":"Molecular structures that can be readily represented by graphs comprising constituent atoms (nodes) and their chemical bonds (edges) can also be used as input data for well‐known machine learning (ML) models that process this data, such as graph neural networks (GNNs). GNNs have shown a reasonable performance in the predicting properties of chemical systems. In typical applications of GNNs to chemistry‐related fields, the main objective is to create an optimal molecular representation by aggregating atomic features and pooling features in the graph. In this study, two different approaches are investigated that can possibly generate better molecular representations. First, intermolecular edges are created to predict the photochemical properties of chromophore molecules in the solution. These intermolecular edges are constructed using atomic partial charges, inspired from the fact that electrostatic interaction is the main component of solute‐solvent interaction. In the second approach, the effect of the aggregation and pooling functions is investigated. The results show that intermolecular electrostatic edges based on ground state charges prevent the GNN model from generating more effective molecular representations. On the contrary, the model demonstrated better performance when the averaging and adding operations are employed in a hybrid manner for the aggregation and pooling functions.","PeriodicalId":7219,"journal":{"name":"Advanced Theory and Simulations","volume":"5 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Theory and Simulations","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/adts.202500133","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Molecular structures that can be readily represented by graphs comprising constituent atoms (nodes) and their chemical bonds (edges) can also be used as input data for well‐known machine learning (ML) models that process this data, such as graph neural networks (GNNs). GNNs have shown a reasonable performance in the predicting properties of chemical systems. In typical applications of GNNs to chemistry‐related fields, the main objective is to create an optimal molecular representation by aggregating atomic features and pooling features in the graph. In this study, two different approaches are investigated that can possibly generate better molecular representations. First, intermolecular edges are created to predict the photochemical properties of chromophore molecules in the solution. These intermolecular edges are constructed using atomic partial charges, inspired from the fact that electrostatic interaction is the main component of solute‐solvent interaction. In the second approach, the effect of the aggregation and pooling functions is investigated. The results show that intermolecular electrostatic edges based on ground state charges prevent the GNN model from generating more effective molecular representations. On the contrary, the model demonstrated better performance when the averaging and adding operations are employed in a hybrid manner for the aggregation and pooling functions.
揭示基于图的分子机器学习中聚集的影响:静电相互作用与池化方法
分子结构可以很容易地用由组成原子(节点)及其化学键(边)组成的图来表示,也可以用作处理这些数据的众所周知的机器学习(ML)模型的输入数据,例如图神经网络(gnn)。GNNs在预测化学系统性质方面表现出了良好的性能。在gnn在化学相关领域的典型应用中,主要目标是通过聚合图中的原子特征和池化特征来创建最佳的分子表示。在这项研究中,研究了两种可能产生更好的分子表征的不同方法。首先,创建分子间边缘来预测溶液中发色团分子的光化学性质。这些分子间的边缘是用原子部分电荷构建的,灵感来自于静电相互作用是溶质-溶剂相互作用的主要组成部分。在第二种方法中,研究了聚合和池化函数的影响。结果表明,基于基态电荷的分子间静电边阻碍了GNN模型产生更有效的分子表示。相反,当对聚合和池化函数混合使用平均和添加操作时,模型表现出更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Theory and Simulations
Advanced Theory and Simulations Multidisciplinary-Multidisciplinary
CiteScore
5.50
自引率
3.00%
发文量
221
期刊介绍: Advanced Theory and Simulations is an interdisciplinary, international, English-language journal that publishes high-quality scientific results focusing on the development and application of theoretical methods, modeling and simulation approaches in all natural science and medicine areas, including: materials, chemistry, condensed matter physics engineering, energy life science, biology, medicine atmospheric/environmental science, climate science planetary science, astronomy, cosmology method development, numerical methods, statistics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信