{"title":"SPIN: SE(3)-Invariant Physics Informed Network for Binding Affinity Prediction","authors":"Seungyeon Choi, Sangmin Seo, Sanghyun Park","doi":"arxiv-2407.11057","DOIUrl":null,"url":null,"abstract":"Accurate prediction of protein-ligand binding affinity is crucial for rapid\nand efficient drug development. Recently, the importance of predicting binding\naffinity has led to increased attention on research that models the\nthree-dimensional structure of protein-ligand complexes using graph neural\nnetworks to predict binding affinity. However, traditional methods often fail\nto accurately model the complex's spatial information or rely solely on\ngeometric features, neglecting the principles of protein-ligand binding. This\ncan lead to overfitting, resulting in models that perform poorly on independent\ndatasets and ultimately reducing their usefulness in real drug development. To\naddress this issue, we propose SPIN, a model designed to achieve superior\ngeneralization by incorporating various inductive biases applicable to this\ntask, beyond merely training on empirical data from datasets. For prediction,\nwe defined two types of inductive biases: a geometric perspective that\nmaintains consistent binding affinity predictions regardless of the complexs\nrotations and translations, and a physicochemical perspective that necessitates\nminimal binding free energy along their reaction coordinate for effective\nprotein-ligand binding. These prior knowledge inputs enable the SPIN to\noutperform comparative models in benchmark sets such as CASF-2016 and CSAR HiQ.\nFurthermore, we demonstrated the practicality of our model through virtual\nscreening experiments and validated the reliability and potential of our\nproposed model based on experiments assessing its interpretability.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.11057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of protein-ligand binding affinity is crucial for rapid
and efficient drug development. Recently, the importance of predicting binding
affinity has led to increased attention on research that models the
three-dimensional structure of protein-ligand complexes using graph neural
networks to predict binding affinity. However, traditional methods often fail
to accurately model the complex's spatial information or rely solely on
geometric features, neglecting the principles of protein-ligand binding. This
can lead to overfitting, resulting in models that perform poorly on independent
datasets and ultimately reducing their usefulness in real drug development. To
address this issue, we propose SPIN, a model designed to achieve superior
generalization by incorporating various inductive biases applicable to this
task, beyond merely training on empirical data from datasets. For prediction,
we defined two types of inductive biases: a geometric perspective that
maintains consistent binding affinity predictions regardless of the complexs
rotations and translations, and a physicochemical perspective that necessitates
minimal binding free energy along their reaction coordinate for effective
protein-ligand binding. These prior knowledge inputs enable the SPIN to
outperform comparative models in benchmark sets such as CASF-2016 and CSAR HiQ.
Furthermore, we demonstrated the practicality of our model through virtual
screening experiments and validated the reliability and potential of our
proposed model based on experiments assessing its interpretability.