A novel deep learning framework for predicting protein-ligand interaction fingerprints from sequence data: integrating graph inductive bias transformer with Kolmogorov-Arnold networks
{"title":"A novel deep learning framework for predicting protein-ligand interaction fingerprints from sequence data: integrating graph inductive bias transformer with Kolmogorov-Arnold networks","authors":"Lixin Lei, Qianjin Guo, Wu Liu, Zijun Wang, Kaitai Han, Chaojing Shi, Zhenxing Li, Sichao Lu, Mengqiu Wang, Zhiwei Zhang, Ruoyan Dai, Zhenghui Wang, Xingyu Liu","doi":"10.1016/j.comtox.2025.100386","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately modeling protein–ligand interactions is a central challenge in computational protein design and drug discovery. Traditional interaction fingerprint (IFP) approaches, while valuable, struggle to capture subtle binding features and adapt to diverse structural contexts. To address these limitations, we propose <strong>GITK</strong>, a deep learning framework that integrates a modified graph inductive bias transformer (GRIT) with Kolmogorov–Arnold networks (KANs) for interpretable interaction fingerprint prediction. GRIT introduces inductive bias to effectively represent both local and global graph structures of proteins and ligands, while KAN provides a principled functional decomposition that enhances nonlinear feature learning and interpretability. Benchmarking across multiple datasets demonstrates that GITK outperforms state-of-the-art models in binding affinity prediction, functional effect classification, and virtual screening. Moreover, GITK enables reliable selectivity analysis, successfully highlighting conformational differences and key residues in adenosine receptor subtypes, consistent with experimental findings such as the selectivity of the A1 antagonist DPCPX.</div></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"36 ","pages":"Article 100386"},"PeriodicalIF":2.9000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468111325000465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately modeling protein–ligand interactions is a central challenge in computational protein design and drug discovery. Traditional interaction fingerprint (IFP) approaches, while valuable, struggle to capture subtle binding features and adapt to diverse structural contexts. To address these limitations, we propose GITK, a deep learning framework that integrates a modified graph inductive bias transformer (GRIT) with Kolmogorov–Arnold networks (KANs) for interpretable interaction fingerprint prediction. GRIT introduces inductive bias to effectively represent both local and global graph structures of proteins and ligands, while KAN provides a principled functional decomposition that enhances nonlinear feature learning and interpretability. Benchmarking across multiple datasets demonstrates that GITK outperforms state-of-the-art models in binding affinity prediction, functional effect classification, and virtual screening. Moreover, GITK enables reliable selectivity analysis, successfully highlighting conformational differences and key residues in adenosine receptor subtypes, consistent with experimental findings such as the selectivity of the A1 antagonist DPCPX.
期刊介绍:
Computational Toxicology is an international journal publishing computational approaches that assist in the toxicological evaluation of new and existing chemical substances assisting in their safety assessment. -All effects relating to human health and environmental toxicity and fate -Prediction of toxicity, metabolism, fate and physico-chemical properties -The development of models from read-across, (Q)SARs, PBPK, QIVIVE, Multi-Scale Models -Big Data in toxicology: integration, management, analysis -Implementation of models through AOPs, IATA, TTC -Regulatory acceptance of models: evaluation, verification and validation -From metals, to small organic molecules to nanoparticles -Pharmaceuticals, pesticides, foods, cosmetics, fine chemicals -Bringing together the views of industry, regulators, academia, NGOs