{"title":"Enhanced metabolite-disease associations prediction via Neighborhood Aggregation Graph Transformer with Kolmogorov–Arnold Networks","authors":"Pengli Lu , Jian Zhang , Wenzhi Liu , Fentang Gao","doi":"10.1016/j.jocs.2025.102629","DOIUrl":null,"url":null,"abstract":"<div><div>Metabolites are essential products of cellular chemical reactions, critical for sustaining life and reproduction. Research shows that metabolite concentrations in patients differ from those in healthy individuals, making metabolite-based disease prediction crucial for diagnosis and treatment. To address the limitations of current computational methods in accuracy and interpretability, we propose a novel Neighborhood Aggregation Graph Transformer method (AGKphormer). This method enhances link relationships by optimizing the minimum nuclear norm using the Alternating Direction Method of Multipliers (ADMM) and incorporates Fast Kolmogorov–Arnold Networks (FastKAN) to improve both accuracy and interpretability. We first construct a heterogeneous network based on the correlation and similarity between metabolites and diseases. Then, we utilize the ADMM algorithm to enhance link relationships by solving the minimum nuclear norm, reducing sparse relationships between nodes and providing richer features for neural network learning. For the features learned by the graph convolutional network (GCN), we employ a Graph Transformer augmented with FastKAN to learn long-range dependencies. This approach enables global feature embedding and addresses GCN’s smoothness issue while enhancing interpretability. Through five-fold cross-validation, AGKphormer achieved average AUC and AUPR values of 97.32% and 97.34%, respectively, outperforming most methods and demonstrating its effectiveness in predicting disease-related metabolites. Additionally, case studies further confirm that AGKphormer is a reliable tool for discovering potential metabolites.</div></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"90 ","pages":"Article 102629"},"PeriodicalIF":3.7000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877750325001061","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Metabolites are essential products of cellular chemical reactions, critical for sustaining life and reproduction. Research shows that metabolite concentrations in patients differ from those in healthy individuals, making metabolite-based disease prediction crucial for diagnosis and treatment. To address the limitations of current computational methods in accuracy and interpretability, we propose a novel Neighborhood Aggregation Graph Transformer method (AGKphormer). This method enhances link relationships by optimizing the minimum nuclear norm using the Alternating Direction Method of Multipliers (ADMM) and incorporates Fast Kolmogorov–Arnold Networks (FastKAN) to improve both accuracy and interpretability. We first construct a heterogeneous network based on the correlation and similarity between metabolites and diseases. Then, we utilize the ADMM algorithm to enhance link relationships by solving the minimum nuclear norm, reducing sparse relationships between nodes and providing richer features for neural network learning. For the features learned by the graph convolutional network (GCN), we employ a Graph Transformer augmented with FastKAN to learn long-range dependencies. This approach enables global feature embedding and addresses GCN’s smoothness issue while enhancing interpretability. Through five-fold cross-validation, AGKphormer achieved average AUC and AUPR values of 97.32% and 97.34%, respectively, outperforming most methods and demonstrating its effectiveness in predicting disease-related metabolites. Additionally, case studies further confirm that AGKphormer is a reliable tool for discovering potential metabolites.
期刊介绍:
Computational Science is a rapidly growing multi- and interdisciplinary field that uses advanced computing and data analysis to understand and solve complex problems. It has reached a level of predictive capability that now firmly complements the traditional pillars of experimentation and theory.
The recent advances in experimental techniques such as detectors, on-line sensor networks and high-resolution imaging techniques, have opened up new windows into physical and biological processes at many levels of detail. The resulting data explosion allows for detailed data driven modeling and simulation.
This new discipline in science combines computational thinking, modern computational methods, devices and collateral technologies to address problems far beyond the scope of traditional numerical methods.
Computational science typically unifies three distinct elements:
• Modeling, Algorithms and Simulations (e.g. numerical and non-numerical, discrete and continuous);
• Software developed to solve science (e.g., biological, physical, and social), engineering, medicine, and humanities problems;
• Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components (e.g. problem solving environments).