{"title":"A Local–Global Graph KAN for Multi-Class Prediction of PPI","authors":"Minghui Liu, Ying Qu","doi":"10.1002/eng2.70164","DOIUrl":null,"url":null,"abstract":"<p>Traditional experimental methods for identifying protein–protein interactions (PPI) are expensive and time-consuming. Therefore, using machine learning to treat multiple PPI predictions as binary classifications has become an alternative, but there is a problem of data imbalance. The proposed GLGKAN-PPI method integrates features from both global graphs and local subgraphs to capture the complex structural information of PPI networks comprehensively. Specifically, the method utilizes the pre-trained model MASSA to extract multimodal features of proteins. The global graph features are extracted using the GKAN (Graph Kolmogorov-Arnold Network) algorithm. Meanwhile, the local subgraph features are extracted using the MOE-GKAN (Mixture of Experts-Graph Kolmogorov-Arnold Network) algorithm. To mitigate data imbalance, an asymmetric loss function is utilized to better handle minority classes and improve overall prediction accuracy. Experimental results demonstrate that GLGKAN-PPI outperforms a range of existing intelligent approaches across multiple datasets and partitioning strategies.</p>","PeriodicalId":72922,"journal":{"name":"Engineering reports : open access","volume":"7 5","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/eng2.70164","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering reports : open access","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/eng2.70164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional experimental methods for identifying protein–protein interactions (PPI) are expensive and time-consuming. Therefore, using machine learning to treat multiple PPI predictions as binary classifications has become an alternative, but there is a problem of data imbalance. The proposed GLGKAN-PPI method integrates features from both global graphs and local subgraphs to capture the complex structural information of PPI networks comprehensively. Specifically, the method utilizes the pre-trained model MASSA to extract multimodal features of proteins. The global graph features are extracted using the GKAN (Graph Kolmogorov-Arnold Network) algorithm. Meanwhile, the local subgraph features are extracted using the MOE-GKAN (Mixture of Experts-Graph Kolmogorov-Arnold Network) algorithm. To mitigate data imbalance, an asymmetric loss function is utilized to better handle minority classes and improve overall prediction accuracy. Experimental results demonstrate that GLGKAN-PPI outperforms a range of existing intelligent approaches across multiple datasets and partitioning strategies.