Bo Wang , Peilong Wu , Xiaoxin Du , JianFei Zhang , Chunyu Zhang
{"title":"Adaptive dynamic K-nearest neighbors and context-aware similarity optimization for microbe-disease association prediction","authors":"Bo Wang , Peilong Wu , Xiaoxin Du , JianFei Zhang , Chunyu Zhang","doi":"10.1016/j.compeleceng.2025.110725","DOIUrl":null,"url":null,"abstract":"<div><div>Microbes play a crucial role in disease occurrence, progression, and treatment. Traditional experimental methods are time-consuming, prompting researchers to turn to computational models. However, existing models often suffer from limited data adaptability and improper feature selection, making them prone to noise interference. To address these limitations, we propose ADKNN-KFGCN, a novel adaptive framework that integrates dynamic K-nearest neighbors, graph convolutional networks, and context-aware similarity optimization. The model constructs multi-source similarity networks by integrating various similarity measures between microbes and diseases, forming a comprehensive foundation for association inference. To better capture complex local patterns, it employs adaptive dynamic K-nearest neighbors to adjust the number of neighbors based on local structure, enhancing the accuracy of network construction. This is followed by context-aware similarity optimization, which filters out low-similarity nodes to suppress noise and emphasize the most informative connections. On this refined graph, graph convolutional networks are used to extract high-level representations, effectively capturing intricate topological relationships. These features are then fused through kernel-based strategies, combining multiple similarity sources via averaging and weighted integration to form a unified representation. Finally, Laplacian Regularized Least Squares leverages the global graph structure during prediction, improving generalization and ensuring robust performance. Experimental results show that ADKNN-KFGCN outperforms seven state-of-the-art methods, achieving an AUC of 0.9851±0.0025 and AUPR of 0.9587±0.0032 on the HMDAD dataset. Case studies on rheumatoid arthritis and inflammatory bowel disease further demonstrate its potential to uncover novel associations, provide insights into disease mechanisms, and support therapeutic target discovery.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"128 ","pages":"Article 110725"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625006688","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Microbes play a crucial role in disease occurrence, progression, and treatment. Traditional experimental methods are time-consuming, prompting researchers to turn to computational models. However, existing models often suffer from limited data adaptability and improper feature selection, making them prone to noise interference. To address these limitations, we propose ADKNN-KFGCN, a novel adaptive framework that integrates dynamic K-nearest neighbors, graph convolutional networks, and context-aware similarity optimization. The model constructs multi-source similarity networks by integrating various similarity measures between microbes and diseases, forming a comprehensive foundation for association inference. To better capture complex local patterns, it employs adaptive dynamic K-nearest neighbors to adjust the number of neighbors based on local structure, enhancing the accuracy of network construction. This is followed by context-aware similarity optimization, which filters out low-similarity nodes to suppress noise and emphasize the most informative connections. On this refined graph, graph convolutional networks are used to extract high-level representations, effectively capturing intricate topological relationships. These features are then fused through kernel-based strategies, combining multiple similarity sources via averaging and weighted integration to form a unified representation. Finally, Laplacian Regularized Least Squares leverages the global graph structure during prediction, improving generalization and ensuring robust performance. Experimental results show that ADKNN-KFGCN outperforms seven state-of-the-art methods, achieving an AUC of 0.9851±0.0025 and AUPR of 0.9587±0.0032 on the HMDAD dataset. Case studies on rheumatoid arthritis and inflammatory bowel disease further demonstrate its potential to uncover novel associations, provide insights into disease mechanisms, and support therapeutic target discovery.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.