Sicong Huo, Pengying Deng, Jie Zhou, Tao Lu, Qingnian Li, Xiaowei Wang
{"title":"基于改进基因表达编程算法和蛋白-蛋白相互作用网络表征的蛋白质组合评分预测","authors":"Sicong Huo, Pengying Deng, Jie Zhou, Tao Lu, Qingnian Li, Xiaowei Wang","doi":"10.1049/syb2.70024","DOIUrl":null,"url":null,"abstract":"<p>Predicting the combined score in protein–protein interaction (PPI) networks represents a critical research focus in bioinformatics, as it contributes to enhancing the accuracy of PPI data and uncovering the inherent complexity of biological systems. However, existing intelligent algorithms encounter significant challenges in effectively integrating heterogeneous data sources, capturing the nonlinear dependencies within PPI networks, and improving model generalizability. To address these limitations, this study introduces an enhanced gene expression programming (DF-GEP) algorithm that incorporates dynamic factor optimization. The proposed DF-GEP framework integrates Spearman correlation analysis with kernel ridge regression (SC-KRR) to extract and assign refined weights to key PPI network features. Additionally, the algorithm adaptively regulates selection, crossover, mutation and fitness evaluation processes via dynamic factor adjustment, thereby improving adaptability and predictive precision. Experimental results show that the DF-GEP algorithm consistently outperforms baseline models in both predictive accuracy and stability. Beyond its application to PPI-combined score prediction, the proposed algorithm also exhibits strong potential for addressing complex nonlinear problems in other domains.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"19 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.70024","citationCount":"0","resultStr":"{\"title\":\"Proteins Combined Score Prediction Based on Improved Gene Expression Programming Algorithm and Protein–Protein Interaction Network Characterization\",\"authors\":\"Sicong Huo, Pengying Deng, Jie Zhou, Tao Lu, Qingnian Li, Xiaowei Wang\",\"doi\":\"10.1049/syb2.70024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Predicting the combined score in protein–protein interaction (PPI) networks represents a critical research focus in bioinformatics, as it contributes to enhancing the accuracy of PPI data and uncovering the inherent complexity of biological systems. However, existing intelligent algorithms encounter significant challenges in effectively integrating heterogeneous data sources, capturing the nonlinear dependencies within PPI networks, and improving model generalizability. To address these limitations, this study introduces an enhanced gene expression programming (DF-GEP) algorithm that incorporates dynamic factor optimization. The proposed DF-GEP framework integrates Spearman correlation analysis with kernel ridge regression (SC-KRR) to extract and assign refined weights to key PPI network features. Additionally, the algorithm adaptively regulates selection, crossover, mutation and fitness evaluation processes via dynamic factor adjustment, thereby improving adaptability and predictive precision. Experimental results show that the DF-GEP algorithm consistently outperforms baseline models in both predictive accuracy and stability. Beyond its application to PPI-combined score prediction, the proposed algorithm also exhibits strong potential for addressing complex nonlinear problems in other domains.</p>\",\"PeriodicalId\":50379,\"journal\":{\"name\":\"IET Systems Biology\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.70024\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Systems Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/syb2.70024\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Systems Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/syb2.70024","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
Proteins Combined Score Prediction Based on Improved Gene Expression Programming Algorithm and Protein–Protein Interaction Network Characterization
Predicting the combined score in protein–protein interaction (PPI) networks represents a critical research focus in bioinformatics, as it contributes to enhancing the accuracy of PPI data and uncovering the inherent complexity of biological systems. However, existing intelligent algorithms encounter significant challenges in effectively integrating heterogeneous data sources, capturing the nonlinear dependencies within PPI networks, and improving model generalizability. To address these limitations, this study introduces an enhanced gene expression programming (DF-GEP) algorithm that incorporates dynamic factor optimization. The proposed DF-GEP framework integrates Spearman correlation analysis with kernel ridge regression (SC-KRR) to extract and assign refined weights to key PPI network features. Additionally, the algorithm adaptively regulates selection, crossover, mutation and fitness evaluation processes via dynamic factor adjustment, thereby improving adaptability and predictive precision. Experimental results show that the DF-GEP algorithm consistently outperforms baseline models in both predictive accuracy and stability. Beyond its application to PPI-combined score prediction, the proposed algorithm also exhibits strong potential for addressing complex nonlinear problems in other domains.
期刊介绍:
IET Systems Biology covers intra- and inter-cellular dynamics, using systems- and signal-oriented approaches. Papers that analyse genomic data in order to identify variables and basic relationships between them are considered if the results provide a basis for mathematical modelling and simulation of cellular dynamics. Manuscripts on molecular and cell biological studies are encouraged if the aim is a systems approach to dynamic interactions within and between cells.
The scope includes the following topics:
Genomics, transcriptomics, proteomics, metabolomics, cells, tissue and the physiome; molecular and cellular interaction, gene, cell and protein function; networks and pathways; metabolism and cell signalling; dynamics, regulation and control; systems, signals, and information; experimental data analysis; mathematical modelling, simulation and theoretical analysis; biological modelling, simulation, prediction and control; methodologies, databases, tools and algorithms for modelling and simulation; modelling, analysis and control of biological networks; synthetic biology and bioengineering based on systems biology.