Zepeng Chen, Xingchen Wang, Changfang Xiao, Yongqing Cao
{"title":"基于加权基因共表达网络分析和机器学习的溃疡性结肠炎潜在生物标志物筛选与实验研究。","authors":"Zepeng Chen, Xingchen Wang, Changfang Xiao, Yongqing Cao","doi":"10.1186/s12967-025-07058-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ulcerative colitis (UC) is a chronic nonspecific inflammatory intestinal disease affecting the mucosa and submucosa, characterized by continuous and diffuse active inflammation. However, its underlying pathogenesis remains unclear.</p><p><strong>Objective: </strong>This study aimed to identify potential UC biomarkers by integrating weighted gene co-expression network analysis (WGCNA) with machine learning, followed by validation in an experimental UC mouse model.</p><p><strong>Methods: </strong>The Gene Expression Omnibus database was systematically queried, and the GSE87466 dataset, comprising of colonic tissues from 87 patients with UC and 21 healthy controls, was retrieved. Differentially expressed genes (DEGs) were identified and subjected to Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses. WGCNA was used to extract UC-related DEGs. Two machine learning algorithms, the Least Absolute Shrinkage and Selection Operator (LASSO) and Support Vector Machine Recursive Feature Elimination (SVM-RFE), were used to screen potential biomarkers. These biomarkers were then validated using animal experiments.</p><p><strong>Results: </strong>A total of 1,097 DEGs were identified. WGCNA constructed nine co-expression gene modules, with the turquoise module (520 genes) exhibiting the highest relevance to UC. LASSO and SVM-RFE analysis identified poly(ADP-ribose) polymerase family member 8 (PARP8) as a potential biomarker of UC. Immunological analysis revealed significantly higher proportions of naive B cells, activated CD4<sup>+</sup> memory T cells, follicular helper T cells, γδT cells, M0 macrophages, M1 macrophages, activated mast cells, and neutrophils in UC samples compared to controls. PARP8 expression positively correlated with neutrophils, M1 macrophages, and activated CD4<sup>+</sup> T cells, but negatively correlated with plasma cells. In vivo validation confirmed elevated PARP8 expression in dextran sulfate sodium-induced UC mice compared to controls.</p><p><strong>Conclusion: </strong>PARP8 may contribute to UC pathogenesis via immune-related pathways and holds promise as a diagnostic and predictive biomarker.</p>","PeriodicalId":17458,"journal":{"name":"Journal of Translational Medicine","volume":"23 1","pages":"1029"},"PeriodicalIF":7.5000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486846/pdf/","citationCount":"0","resultStr":"{\"title\":\"Screening and experimental study of potential biomarkers for ulcerative colitis based on weighted gene co-expression network analysis and machine learning.\",\"authors\":\"Zepeng Chen, Xingchen Wang, Changfang Xiao, Yongqing Cao\",\"doi\":\"10.1186/s12967-025-07058-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Ulcerative colitis (UC) is a chronic nonspecific inflammatory intestinal disease affecting the mucosa and submucosa, characterized by continuous and diffuse active inflammation. However, its underlying pathogenesis remains unclear.</p><p><strong>Objective: </strong>This study aimed to identify potential UC biomarkers by integrating weighted gene co-expression network analysis (WGCNA) with machine learning, followed by validation in an experimental UC mouse model.</p><p><strong>Methods: </strong>The Gene Expression Omnibus database was systematically queried, and the GSE87466 dataset, comprising of colonic tissues from 87 patients with UC and 21 healthy controls, was retrieved. Differentially expressed genes (DEGs) were identified and subjected to Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses. WGCNA was used to extract UC-related DEGs. Two machine learning algorithms, the Least Absolute Shrinkage and Selection Operator (LASSO) and Support Vector Machine Recursive Feature Elimination (SVM-RFE), were used to screen potential biomarkers. These biomarkers were then validated using animal experiments.</p><p><strong>Results: </strong>A total of 1,097 DEGs were identified. WGCNA constructed nine co-expression gene modules, with the turquoise module (520 genes) exhibiting the highest relevance to UC. LASSO and SVM-RFE analysis identified poly(ADP-ribose) polymerase family member 8 (PARP8) as a potential biomarker of UC. Immunological analysis revealed significantly higher proportions of naive B cells, activated CD4<sup>+</sup> memory T cells, follicular helper T cells, γδT cells, M0 macrophages, M1 macrophages, activated mast cells, and neutrophils in UC samples compared to controls. PARP8 expression positively correlated with neutrophils, M1 macrophages, and activated CD4<sup>+</sup> T cells, but negatively correlated with plasma cells. In vivo validation confirmed elevated PARP8 expression in dextran sulfate sodium-induced UC mice compared to controls.</p><p><strong>Conclusion: </strong>PARP8 may contribute to UC pathogenesis via immune-related pathways and holds promise as a diagnostic and predictive biomarker.</p>\",\"PeriodicalId\":17458,\"journal\":{\"name\":\"Journal of Translational Medicine\",\"volume\":\"23 1\",\"pages\":\"1029\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12486846/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Translational Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12967-025-07058-1\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Translational Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12967-025-07058-1","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
Screening and experimental study of potential biomarkers for ulcerative colitis based on weighted gene co-expression network analysis and machine learning.
Background: Ulcerative colitis (UC) is a chronic nonspecific inflammatory intestinal disease affecting the mucosa and submucosa, characterized by continuous and diffuse active inflammation. However, its underlying pathogenesis remains unclear.
Objective: This study aimed to identify potential UC biomarkers by integrating weighted gene co-expression network analysis (WGCNA) with machine learning, followed by validation in an experimental UC mouse model.
Methods: The Gene Expression Omnibus database was systematically queried, and the GSE87466 dataset, comprising of colonic tissues from 87 patients with UC and 21 healthy controls, was retrieved. Differentially expressed genes (DEGs) were identified and subjected to Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses. WGCNA was used to extract UC-related DEGs. Two machine learning algorithms, the Least Absolute Shrinkage and Selection Operator (LASSO) and Support Vector Machine Recursive Feature Elimination (SVM-RFE), were used to screen potential biomarkers. These biomarkers were then validated using animal experiments.
Results: A total of 1,097 DEGs were identified. WGCNA constructed nine co-expression gene modules, with the turquoise module (520 genes) exhibiting the highest relevance to UC. LASSO and SVM-RFE analysis identified poly(ADP-ribose) polymerase family member 8 (PARP8) as a potential biomarker of UC. Immunological analysis revealed significantly higher proportions of naive B cells, activated CD4+ memory T cells, follicular helper T cells, γδT cells, M0 macrophages, M1 macrophages, activated mast cells, and neutrophils in UC samples compared to controls. PARP8 expression positively correlated with neutrophils, M1 macrophages, and activated CD4+ T cells, but negatively correlated with plasma cells. In vivo validation confirmed elevated PARP8 expression in dextran sulfate sodium-induced UC mice compared to controls.
Conclusion: PARP8 may contribute to UC pathogenesis via immune-related pathways and holds promise as a diagnostic and predictive biomarker.
期刊介绍:
The Journal of Translational Medicine is an open-access journal that publishes articles focusing on information derived from human experimentation to enhance communication between basic and clinical science. It covers all areas of translational medicine.