通过WGCNA和机器学习识别早期非酒精性脂肪肝的免疫和重度抑郁症相关诊断标志物

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in bioinformatics Pub Date : 2025-06-26 eCollection Date: 2025-01-01 DOI:10.3389/fbinf.2025.1594971
Yuyun Jia, Yanping Cao, Qin Yin, Xueqian Li, Xiu Wen
{"title":"通过WGCNA和机器学习识别早期非酒精性脂肪肝的免疫和重度抑郁症相关诊断标志物","authors":"Yuyun Jia, Yanping Cao, Qin Yin, Xueqian Li, Xiu Wen","doi":"10.3389/fbinf.2025.1594971","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Major depressive disorder (MDD) and nonalcoholic fatty liver disease (NAFLD) are highly prevalent conditions that exhibit significant pathophysiological overlap, particularly in metabolic and immune pathways.</p><p><strong>Objective: </strong>This study aims to bridge this gap by integrating transcriptomic data from publicly available repositories and advanced machine learning algorithms to identify novel biomarkers and construct a predictive model facilitates the provision of clinical psychological nursing interventions for early-stage NAFLD in MDD patients.</p><p><strong>Method: </strong>We systematically analyzed transcriptomic data of simple steatosis (SS), nonalcoholic steatohepatitis (NASH), and major depressive disorder (MDD) from GEO databases to construct and validate a diagnostic model. After removing batch effects, we identified differentially expressed genes (DEGs) that distinguished disease and control groups. We further applied Weighted Gene Co-expression Network Analysis (WGCNA) to identify immune-related genes in SS/NASH patients versus controls. The intersection of shared DEGs across both conditions and WGCNA-identified genes was determined and subjected to functional enrichment analysis. Immune cell infiltration levels were quantified using single-sample gene set enrichment analysis (ssGSEA). A predictive model for SS/NASH was developed by evaluating nine machine-learning algorithms with 10-fold cross-validation on the datasets.</p><p><strong>Results: </strong>Fourteen genes strongly linked to both the immune system and the two conditions were identified. Immune cell infiltration profiling revealed distinct immune landscapes in patients versus healthy controls. Moreover, an eight-gene signature was developed, demonstrating superior diagnostic accuracy in both testing and training cohorts. Notably, these eight genes were found to correlate with the severity of early-stage NAFLD.</p><p><strong>Conclusion: </strong>This study established a predictive model for early-stage NAFLD through the integration of bioinformatics and machine learning approaches, with a focus on immune- and MDD-related genes. The eight-gene signature identified in this study represents a novel diagnostic tool for precision medicine, enabling targeted psychological nursing intervention in comorbid populations.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1594971"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271764/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of immune and major depressive disorder-related diagnostic markers for early nonalcoholic fatty liver disease by WGCNA and machine learning.\",\"authors\":\"Yuyun Jia, Yanping Cao, Qin Yin, Xueqian Li, Xiu Wen\",\"doi\":\"10.3389/fbinf.2025.1594971\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Major depressive disorder (MDD) and nonalcoholic fatty liver disease (NAFLD) are highly prevalent conditions that exhibit significant pathophysiological overlap, particularly in metabolic and immune pathways.</p><p><strong>Objective: </strong>This study aims to bridge this gap by integrating transcriptomic data from publicly available repositories and advanced machine learning algorithms to identify novel biomarkers and construct a predictive model facilitates the provision of clinical psychological nursing interventions for early-stage NAFLD in MDD patients.</p><p><strong>Method: </strong>We systematically analyzed transcriptomic data of simple steatosis (SS), nonalcoholic steatohepatitis (NASH), and major depressive disorder (MDD) from GEO databases to construct and validate a diagnostic model. After removing batch effects, we identified differentially expressed genes (DEGs) that distinguished disease and control groups. We further applied Weighted Gene Co-expression Network Analysis (WGCNA) to identify immune-related genes in SS/NASH patients versus controls. The intersection of shared DEGs across both conditions and WGCNA-identified genes was determined and subjected to functional enrichment analysis. Immune cell infiltration levels were quantified using single-sample gene set enrichment analysis (ssGSEA). A predictive model for SS/NASH was developed by evaluating nine machine-learning algorithms with 10-fold cross-validation on the datasets.</p><p><strong>Results: </strong>Fourteen genes strongly linked to both the immune system and the two conditions were identified. Immune cell infiltration profiling revealed distinct immune landscapes in patients versus healthy controls. Moreover, an eight-gene signature was developed, demonstrating superior diagnostic accuracy in both testing and training cohorts. Notably, these eight genes were found to correlate with the severity of early-stage NAFLD.</p><p><strong>Conclusion: </strong>This study established a predictive model for early-stage NAFLD through the integration of bioinformatics and machine learning approaches, with a focus on immune- and MDD-related genes. The eight-gene signature identified in this study represents a novel diagnostic tool for precision medicine, enabling targeted psychological nursing intervention in comorbid populations.</p>\",\"PeriodicalId\":73066,\"journal\":{\"name\":\"Frontiers in bioinformatics\",\"volume\":\"5 \",\"pages\":\"1594971\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271764/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fbinf.2025.1594971\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2025.1594971","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:重度抑郁症(MDD)和非酒精性脂肪性肝病(NAFLD)是非常普遍的疾病,表现出显著的病理生理重叠,特别是在代谢和免疫途径中。目的:本研究旨在通过整合来自公开可用数据库的转录组数据和先进的机器学习算法来识别新的生物标志物,并构建一个预测模型,为MDD患者早期NAFLD的临床心理护理干预提供便利。方法:系统分析GEO数据库中单纯性脂肪变性(SS)、非酒精性脂肪性肝炎(NASH)和重度抑郁症(MDD)的转录组学数据,构建并验证诊断模型。在去除批效应后,我们确定了区分疾病组和对照组的差异表达基因(DEGs)。我们进一步应用加权基因共表达网络分析(WGCNA)来鉴定SS/NASH患者与对照组的免疫相关基因。确定了两种条件和wgna鉴定基因之间的共享deg交集,并进行了功能富集分析。采用单样本基因集富集分析(ssGSEA)定量免疫细胞浸润水平。通过对数据集进行10倍交叉验证,评估了9种机器学习算法,开发了SS/NASH的预测模型。结果:确定了14个与免疫系统和两种疾病密切相关的基因。免疫细胞浸润分析揭示了不同的免疫景观患者与健康对照。此外,开发了一个八基因签名,在测试和训练队列中都显示出卓越的诊断准确性。值得注意的是,这8个基因被发现与早期NAFLD的严重程度相关。结论:本研究通过生物信息学和机器学习方法的结合,建立了早期NAFLD的预测模型,重点关注免疫和mdd相关基因。在这项研究中确定的八个基因特征代表了一种新的精确医学诊断工具,使有针对性的心理护理干预合并症人群。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of immune and major depressive disorder-related diagnostic markers for early nonalcoholic fatty liver disease by WGCNA and machine learning.

Background: Major depressive disorder (MDD) and nonalcoholic fatty liver disease (NAFLD) are highly prevalent conditions that exhibit significant pathophysiological overlap, particularly in metabolic and immune pathways.

Objective: This study aims to bridge this gap by integrating transcriptomic data from publicly available repositories and advanced machine learning algorithms to identify novel biomarkers and construct a predictive model facilitates the provision of clinical psychological nursing interventions for early-stage NAFLD in MDD patients.

Method: We systematically analyzed transcriptomic data of simple steatosis (SS), nonalcoholic steatohepatitis (NASH), and major depressive disorder (MDD) from GEO databases to construct and validate a diagnostic model. After removing batch effects, we identified differentially expressed genes (DEGs) that distinguished disease and control groups. We further applied Weighted Gene Co-expression Network Analysis (WGCNA) to identify immune-related genes in SS/NASH patients versus controls. The intersection of shared DEGs across both conditions and WGCNA-identified genes was determined and subjected to functional enrichment analysis. Immune cell infiltration levels were quantified using single-sample gene set enrichment analysis (ssGSEA). A predictive model for SS/NASH was developed by evaluating nine machine-learning algorithms with 10-fold cross-validation on the datasets.

Results: Fourteen genes strongly linked to both the immune system and the two conditions were identified. Immune cell infiltration profiling revealed distinct immune landscapes in patients versus healthy controls. Moreover, an eight-gene signature was developed, demonstrating superior diagnostic accuracy in both testing and training cohorts. Notably, these eight genes were found to correlate with the severity of early-stage NAFLD.

Conclusion: This study established a predictive model for early-stage NAFLD through the integration of bioinformatics and machine learning approaches, with a focus on immune- and MDD-related genes. The eight-gene signature identified in this study represents a novel diagnostic tool for precision medicine, enabling targeted psychological nursing intervention in comorbid populations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信