Identification of Biomarkers for Sepsis-Induced Acute Lung Injury Through Bioinformatics and Machine Learning Approaches, with Experimental Validation.

IF 4.1 2区 医学 Q2 IMMUNOLOGY
Journal of Inflammation Research Pub Date : 2025-10-04 eCollection Date: 2025-01-01 DOI:10.2147/JIR.S539899
Yannian Luo, Juan Xu, Nannan He, Wen Cao
{"title":"Identification of Biomarkers for Sepsis-Induced Acute Lung Injury Through Bioinformatics and Machine Learning Approaches, with Experimental Validation.","authors":"Yannian Luo, Juan Xu, Nannan He, Wen Cao","doi":"10.2147/JIR.S539899","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sepsis-induced acute lung injury (ALI) remains a life-threatening condition due to the lack of reliable early diagnostic biomarkers. Machine learning offers powerful tools for analyzing high-dimensional gene expression data and identifying potential biomarkers and therapeutic targets.</p><p><strong>Methods: </strong>Five datasets (GSE10474, GSE32707, GSE66890, GSE10361, GSE3037) were obtained from the GEO database. After assessment and normalization, GSE10474, GSE32707, and GSE66890 were combined as a training set to identify differentially expressed genes (DEGs). DEGs were intersected with genes from key modules identified by weighted gene co-expression network analysis (WGCNA), yielding 213 overlapping genes. These were analyzed via Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment. Eight machine learning algorithms (RF, SVM, GLM, GBM, KNN, NNET, LASSO, DT) were used to develop diagnostic models, which were validated on GSE10361 and GSE3037. Model performance was evaluated using a nomogram, calibration curves, and decision curve analysis (DCA). Immune and inflammatory states were assessed using the CIBERSORT algorithm. Potential therapeutic compounds were identified through the DSigDB database via the Enrichr platform. Molecular docking and molecular dynamics simulations examined interactions between Resveratrol and selected targets. In vitro experiments validated these findings.</p><p><strong>Results: </strong>A total of 213 candidate genes were identified by intersecting DEGs with WGCNA-derived MEblue module genes. GO and KEGG analyses indicated associations with immune activation and bacterial infection. Four key genes (DDAH2, PNPLA2, STXBP2, TCN1) were selected using eight machine learning algorithms. The diagnostic model showed good performance via nomogram, calibration curve, and DCA. Molecular docking revealed stable binding of Resveratrol to these genes. In vitro, Resveratrol pretreatment alleviated LPS-induced ALI by modulating the core genes.</p><p><strong>Conclusion: </strong>The four genes may serve as diagnostic biomarkers for sepsis-ALI. Resveratrol represents a potential therapeutic strategy by targeting these genes.</p>","PeriodicalId":16107,"journal":{"name":"Journal of Inflammation Research","volume":"18 ","pages":"13635-13650"},"PeriodicalIF":4.1000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506793/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Inflammation Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JIR.S539899","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Sepsis-induced acute lung injury (ALI) remains a life-threatening condition due to the lack of reliable early diagnostic biomarkers. Machine learning offers powerful tools for analyzing high-dimensional gene expression data and identifying potential biomarkers and therapeutic targets.

Methods: Five datasets (GSE10474, GSE32707, GSE66890, GSE10361, GSE3037) were obtained from the GEO database. After assessment and normalization, GSE10474, GSE32707, and GSE66890 were combined as a training set to identify differentially expressed genes (DEGs). DEGs were intersected with genes from key modules identified by weighted gene co-expression network analysis (WGCNA), yielding 213 overlapping genes. These were analyzed via Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment. Eight machine learning algorithms (RF, SVM, GLM, GBM, KNN, NNET, LASSO, DT) were used to develop diagnostic models, which were validated on GSE10361 and GSE3037. Model performance was evaluated using a nomogram, calibration curves, and decision curve analysis (DCA). Immune and inflammatory states were assessed using the CIBERSORT algorithm. Potential therapeutic compounds were identified through the DSigDB database via the Enrichr platform. Molecular docking and molecular dynamics simulations examined interactions between Resveratrol and selected targets. In vitro experiments validated these findings.

Results: A total of 213 candidate genes were identified by intersecting DEGs with WGCNA-derived MEblue module genes. GO and KEGG analyses indicated associations with immune activation and bacterial infection. Four key genes (DDAH2, PNPLA2, STXBP2, TCN1) were selected using eight machine learning algorithms. The diagnostic model showed good performance via nomogram, calibration curve, and DCA. Molecular docking revealed stable binding of Resveratrol to these genes. In vitro, Resveratrol pretreatment alleviated LPS-induced ALI by modulating the core genes.

Conclusion: The four genes may serve as diagnostic biomarkers for sepsis-ALI. Resveratrol represents a potential therapeutic strategy by targeting these genes.

通过生物信息学和机器学习方法鉴定败血症诱导的急性肺损伤的生物标志物,并进行实验验证。
背景:由于缺乏可靠的早期诊断生物标志物,脓毒症引起的急性肺损伤(ALI)仍然是一种危及生命的疾病。机器学习为分析高维基因表达数据和识别潜在的生物标志物和治疗靶点提供了强大的工具。方法:从GEO数据库中获取GSE10474、GSE32707、GSE66890、GSE10361、GSE3037 5个数据集。经过评估和归一化后,将GSE10474、GSE32707和GSE66890作为一个训练集进行差异表达基因(differential expression genes, DEGs)的鉴定。deg与加权基因共表达网络分析(WGCNA)鉴定的关键模块基因相交,得到213个重叠基因。通过基因本体(GO)和京都基因与基因组百科全书(KEGG)富集分析。采用RF、SVM、GLM、GBM、KNN、NNET、LASSO、DT等8种机器学习算法建立诊断模型,分别在GSE10361和GSE3037上进行了验证。使用nomogram、校准曲线和决策曲线分析(decision curve analysis, DCA)来评估模型的性能。使用CIBERSORT算法评估免疫和炎症状态。通过enrichment平台通过DSigDB数据库确定潜在的治疗化合物。分子对接和分子动力学模拟研究了白藜芦醇与选定靶点之间的相互作用。体外实验证实了这些发现。结果:通过deg与wgna来源的MEblue模块基因相交,共鉴定出213个候选基因。GO和KEGG分析显示与免疫激活和细菌感染有关。通过8种机器学习算法选择4个关键基因(DDAH2、PNPLA2、STXBP2、TCN1)。经nomogram、calibration curve和DCA分析,该模型具有良好的诊断效果。分子对接揭示了白藜芦醇与这些基因的稳定结合。在体外,白藜芦醇预处理通过调节核心基因减轻lps诱导的ALI。结论:这4个基因可作为脓毒症- ali的诊断标志物。白藜芦醇通过靶向这些基因代表了一种潜在的治疗策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Inflammation Research
Journal of Inflammation Research Immunology and Microbiology-Immunology
CiteScore
6.10
自引率
2.20%
发文量
658
审稿时长
16 weeks
期刊介绍: An international, peer-reviewed, open access, online journal that welcomes laboratory and clinical findings on the molecular basis, cell biology and pharmacology of inflammation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信