Identification of Biomarkers for Sepsis-Induced Acute Lung Injury Through Bioinformatics and Machine Learning Approaches, with Experimental Validation.
{"title":"Identification of Biomarkers for Sepsis-Induced Acute Lung Injury Through Bioinformatics and Machine Learning Approaches, with Experimental Validation.","authors":"Yannian Luo, Juan Xu, Nannan He, Wen Cao","doi":"10.2147/JIR.S539899","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sepsis-induced acute lung injury (ALI) remains a life-threatening condition due to the lack of reliable early diagnostic biomarkers. Machine learning offers powerful tools for analyzing high-dimensional gene expression data and identifying potential biomarkers and therapeutic targets.</p><p><strong>Methods: </strong>Five datasets (GSE10474, GSE32707, GSE66890, GSE10361, GSE3037) were obtained from the GEO database. After assessment and normalization, GSE10474, GSE32707, and GSE66890 were combined as a training set to identify differentially expressed genes (DEGs). DEGs were intersected with genes from key modules identified by weighted gene co-expression network analysis (WGCNA), yielding 213 overlapping genes. These were analyzed via Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment. Eight machine learning algorithms (RF, SVM, GLM, GBM, KNN, NNET, LASSO, DT) were used to develop diagnostic models, which were validated on GSE10361 and GSE3037. Model performance was evaluated using a nomogram, calibration curves, and decision curve analysis (DCA). Immune and inflammatory states were assessed using the CIBERSORT algorithm. Potential therapeutic compounds were identified through the DSigDB database via the Enrichr platform. Molecular docking and molecular dynamics simulations examined interactions between Resveratrol and selected targets. In vitro experiments validated these findings.</p><p><strong>Results: </strong>A total of 213 candidate genes were identified by intersecting DEGs with WGCNA-derived MEblue module genes. GO and KEGG analyses indicated associations with immune activation and bacterial infection. Four key genes (DDAH2, PNPLA2, STXBP2, TCN1) were selected using eight machine learning algorithms. The diagnostic model showed good performance via nomogram, calibration curve, and DCA. Molecular docking revealed stable binding of Resveratrol to these genes. In vitro, Resveratrol pretreatment alleviated LPS-induced ALI by modulating the core genes.</p><p><strong>Conclusion: </strong>The four genes may serve as diagnostic biomarkers for sepsis-ALI. Resveratrol represents a potential therapeutic strategy by targeting these genes.</p>","PeriodicalId":16107,"journal":{"name":"Journal of Inflammation Research","volume":"18 ","pages":"13635-13650"},"PeriodicalIF":4.1000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12506793/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Inflammation Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JIR.S539899","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Sepsis-induced acute lung injury (ALI) remains a life-threatening condition due to the lack of reliable early diagnostic biomarkers. Machine learning offers powerful tools for analyzing high-dimensional gene expression data and identifying potential biomarkers and therapeutic targets.
Methods: Five datasets (GSE10474, GSE32707, GSE66890, GSE10361, GSE3037) were obtained from the GEO database. After assessment and normalization, GSE10474, GSE32707, and GSE66890 were combined as a training set to identify differentially expressed genes (DEGs). DEGs were intersected with genes from key modules identified by weighted gene co-expression network analysis (WGCNA), yielding 213 overlapping genes. These were analyzed via Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment. Eight machine learning algorithms (RF, SVM, GLM, GBM, KNN, NNET, LASSO, DT) were used to develop diagnostic models, which were validated on GSE10361 and GSE3037. Model performance was evaluated using a nomogram, calibration curves, and decision curve analysis (DCA). Immune and inflammatory states were assessed using the CIBERSORT algorithm. Potential therapeutic compounds were identified through the DSigDB database via the Enrichr platform. Molecular docking and molecular dynamics simulations examined interactions between Resveratrol and selected targets. In vitro experiments validated these findings.
Results: A total of 213 candidate genes were identified by intersecting DEGs with WGCNA-derived MEblue module genes. GO and KEGG analyses indicated associations with immune activation and bacterial infection. Four key genes (DDAH2, PNPLA2, STXBP2, TCN1) were selected using eight machine learning algorithms. The diagnostic model showed good performance via nomogram, calibration curve, and DCA. Molecular docking revealed stable binding of Resveratrol to these genes. In vitro, Resveratrol pretreatment alleviated LPS-induced ALI by modulating the core genes.
Conclusion: The four genes may serve as diagnostic biomarkers for sepsis-ALI. Resveratrol represents a potential therapeutic strategy by targeting these genes.
期刊介绍:
An international, peer-reviewed, open access, online journal that welcomes laboratory and clinical findings on the molecular basis, cell biology and pharmacology of inflammation.