Single-cell sequencing analysis and multiple machine learning methods identified immune-associated SERPINB1 and CPEB4 as novel biomarkers for COVID-19-induced ARDS

IF 2.1 3区 生物学 Q2 MULTIDISCIPLINARY SCIENCES
Hua Yang, Wenjing Wang, Junnan Huang, Yan Yan, Shan Wang, Qianran Shen, Jingjie Li, Tianbo Jin
{"title":"Single-cell sequencing analysis and multiple machine learning methods identified immune-associated SERPINB1 and CPEB4 as novel biomarkers for COVID-19-induced ARDS","authors":"Hua Yang,&nbsp;Wenjing Wang,&nbsp;Junnan Huang,&nbsp;Yan Yan,&nbsp;Shan Wang,&nbsp;Qianran Shen,&nbsp;Jingjie Li,&nbsp;Tianbo Jin","doi":"10.1007/s00114-025-02016-9","DOIUrl":null,"url":null,"abstract":"<div><p>Acute respiratory distress syndrome (ARDS) is a life-threatening complication of COVID-19, often resulting in respiratory failure and high mortality. Identifying effective molecular biomarkers is crucial for understanding its pathogenesis and improving diagnosis and treatment strategies. We analyzed transcriptomic and single-cell RNA-seq data from public datasets (GSE172114, GSE149878, and GSE213313). Differentially expressed genes (DEGs) were identified using the limma package and weighted gene co-expression network analysis (WGCNA). Single-cell analysis was used to define cell-type–specific expression. Three machine learning algorithms—LASSO, SVM-RFE, and Random Forest—were applied to identify robust hub genes. External dataset GSE213313 was used for validation. CIBERSORT was applied to estimate immune cell infiltration in ARDS tissues. We identified 915 DEGs between COVID-19-induced ARDS and controls, mainly enriched in immune receptor activity and cytokine signaling. Through integrative machine learning and validation, SERPINB1 and CPEB4 were identified as key genes, with strong diagnostic performance (AUCs: 0.940 and 0.948, respectively). Immune infiltration analysis revealed that both genes were highly correlated with neutrophils, and also associated with B memory cells, T cells, NK cells, monocytes, and mast cells. GSEA showed these genes were involved in immune and inflammatory pathways, indicating functional relevance in ARDS. SERPINB1 and CPEB4 were identified as novel immune-related biomarkers for COVID-19-induced ARDS. Their strong association with neutrophil infiltration suggests that they may play critical roles in disease progression. These findings provide new insights into immune mechanisms and offer promising targets for early diagnosis and therapeutic intervention in ARDS.</p></div>","PeriodicalId":794,"journal":{"name":"The Science of Nature","volume":"112 5","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Science of Nature","FirstCategoryId":"6","ListUrlMain":"https://link.springer.com/article/10.1007/s00114-025-02016-9","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Acute respiratory distress syndrome (ARDS) is a life-threatening complication of COVID-19, often resulting in respiratory failure and high mortality. Identifying effective molecular biomarkers is crucial for understanding its pathogenesis and improving diagnosis and treatment strategies. We analyzed transcriptomic and single-cell RNA-seq data from public datasets (GSE172114, GSE149878, and GSE213313). Differentially expressed genes (DEGs) were identified using the limma package and weighted gene co-expression network analysis (WGCNA). Single-cell analysis was used to define cell-type–specific expression. Three machine learning algorithms—LASSO, SVM-RFE, and Random Forest—were applied to identify robust hub genes. External dataset GSE213313 was used for validation. CIBERSORT was applied to estimate immune cell infiltration in ARDS tissues. We identified 915 DEGs between COVID-19-induced ARDS and controls, mainly enriched in immune receptor activity and cytokine signaling. Through integrative machine learning and validation, SERPINB1 and CPEB4 were identified as key genes, with strong diagnostic performance (AUCs: 0.940 and 0.948, respectively). Immune infiltration analysis revealed that both genes were highly correlated with neutrophils, and also associated with B memory cells, T cells, NK cells, monocytes, and mast cells. GSEA showed these genes were involved in immune and inflammatory pathways, indicating functional relevance in ARDS. SERPINB1 and CPEB4 were identified as novel immune-related biomarkers for COVID-19-induced ARDS. Their strong association with neutrophil infiltration suggests that they may play critical roles in disease progression. These findings provide new insights into immune mechanisms and offer promising targets for early diagnosis and therapeutic intervention in ARDS.

单细胞测序分析和多种机器学习方法鉴定出免疫相关的SERPINB1和CPEB4是covid -19诱导的ARDS的新生物标志物
急性呼吸窘迫综合征(ARDS)是COVID-19的一种危及生命的并发症,通常导致呼吸衰竭和高死亡率。识别有效的分子生物标志物对于了解其发病机制和改进诊断和治疗策略至关重要。我们分析了来自公共数据集(GSE172114、GSE149878和GSE213313)的转录组学和单细胞RNA-seq数据。采用limma包和加权基因共表达网络分析(WGCNA)鉴定差异表达基因(deg)。单细胞分析用于确定细胞类型特异性表达。三种机器学习算法- lasso, SVM-RFE和Random forest -被用于识别鲁棒轮毂基因。使用外部数据集GSE213313进行验证。应用CIBERSORT评估ARDS组织中免疫细胞的浸润情况。我们在covid -19诱导的ARDS与对照组之间鉴定出915个deg,主要富集免疫受体活性和细胞因子信号。通过综合机器学习和验证,鉴定出SERPINB1和CPEB4为关键基因,具有较强的诊断效能(auc分别为0.940和0.948)。免疫浸润分析显示,这两个基因与中性粒细胞高度相关,也与B记忆细胞、T细胞、NK细胞、单核细胞和肥大细胞相关。GSEA显示这些基因参与免疫和炎症途径,提示ARDS的功能相关性。SERPINB1和CPEB4被确定为covid -19诱导的ARDS的新型免疫相关生物标志物。它们与中性粒细胞浸润密切相关,表明它们可能在疾病进展中起关键作用。这些发现为ARDS的免疫机制提供了新的见解,并为ARDS的早期诊断和治疗干预提供了有希望的靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The Science of Nature
The Science of Nature 综合性期刊-综合性期刊
CiteScore
3.40
自引率
0.00%
发文量
47
审稿时长
4-8 weeks
期刊介绍: The Science of Nature - Naturwissenschaften - is Springer''s flagship multidisciplinary science journal. The journal is dedicated to the fast publication and global dissemination of high-quality research and invites papers, which are of interest to the broader community in the biological sciences. Contributions from the chemical, geological, and physical sciences are welcome if contributing to questions of general biological significance. Particularly welcomed are contributions that bridge between traditionally isolated areas and attempt to increase the conceptual understanding of systems and processes that demand an interdisciplinary approach.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信