Xinghong Liu, Yi Peng, Ling Guo, Weilan Xiong, Weijiang Liao, Jiangang Fan
{"title":"揭示和验证慢性鼻窦炎伴鼻息肉中与IL-10家族相关的生物标志物:来自转录组学和单细胞RNA测序分析的见解","authors":"Xinghong Liu, Yi Peng, Ling Guo, Weilan Xiong, Weijiang Liao, Jiangang Fan","doi":"10.3389/fmolb.2024.1513951","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Extensive efforts have been made to explore members of the IL-10 family as potential therapeutic strategies for various diseases; however, their biological role in chronic rhinosinusitis with nasal polyps (CRSwNP) remains underexplored.</p><p><strong>Methods: </strong>Gene expression datasets GSE136825, GSE179265, and GSE196169 were retrieved from the Gene Expression Omnibus (GEO) for analysis. Candidate genes were identified by intersecting differentially expressed genes (DEGs) between the CRSwNP and control groups (DEGsall) with those between the high- and low-score groups within the CRSwNP cohort (DEGsNP). Biomarker selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine Recursive Feature Elimination (SVM-RFE), and the Boruta algorithm. Further refinement of biomarkers was carried out using receiver operating characteristic (ROC) analysis, with genes demonstrating an area under the curve (AUC) greater than 0.7 being considered significant. Genes exhibiting consistent expression trends and significant differences across both GSE136825 and GSE179265 were selected as potential biomarkers. Cell-type annotation was performed on GSE196169, and the expression profiles of the biomarkers across various cell types were analyzed. A competing endogenous RNA (ceRNA) network and a biomarker-drug interaction network were also established. Additionally, the mRNALocater database was utilized to determine the cellular localization of the identified biomarkers.</p><p><strong>Results: </strong>The intersection of 1817 DEGsall and 24 DEGsNP yielded 15 candidate genes. Further filtering through LASSO, SVM-RFE, and Boruta led to the identification of seven candidate biomarkers: PRB3, KRT16, MUC6, SPAG4, FGFBP1, NR4A1, and GSTA2. Six of these genes demonstrated strong diagnostic performance in GSE179265, while four biomarkers, showing both significant differences and consistent expression trends, were validated in both GSE179265 and GSE136825. Single-cell sequencing analysis of GSE196169 revealed seven distinct cell types, including endothelial cells, with the biomarkers predominantly expressed in epithelial cells. The ceRNA network comprised nine nodes and eleven edges, with only FGFBP1 exhibiting a complete lncRNA-miRNA-mRNA interaction.</p><p><strong>Discussion: </strong>This study identifies several novel biomarkers and their associated drugs for CRSwNP therapy, as well as potential therapeutic targets, such as spiperone and arnenous acid, identified through molecular docking. Ultimately, this work underscores the identification of four IL-10 family-related biomarkers, providing a theoretical foundation for future clinical research in CRSwNP.</p>","PeriodicalId":12465,"journal":{"name":"Frontiers in Molecular Biosciences","volume":"11 ","pages":"1513951"},"PeriodicalIF":3.9000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11738911/pdf/","citationCount":"0","resultStr":"{\"title\":\"Unveiling and validating biomarkers related to the IL-10 family in chronic sinusitis with nasal polyps: insights from transcriptomics and single-cell RNA sequencing analysis.\",\"authors\":\"Xinghong Liu, Yi Peng, Ling Guo, Weilan Xiong, Weijiang Liao, Jiangang Fan\",\"doi\":\"10.3389/fmolb.2024.1513951\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Extensive efforts have been made to explore members of the IL-10 family as potential therapeutic strategies for various diseases; however, their biological role in chronic rhinosinusitis with nasal polyps (CRSwNP) remains underexplored.</p><p><strong>Methods: </strong>Gene expression datasets GSE136825, GSE179265, and GSE196169 were retrieved from the Gene Expression Omnibus (GEO) for analysis. Candidate genes were identified by intersecting differentially expressed genes (DEGs) between the CRSwNP and control groups (DEGsall) with those between the high- and low-score groups within the CRSwNP cohort (DEGsNP). Biomarker selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine Recursive Feature Elimination (SVM-RFE), and the Boruta algorithm. Further refinement of biomarkers was carried out using receiver operating characteristic (ROC) analysis, with genes demonstrating an area under the curve (AUC) greater than 0.7 being considered significant. Genes exhibiting consistent expression trends and significant differences across both GSE136825 and GSE179265 were selected as potential biomarkers. Cell-type annotation was performed on GSE196169, and the expression profiles of the biomarkers across various cell types were analyzed. A competing endogenous RNA (ceRNA) network and a biomarker-drug interaction network were also established. Additionally, the mRNALocater database was utilized to determine the cellular localization of the identified biomarkers.</p><p><strong>Results: </strong>The intersection of 1817 DEGsall and 24 DEGsNP yielded 15 candidate genes. Further filtering through LASSO, SVM-RFE, and Boruta led to the identification of seven candidate biomarkers: PRB3, KRT16, MUC6, SPAG4, FGFBP1, NR4A1, and GSTA2. Six of these genes demonstrated strong diagnostic performance in GSE179265, while four biomarkers, showing both significant differences and consistent expression trends, were validated in both GSE179265 and GSE136825. Single-cell sequencing analysis of GSE196169 revealed seven distinct cell types, including endothelial cells, with the biomarkers predominantly expressed in epithelial cells. The ceRNA network comprised nine nodes and eleven edges, with only FGFBP1 exhibiting a complete lncRNA-miRNA-mRNA interaction.</p><p><strong>Discussion: </strong>This study identifies several novel biomarkers and their associated drugs for CRSwNP therapy, as well as potential therapeutic targets, such as spiperone and arnenous acid, identified through molecular docking. Ultimately, this work underscores the identification of four IL-10 family-related biomarkers, providing a theoretical foundation for future clinical research in CRSwNP.</p>\",\"PeriodicalId\":12465,\"journal\":{\"name\":\"Frontiers in Molecular Biosciences\",\"volume\":\"11 \",\"pages\":\"1513951\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-01-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11738911/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Molecular Biosciences\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fmolb.2024.1513951\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Molecular Biosciences","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmolb.2024.1513951","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
导读:广泛的努力已被用于探索IL-10家族成员作为各种疾病的潜在治疗策略;然而,它们在慢性鼻窦炎伴鼻息肉(CRSwNP)中的生物学作用仍未得到充分研究。方法:从Gene expression Omnibus (GEO)检索基因表达数据集GSE136825、GSE179265和GSE196169进行分析。候选基因是通过交叉CRSwNP和对照组(degsmall)之间的差异表达基因(DEGs)以及CRSwNP队列中高分组和低分组之间的差异表达基因(DEGsNP)来鉴定的。生物标志物的选择使用最小绝对收缩和选择算子(LASSO)、支持向量机递归特征消除(SVM-RFE)和Boruta算法进行。使用受试者工作特征(ROC)分析对生物标志物进行进一步细化,曲线下面积(AUC)大于0.7的基因被认为是显著的。选择GSE136825和GSE179265表达趋势一致且差异显著的基因作为潜在的生物标志物。对GSE196169进行细胞类型注释,分析生物标志物在不同细胞类型中的表达谱。此外,还建立了竞争性内源性RNA (ceRNA)网络和生物标志物-药物相互作用网络。此外,利用mRNALocater数据库确定已鉴定生物标志物的细胞定位。结果:1817 degsmall与24 DEGsNP交叉得到15个候选基因。通过LASSO、SVM-RFE和Boruta进一步筛选,鉴定出7个候选生物标志物:PRB3、KRT16、MUC6、SPAG4、FGFBP1、NR4A1和GSTA2。其中6个基因在GSE179265中表现出较强的诊断性能,而4个生物标志物在GSE179265和GSE136825中均被证实具有显著差异和一致的表达趋势。GSE196169的单细胞测序分析揭示了包括内皮细胞在内的七种不同的细胞类型,生物标志物主要在上皮细胞中表达。ceRNA网络包括9个节点和11个边,只有FGFBP1表现出完整的lncRNA-miRNA-mRNA相互作用。讨论:本研究通过分子对接确定了几种新的CRSwNP治疗生物标志物及其相关药物,以及潜在的治疗靶点,如spiperone和arnenous acid。最终,本工作强调了4个IL-10家族相关生物标志物的鉴定,为CRSwNP的未来临床研究提供了理论基础。
Unveiling and validating biomarkers related to the IL-10 family in chronic sinusitis with nasal polyps: insights from transcriptomics and single-cell RNA sequencing analysis.
Introduction: Extensive efforts have been made to explore members of the IL-10 family as potential therapeutic strategies for various diseases; however, their biological role in chronic rhinosinusitis with nasal polyps (CRSwNP) remains underexplored.
Methods: Gene expression datasets GSE136825, GSE179265, and GSE196169 were retrieved from the Gene Expression Omnibus (GEO) for analysis. Candidate genes were identified by intersecting differentially expressed genes (DEGs) between the CRSwNP and control groups (DEGsall) with those between the high- and low-score groups within the CRSwNP cohort (DEGsNP). Biomarker selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine Recursive Feature Elimination (SVM-RFE), and the Boruta algorithm. Further refinement of biomarkers was carried out using receiver operating characteristic (ROC) analysis, with genes demonstrating an area under the curve (AUC) greater than 0.7 being considered significant. Genes exhibiting consistent expression trends and significant differences across both GSE136825 and GSE179265 were selected as potential biomarkers. Cell-type annotation was performed on GSE196169, and the expression profiles of the biomarkers across various cell types were analyzed. A competing endogenous RNA (ceRNA) network and a biomarker-drug interaction network were also established. Additionally, the mRNALocater database was utilized to determine the cellular localization of the identified biomarkers.
Results: The intersection of 1817 DEGsall and 24 DEGsNP yielded 15 candidate genes. Further filtering through LASSO, SVM-RFE, and Boruta led to the identification of seven candidate biomarkers: PRB3, KRT16, MUC6, SPAG4, FGFBP1, NR4A1, and GSTA2. Six of these genes demonstrated strong diagnostic performance in GSE179265, while four biomarkers, showing both significant differences and consistent expression trends, were validated in both GSE179265 and GSE136825. Single-cell sequencing analysis of GSE196169 revealed seven distinct cell types, including endothelial cells, with the biomarkers predominantly expressed in epithelial cells. The ceRNA network comprised nine nodes and eleven edges, with only FGFBP1 exhibiting a complete lncRNA-miRNA-mRNA interaction.
Discussion: This study identifies several novel biomarkers and their associated drugs for CRSwNP therapy, as well as potential therapeutic targets, such as spiperone and arnenous acid, identified through molecular docking. Ultimately, this work underscores the identification of four IL-10 family-related biomarkers, providing a theoretical foundation for future clinical research in CRSwNP.
期刊介绍:
Much of contemporary investigation in the life sciences is devoted to the molecular-scale understanding of the relationships between genes and the environment — in particular, dynamic alterations in the levels, modifications, and interactions of cellular effectors, including proteins. Frontiers in Molecular Biosciences offers an international publication platform for basic as well as applied research; we encourage contributions spanning both established and emerging areas of biology. To this end, the journal draws from empirical disciplines such as structural biology, enzymology, biochemistry, and biophysics, capitalizing as well on the technological advancements that have enabled metabolomics and proteomics measurements in massively parallel throughput, and the development of robust and innovative computational biology strategies. We also recognize influences from medicine and technology, welcoming studies in molecular genetics, molecular diagnostics and therapeutics, and nanotechnology.
Our ultimate objective is the comprehensive illustration of the molecular mechanisms regulating proteins, nucleic acids, carbohydrates, lipids, and small metabolites in organisms across all branches of life.
In addition to interesting new findings, techniques, and applications, Frontiers in Molecular Biosciences will consider new testable hypotheses to inspire different perspectives and stimulate scientific dialogue. The integration of in silico, in vitro, and in vivo approaches will benefit endeavors across all domains of the life sciences.