Screening of pathologically significant diagnostic biomarkers in tears of thyroid eye disease based on bioinformatic analysis and machine learning.

IF 4.6 2区 生物学 Q2 CELL BIOLOGY
Frontiers in Cell and Developmental Biology Pub Date : 2024-10-30 eCollection Date: 2024-01-01 DOI:10.3389/fcell.2024.1486170
Xingyi Shu, Chengcheng Zeng, Yanfei Zhu, Yuqing Chen, Xiao Huang, Ruili Wei
{"title":"Screening of pathologically significant diagnostic biomarkers in tears of thyroid eye disease based on bioinformatic analysis and machine learning.","authors":"Xingyi Shu, Chengcheng Zeng, Yanfei Zhu, Yuqing Chen, Xiao Huang, Ruili Wei","doi":"10.3389/fcell.2024.1486170","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Lacrimal gland enlargement is a common pathological change in patients with thyroid eye disease (TED). Tear fluid has emerged as a new source of diagnostic biomarkers, but tear-based diagnostic biomarkers for TED with high efficacy are still lacking.</p><p><strong>Objective: </strong>We aim to investigate genes associated with TED-associated lacrimal gland lesions. Additionally, we seek to identify potential biomarkers for diagnosing TED in tear fluid.</p><p><strong>Methods: </strong>We obtained two expression profiling datasets related to TED lacrimal gland samples from the Gene Expression Omnibus (GEO). Subsequently, we combined the two separate datasets and conducted differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) on the obtained integrated dataset. The genes were employed for Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The genes were intersected with the secretory proteins profile to get the potential proteins in the tear fluid. Machine learning techniques were then employed to identify optimal biomarkers and develop a diagnostic nomogram for predicting TED. Finally, gene set enrichment analysis (GSEA) and immune infiltration analysis were conducted on screened hub genes to further elucidate their potential mechanisms in TED.</p><p><strong>Results: </strong>In our analysis of the integrated TED dataset, we identified 2,918 key module genes and 157 differentially expressed genes and finally obtained 84 lacrimal-associated key genes. Enrichment analysis disclosed that these 84 genes primarily pertain to endoplasmic reticulum organization. After intersecting with the secretory proteins, 13 lacrimal gland-associated secretory protein genes (LaSGs) were identified. The results from machine learning indicated the substantial diagnostic value of dyslexia associated gene (KIAA0319) and peroxiredoxin4 (PRDX4) in TED-associated lacrimal gland lesions. The two hub genes were chosen as candidate biomarkers in tear fluid and employed to establish a diagnostic nomogram. Furthermore, single-gene GSEA results and immune cell infiltration analysis unveiled immune dysregulation in the lacrimal gland of TED, with KIAA0319 and PRDX4 showing significant associations with infiltrating immune cells.</p><p><strong>Conclusions: </strong>We uncovered the distinct pathophysiology of TED-associated lacrimal gland enlargement compared to TED-associated orbital adipose tissue enlargement. We have demonstrated the endoplasmic reticulum-related pathways involved in TED-associated lacrimal gland lesions and established a diagnostic nomogram for TED utilizing KIAA0319 and PRDX4 through integrated bioinformatics analysis. This contribution offers novel insights for non-invasive, prospective diagnostic approaches in the context of TED.</p>","PeriodicalId":12448,"journal":{"name":"Frontiers in Cell and Developmental Biology","volume":"12 ","pages":"1486170"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561714/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cell and Developmental Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fcell.2024.1486170","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Lacrimal gland enlargement is a common pathological change in patients with thyroid eye disease (TED). Tear fluid has emerged as a new source of diagnostic biomarkers, but tear-based diagnostic biomarkers for TED with high efficacy are still lacking.

Objective: We aim to investigate genes associated with TED-associated lacrimal gland lesions. Additionally, we seek to identify potential biomarkers for diagnosing TED in tear fluid.

Methods: We obtained two expression profiling datasets related to TED lacrimal gland samples from the Gene Expression Omnibus (GEO). Subsequently, we combined the two separate datasets and conducted differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) on the obtained integrated dataset. The genes were employed for Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The genes were intersected with the secretory proteins profile to get the potential proteins in the tear fluid. Machine learning techniques were then employed to identify optimal biomarkers and develop a diagnostic nomogram for predicting TED. Finally, gene set enrichment analysis (GSEA) and immune infiltration analysis were conducted on screened hub genes to further elucidate their potential mechanisms in TED.

Results: In our analysis of the integrated TED dataset, we identified 2,918 key module genes and 157 differentially expressed genes and finally obtained 84 lacrimal-associated key genes. Enrichment analysis disclosed that these 84 genes primarily pertain to endoplasmic reticulum organization. After intersecting with the secretory proteins, 13 lacrimal gland-associated secretory protein genes (LaSGs) were identified. The results from machine learning indicated the substantial diagnostic value of dyslexia associated gene (KIAA0319) and peroxiredoxin4 (PRDX4) in TED-associated lacrimal gland lesions. The two hub genes were chosen as candidate biomarkers in tear fluid and employed to establish a diagnostic nomogram. Furthermore, single-gene GSEA results and immune cell infiltration analysis unveiled immune dysregulation in the lacrimal gland of TED, with KIAA0319 and PRDX4 showing significant associations with infiltrating immune cells.

Conclusions: We uncovered the distinct pathophysiology of TED-associated lacrimal gland enlargement compared to TED-associated orbital adipose tissue enlargement. We have demonstrated the endoplasmic reticulum-related pathways involved in TED-associated lacrimal gland lesions and established a diagnostic nomogram for TED utilizing KIAA0319 and PRDX4 through integrated bioinformatics analysis. This contribution offers novel insights for non-invasive, prospective diagnostic approaches in the context of TED.

基于生物信息分析和机器学习筛选甲状腺眼病泪液中具有重要病理诊断意义的生物标记物。
背景:泪腺肿大是甲状腺眼病(TED)患者常见的病理变化。泪液已成为诊断生物标志物的新来源,但基于泪液的TED高效诊断生物标志物仍然缺乏:我们旨在研究与 TED 相关的泪腺病变的相关基因。此外,我们还试图确定诊断泪液中 TED 的潜在生物标志物:我们从基因表达总库(GEO)中获得了两个与 TED 泪腺样本相关的表达谱数据集。随后,我们合并了这两个独立的数据集,并对整合后的数据集进行了差异基因表达分析和加权基因共表达网络分析(WGCNA)。这些基因被用于基因本体(GO)富集分析和京都基因组百科全书(KEGG)通路分析。将这些基因与分泌蛋白图谱进行交叉,以获得泪液中的潜在蛋白。然后采用机器学习技术来确定最佳生物标志物,并开发出用于预测 TED 的诊断提名图。最后,对筛选出的枢纽基因进行了基因组富集分析(GSEA)和免疫浸润分析,以进一步阐明它们在TED中的潜在机制:结果:在对整合的 TED 数据集进行分析时,我们发现了 2,918 个关键模块基因和 157 个差异表达基因,并最终获得了 84 个泪腺相关关键基因。富集分析显示,这84个基因主要与内质网组织有关。在与分泌蛋白交叉后,确定了 13 个泪腺相关分泌蛋白基因(LaSGs)。机器学习的结果表明,阅读障碍相关基因(KIAA0319)和过氧化物歧化酶4(PRDX4)在TED相关泪腺病变中具有重要的诊断价值。这两个枢纽基因被选为泪液中的候选生物标志物,并被用于建立诊断提名图。此外,单基因GSEA结果和免疫细胞浸润分析揭示了TED泪腺中的免疫失调,其中KIAA0319和PRDX4与浸润免疫细胞有显著关联:我们发现了TED相关泪腺肿大与TED相关眼眶脂肪组织肿大不同的病理生理学。我们证明了参与 TED 相关泪腺病变的内质网相关通路,并通过综合生物信息学分析,利用 KIAA0319 和 PRDX4 建立了 TED 诊断提名图。这一贡献为 TED 的无创、前瞻性诊断方法提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Cell and Developmental Biology
Frontiers in Cell and Developmental Biology Biochemistry, Genetics and Molecular Biology-Cell Biology
CiteScore
9.70
自引率
3.60%
发文量
2531
审稿时长
12 weeks
期刊介绍: Frontiers in Cell and Developmental Biology is a broad-scope, interdisciplinary open-access journal, focusing on the fundamental processes of life, led by Prof Amanda Fisher and supported by a geographically diverse, high-quality editorial board. The journal welcomes submissions on a wide spectrum of cell and developmental biology, covering intracellular and extracellular dynamics, with sections focusing on signaling, adhesion, migration, cell death and survival and membrane trafficking. Additionally, the journal offers sections dedicated to the cutting edge of fundamental and translational research in molecular medicine and stem cell biology. With a collaborative, rigorous and transparent peer-review, the journal produces the highest scientific quality in both fundamental and applied research, and advanced article level metrics measure the real-time impact and influence of each publication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信