Machine learning model based on SERPING1, C1QB, and C1QC: A novel diagnostic approach for latent tuberculosis infection

iLABMED Pub Date : 2024-11-16 DOI:10.1002/ila2.65
Linsheng Li, Li Zhuang, Ling Yang, Zhaoyang Ye, Ruizi Ni, Yajing An, Weiguo Zhao, Wenping Gong
{"title":"Machine learning model based on SERPING1, C1QB, and C1QC: A novel diagnostic approach for latent tuberculosis infection","authors":"Linsheng Li,&nbsp;Li Zhuang,&nbsp;Ling Yang,&nbsp;Zhaoyang Ye,&nbsp;Ruizi Ni,&nbsp;Yajing An,&nbsp;Weiguo Zhao,&nbsp;Wenping Gong","doi":"10.1002/ila2.65","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Latent tuberculosis infection (LTBI) is a significant source of active tuberculosis (ATB), yet distinguishing between them is challenging because specific biomarkers are lacking.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We analyzed four microarray datasets (GSE19491, GSE37250, GSE54992, GSE28623) from the gene expression omnibus to identify differentially expressed genes (DEGs). Using protein–protein interaction (PPI) networks and LASSO-SVM algorithms, we selected three candidate biomarkers and evaluated their diagnostic efficacy. The expression levels of core genes were validated by RNA sequencing of healthy, ATB, and LTBI groups in a real-world cohort. We conducted Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses, predicted shared upstream miRNAs, constructed miRNA–hub and transcription factor (TF)–hub gene networks, and performed immune infiltration analysis.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Three hub genes (<i>SERPING1</i>, <i>C1QC</i>, <i>C1QB</i>) were identified from 45 DEGs by PPI networks and machine learning screening. The diagnostic model based on the three hub genes had an area under the curve (AUC) value of 0.843 in the training set GSE19491 and 0.865 in the validation set GSE28623. Real-world transcriptome sequencing confirmed the expression trends of the hub genes across healthy, LTBI, and ATB groups. GO analysis showed that the 45 hub genes were primarily associated with immune inflammatory responses and pattern recognition receptors, whereas KEGG analysis indicated enrichment in complement and coagulation cascades. The miRNA–hub and TF–hub gene network analysis identified nine miRNAs and the zinc finger TF GATA2 as potential co-regulators of <i>SERPING1</i>, <i>C1QC</i>, and <i>C1QB</i>. Immune cell infiltration analysis identified significant differences in the immune microenvironment between LTBI and ATB, with macrophages and natural killer cells showing significant correlations with tuberculosis infection.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>The diagnostic model with <i>SERPING1</i>, <i>C1QC</i>, and <i>C1QB</i> shows promise in distinguishing LTBI from ATB, indicating its potential as a diagnostic tool.</p>\n </section>\n </div>","PeriodicalId":100656,"journal":{"name":"iLABMED","volume":"2 4","pages":"248-265"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ila2.65","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"iLABMED","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ila2.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Latent tuberculosis infection (LTBI) is a significant source of active tuberculosis (ATB), yet distinguishing between them is challenging because specific biomarkers are lacking.

Methods

We analyzed four microarray datasets (GSE19491, GSE37250, GSE54992, GSE28623) from the gene expression omnibus to identify differentially expressed genes (DEGs). Using protein–protein interaction (PPI) networks and LASSO-SVM algorithms, we selected three candidate biomarkers and evaluated their diagnostic efficacy. The expression levels of core genes were validated by RNA sequencing of healthy, ATB, and LTBI groups in a real-world cohort. We conducted Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses, predicted shared upstream miRNAs, constructed miRNA–hub and transcription factor (TF)–hub gene networks, and performed immune infiltration analysis.

Results

Three hub genes (SERPING1, C1QC, C1QB) were identified from 45 DEGs by PPI networks and machine learning screening. The diagnostic model based on the three hub genes had an area under the curve (AUC) value of 0.843 in the training set GSE19491 and 0.865 in the validation set GSE28623. Real-world transcriptome sequencing confirmed the expression trends of the hub genes across healthy, LTBI, and ATB groups. GO analysis showed that the 45 hub genes were primarily associated with immune inflammatory responses and pattern recognition receptors, whereas KEGG analysis indicated enrichment in complement and coagulation cascades. The miRNA–hub and TF–hub gene network analysis identified nine miRNAs and the zinc finger TF GATA2 as potential co-regulators of SERPING1, C1QC, and C1QB. Immune cell infiltration analysis identified significant differences in the immune microenvironment between LTBI and ATB, with macrophages and natural killer cells showing significant correlations with tuberculosis infection.

Conclusion

The diagnostic model with SERPING1, C1QC, and C1QB shows promise in distinguishing LTBI from ATB, indicating its potential as a diagnostic tool.

Abstract Image

基于SERPING1、C1QB和C1QC的机器学习模型:一种新的潜伏性结核感染诊断方法
潜伏结核感染(LTBI)是活动性结核(ATB)的重要来源,但由于缺乏特定的生物标志物,区分它们具有挑战性。方法分析基因表达综合数据库中4个基因芯片数据集(GSE19491、GSE37250、GSE54992、GSE28623),鉴定差异表达基因。利用蛋白-蛋白相互作用(PPI)网络和LASSO-SVM算法,我们选择了三个候选生物标志物,并评估了它们的诊断效果。核心基因的表达水平通过现实世界队列中健康、ATB和LTBI组的RNA测序进行验证。我们进行了基因本体(Gene Ontology)和京都基因与基因组百科全书(KEGG)通路富集分析,预测了共享的上游mirna,构建了miRNA-hub和转录因子(TF) -hub基因网络,并进行了免疫浸润分析。结果通过PPI网络和机器学习筛选,从45个基因中鉴定出3个中心基因(SERPING1、C1QC、C1QB)。基于三个枢纽基因的诊断模型在训练集GSE19491和验证集GSE28623的曲线下面积(AUC)分别为0.843和0.865。真实世界转录组测序证实了枢纽基因在健康、LTBI和ATB组中的表达趋势。GO分析显示,45个枢纽基因主要与免疫炎症反应和模式识别受体相关,而KEGG分析显示补体和凝血级联富集。miRNA-hub和TF - hub基因网络分析发现,9个mirna和锌指TF GATA2可能是SERPING1、C1QC和C1QB的共同调节因子。免疫细胞浸润分析发现,LTBI和ATB的免疫微环境存在显著差异,巨噬细胞和自然杀伤细胞与结核感染存在显著相关性。结论基于SERPING1、C1QC和C1QB的诊断模型在区分LTBI和ATB方面有良好的前景,表明其作为诊断工具的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信