Youssra Boumait, Boutaina Ettetuani, Manal Chrairi, Afaf Lamzouri, Rajaa Chahboune
Genes, 16(6). Published 2025-06-18. DOI: 10.3390/genes16060715 (https://doi.org/10.3390/genes16060715). Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12192713/pdf/
Identification of Gene Expression Biomarkers Predictive of Latent Tuberculosis Infection Using Machine Learning Approaches.
Latent tuberculosis infection (LTBi) affects nearly a quarter of the global population, yet current diagnostic methods are limited by low sensitivity and specificity. This study applied an integrative bioinformatics framework, incorporating machine learning techniques, to identify robust gene expression biomarkers associated with LTBi. We analyzed four publicly available transcriptomic datasets from peripheral blood mononuclear cells (PBMCs), representing latent, active, and healthy states. Differentially expressed genes (DEGs) were identified, followed by gene ontology (GO) enrichment, functional clustering, and miRNA interaction analysis. Semantic similarity, unsupervised clustering, and pathway enrichment were applied to refine the gene list. Key biomarkers were prioritized using receiver operating characteristic (ROC) curve analysis, with CCL2 and CXCL10 emerging as top candidates (AUC > 0.85). This multi-step approach demonstrates the potential of combining transcriptomic profiling with established machine learning and bioinformatics tools to uncover candidate biomarkers for improved LTBi detection, and it also provides a foundation for future experimental validation.
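The ROC-based prioritization step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the gene names (`gene_a`, `gene_b`, `gene_c`), the expression values, and the helper functions `roc_auc` and `rank_genes` are all hypothetical, and scoring each gene by its better direction is an assumption made here for illustration. Only the AUC > 0.85 cutoff comes from the abstract. The sketch computes AUC in its rank form, as the probability that a randomly chosen latent-infection sample has a higher value than a randomly chosen control.

```python
from typing import Dict, List, Sequence, Tuple


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half, which matches the area under the empirical ROC curve.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample in each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rank_genes(
    expression: Dict[str, List[float]],
    labels: Sequence[int],
    threshold: float = 0.85,
) -> List[Tuple[str, float]]:
    """Return (gene, auc) pairs with AUC above the threshold, best first."""
    results = []
    for gene, values in expression.items():
        auc = roc_auc(values, labels)
        # Assumption for this sketch: a down-regulated marker is as useful
        # as an up-regulated one, so score each gene by its better direction.
        auc = max(auc, 1.0 - auc)
        if auc > threshold:
            results.append((gene, auc))
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Toy data invented for illustration: 1 = latent infection, 0 = healthy control.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
expression = {
    "gene_a": [8.1, 7.9, 8.4, 7.2, 6.0, 6.3, 5.8, 7.5],  # mostly higher in cases
    "gene_b": [5.0, 6.1, 5.5, 6.0, 5.2, 6.2, 5.4, 5.9],  # no separation
    "gene_c": [3.1, 3.0, 2.8, 3.3, 4.5, 4.9, 4.2, 4.7],  # lower in cases
}
top = rank_genes(expression, labels)
print(top)
```

On this toy input, `gene_b` is filtered out because its AUC is 0.5, while `gene_a` and `gene_c` pass the cutoff. With real data, an AUC estimated on the same samples used for gene selection would be optimistic, so a held-out dataset or cross-validation would be needed before reading much into the ranking.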
About the journal:
Genes (ISSN 2073-4425) is an international, peer-reviewed open access journal which provides an advanced forum for studies related to genes, genetics and genomics. It publishes reviews, research articles, communications and technical notes. There is no restriction on the length of the papers and we encourage scientists to publish their results in as much detail as possible.