Xutao Wang , Katie Harper , Pranay Sinha , W. Evan Johnson , Prasad Patil
{"title":"Analysis of the cross-study replicability of tuberculosis gene signatures using 49 curated human transcriptomic datasets","authors":"Xutao Wang , Katie Harper , Pranay Sinha , W. Evan Johnson , Prasad Patil","doi":"10.1016/j.tube.2025.102649","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Tuberculosis (TB) is the leading cause of infectious disease mortality worldwide. Numerous host blood-based gene expression signatures have been proposed in the literature as alternative tools for diagnosing TB infection. However, the generalizability of these signatures to different patient contexts is not well-characterized. There is a pressing need for a well-curated database of TB gene expression studies for the systematic assessment of existing and newly developed TB gene signatures.</div></div><div><h3>Results</h3><div>We built curatedTBData, a manually-curated database of 49 human TB transcriptomic studies. This data resource is freely available through GitHub and as an R Bioconductor package that allows users to validate new and existing biomarkers without the challenges of harmonizing heterogeneous studies. We demonstrate the use of this data resource with cross-study comparisons for 72 human host blood-based TB gene signatures. For the comparison of subjects with active TB from healthy controls, 19 gene signatures had weighted mean AUC of 0.90 or greater, with the highest result of 0.94. In active TB disease versus latent TB infection, 7 gene signatures had weighted mean AUC of 0.90 or greater, with a maximum of 0.93.</div></div><div><h3>Conclusions</h3><div>The curatedTBData data package offers a comprehensive resource of curated human blood-based gene expression and clinically annotated data. This resource will facilitate the development of new signatures that are generalizable across cohorts or more applicable to specific subsets of patients.</div></div>","PeriodicalId":23383,"journal":{"name":"Tuberculosis","volume":"153 ","pages":"Article 102649"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tuberculosis","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1472979225000447","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Tuberculosis (TB) is the leading cause of infectious disease mortality worldwide. Numerous host blood-based gene expression signatures have been proposed in the literature as alternative tools for diagnosing TB infection. However, the generalizability of these signatures to different patient contexts is not well-characterized. There is a pressing need for a well-curated database of TB gene expression studies for the systematic assessment of existing and newly developed TB gene signatures.
Results
We built curatedTBData, a manually-curated database of 49 human TB transcriptomic studies. This data resource is freely available through GitHub and as an R Bioconductor package that allows users to validate new and existing biomarkers without the challenges of harmonizing heterogeneous studies. We demonstrate the use of this data resource with cross-study comparisons for 72 human host blood-based TB gene signatures. For the comparison of subjects with active TB from healthy controls, 19 gene signatures had weighted mean AUC of 0.90 or greater, with the highest result of 0.94. In active TB disease versus latent TB infection, 7 gene signatures had weighted mean AUC of 0.90 or greater, with a maximum of 0.93.
Conclusions
The curatedTBData data package offers a comprehensive resource of curated human blood-based gene expression and clinically annotated data. This resource will facilitate the development of new signatures that are generalizable across cohorts or more applicable to specific subsets of patients.
期刊介绍:
Tuberculosis is a speciality journal focusing on basic experimental research on tuberculosis, notably on bacteriological, immunological and pathogenesis aspects of the disease. The journal publishes original research and reviews on the host response and immunology of tuberculosis and the molecular biology, genetics and physiology of the organism, however discourages submissions with a meta-analytical focus (for example, articles based on searches of published articles in public electronic databases, especially where there is lack of evidence of the personal involvement of authors in the generation of such material). We do not publish Clinical Case-Studies.
Areas on which submissions are welcomed include:
-Clinical TrialsDiagnostics-
Antimicrobial resistance-
Immunology-
Leprosy-
Microbiology, including microbial physiology-
Molecular epidemiology-
Non-tuberculous Mycobacteria-
Pathogenesis-
Pathology-
Vaccine development.
This Journal does not accept case-reports.
The resurgence of interest in tuberculosis has accelerated the pace of relevant research and Tuberculosis has grown with it, as the only journal dedicated to experimental biomedical research in tuberculosis.