Improving meningitis surveillance and diagnosis with machine learning: Insights from São Paulo.

IF 7.7

PLOS digital health Pub Date : 2025-07-10 eCollection Date: 2025-07-01 DOI:10.1371/journal.pdig.0000925

Audêncio Victor, Diego Augusto Medeiros Santos, Eduardo Koerich Nery, Danilo Pereira Mori, Pamella Cristina de Carvalho Lucas, Denise Cammarota, Guillermo Leonardo Florez Montero, Fabiano Novaes Barcellos Filho, Ana Lúcia Frugis Yu, Telma Regina Marques Pinto Carvalhanas

{"title":"Improving meningitis surveillance and diagnosis with machine learning: Insights from São Paulo.","authors":"Audêncio Victor, Diego Augusto Medeiros Santos, Eduardo Koerich Nery, Danilo Pereira Mori, Pamella Cristina de Carvalho Lucas, Denise Cammarota, Guillermo Leonardo Florez Montero, Fabiano Novaes Barcellos Filho, Ana Lúcia Frugis Yu, Telma Regina Marques Pinto Carvalhanas","doi":"10.1371/journal.pdig.0000925","DOIUrl":null,"url":null,"abstract":"Introduction: Meningitis, an inflammatory condition of the membranes surrounding the brain and spinal cord, can be caused by various agents. Bacterial meningitis is particularly severe due to its high morbidity and mortality rates. This study aims to develop machine learning (ML) models to classify the aetiology of bacterial meningitis using data from the Notifiable Diseases Information System (SINAN) in São Paulo State, Brazil.Methods: Data were collected from the SINAN database, including sociodemographic variables, clinical symptoms, and cerebrospinal fluid (CSF) analyses. Five ML models Random Forest, LightGBM, XGBoost, CatBoost, and AdaBoost were applied to classify meningitis cases into bacterial, fungal, viral, and other types. Models were evaluated using metrics such as AUC-ROC, accuracy, precision, recall, F1-score, and MCC.Results: The CatBoost model demonstrated superior performance, achieving an AUC-ROC of 0.95 for binary classification (bacterial vs. non-bacterial) and 0.85 for multiclass classification (Neisseria meningitidis, Streptococcus pneumoniae, and Haemophilus influenzae). XGBoost and LightGBM also showed promising results with AUC-ROC scores of 0.94 and 0.92, respectively, for binary classification. The CatBoost model exhibited high sensitivity and reasonable specificity, highlighting its applicability in the rapid and accurate diagnosis of meningitis. SHAP analysis identified variables such as leukocyte count and the presence of petechiae as influential predictors in the models.Conclusion: ML algorithms, particularly CatBoost, XGBoost, and LightGBM, proved highly effective in the differential diagnosis of meningitis, offering a valuable tool for the rapid identification of meningitis types and bacterial serogroups. These techniques can be integrated into public health protocols to improve meningitis outbreak responses and optimize patient treatment.","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 7","pages":"e0000925"},"PeriodicalIF":7.7000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12244477/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000925","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Meningitis, an inflammatory condition of the membranes surrounding the brain and spinal cord, can be caused by various agents. Bacterial meningitis is particularly severe due to its high morbidity and mortality rates. This study aims to develop machine learning (ML) models to classify the aetiology of bacterial meningitis using data from the Notifiable Diseases Information System (SINAN) in São Paulo State, Brazil.

Methods: Data were collected from the SINAN database, including sociodemographic variables, clinical symptoms, and cerebrospinal fluid (CSF) analyses. Five ML models Random Forest, LightGBM, XGBoost, CatBoost, and AdaBoost were applied to classify meningitis cases into bacterial, fungal, viral, and other types. Models were evaluated using metrics such as AUC-ROC, accuracy, precision, recall, F1-score, and MCC.

Results: The CatBoost model demonstrated superior performance, achieving an AUC-ROC of 0.95 for binary classification (bacterial vs. non-bacterial) and 0.85 for multiclass classification (Neisseria meningitidis, Streptococcus pneumoniae, and Haemophilus influenzae). XGBoost and LightGBM also showed promising results with AUC-ROC scores of 0.94 and 0.92, respectively, for binary classification. The CatBoost model exhibited high sensitivity and reasonable specificity, highlighting its applicability in the rapid and accurate diagnosis of meningitis. SHAP analysis identified variables such as leukocyte count and the presence of petechiae as influential predictors in the models.

Conclusion: ML algorithms, particularly CatBoost, XGBoost, and LightGBM, proved highly effective in the differential diagnosis of meningitis, offering a valuable tool for the rapid identification of meningitis types and bacterial serogroups. These techniques can be integrated into public health protocols to improve meningitis outbreak responses and optimize patient treatment.

Abstract Image

查看原文本刊更多论文

用机器学习改进脑膜炎监测和诊断：来自圣保罗的见解。

简介：脑膜炎是一种围绕大脑和脊髓的膜的炎症性疾病，可由多种因素引起。由于发病率和死亡率高，细菌性脑膜炎特别严重。本研究旨在开发机器学习（ML）模型，利用巴西圣保罗州法定疾病信息系统（SINAN）的数据对细菌性脑膜炎的病因进行分类。方法：从SINAN数据库收集数据，包括社会人口学变量、临床症状和脑脊液（CSF）分析。采用Random Forest、LightGBM、XGBoost、CatBoost和AdaBoost五种ML模型将脑膜炎病例分为细菌性、真菌性、病毒性和其他类型。使用AUC-ROC、准确度、精密度、召回率、f1评分和MCC等指标对模型进行评估。结果：CatBoost模型表现出优异的性能，二元分类（细菌与非细菌）的AUC-ROC为0.95，多分类（脑膜炎奈瑟菌、肺炎链球菌和流感嗜血杆菌）的AUC-ROC为0.85。XGBoost和LightGBM的二元分类AUC-ROC评分分别为0.94和0.92，也显示出很好的结果。CatBoost模型具有较高的敏感性和合理的特异性，可用于脑膜炎的快速准确诊断。SHAP分析确定了白细胞计数和斑点存在等变量作为模型中有影响的预测因子。结论：ML算法，特别是CatBoost、XGBoost和LightGBM，在脑膜炎的鉴别诊断中被证明是非常有效的，为快速识别脑膜炎类型和细菌血清群提供了有价值的工具。这些技术可纳入公共卫生规程，以改善脑膜炎疫情应对并优化患者治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PLOS digital health

自引率

0.00%

发文量