机器学习在波多黎各改善登革热诊断。

IF 2.3 4区医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

Tropical Medicine & International Health Pub Date : 2025-09-15 DOI:10.1111/tmi.70036

Zachary J Madewell, Dania M Rodriguez, Maile B Thayer, Vanessa Rivera-Amill, Jomil Torres Aponte, Melissa Marzan-Rodriguez, Gabriela Paz-Bailey, Laura E Adams, Joshua M Wong

{"title":"机器学习在波多黎各改善登革热诊断。","authors":"Zachary J Madewell, Dania M Rodriguez, Maile B Thayer, Vanessa Rivera-Amill, Jomil Torres Aponte, Melissa Marzan-Rodriguez, Gabriela Paz-Bailey, Laura E Adams, Joshua M Wong","doi":"10.1111/tmi.70036","DOIUrl":null,"url":null,"abstract":"Objectives: Diagnosing dengue accurately, especially in resource-limited settings, remains challenging due to overlapping symptoms with other febrile illnesses and limitations of current diagnostic methods. This study aimed to develop machine learning models that leverage readily available clinical data to improve diagnostic accuracy for dengue, potentially offering a more accessible and rapid diagnostic tool for healthcare providers.Methods: We used data from the Sentinel Enhanced Dengue Surveillance System in Puerto Rico (May 2012-June 2024). The Sentinel Enhanced Dengue Surveillance System primarily targets acute febrile illness but also includes cases with other symptoms during outbreaks (e.g., Zika and COVID-19). Machine learning models (logistic regression, random forest, support vector machine, artificial neural network, adaptive boosting, light gradient boosting machine [LightGBM] and extreme gradient boosting [XGBoost]) were evaluated across different feature sets, including demographic, clinical, laboratory and epidemiological variables. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), where higher AUC values indicate better performance in distinguishing dengue cases from non-dengue cases.Results: Among 49,679 patients in SEDSS, 1640 laboratory-confirmed dengue cases were identified. The XGBoost and LightGBM models achieved the highest diagnostic accuracy, with AUCs exceeding 90%, particularly with comprehensive feature sets. Incorporating predictors such as monthly dengue incidence, leukopenia, thrombocytopenia, rash, age and absence of nasal discharge significantly enhanced model sensitivity and specificity for diagnosing dengue. Adding more relevant clinical and epidemiological features consistently improved the models' ability to correctly identify dengue cases.Conclusions: Machine learning models, especially XGBoost and LightGBM, show promise for improving diagnostic accuracy for dengue using widely accessible clinical data, even in resource-limited settings. Future research should focus on developing user-friendly tools, such as mobile apps, web-based platforms, or clinical decision systems integrated into electronic health records, to implement these models in clinical practice and exploring their application for predicting dengue.","PeriodicalId":23962,"journal":{"name":"Tropical Medicine & International Health","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning for Improved Dengue Diagnosis in Puerto Rico.\",\"authors\":\"Zachary J Madewell, Dania M Rodriguez, Maile B Thayer, Vanessa Rivera-Amill, Jomil Torres Aponte, Melissa Marzan-Rodriguez, Gabriela Paz-Bailey, Laura E Adams, Joshua M Wong\",\"doi\":\"10.1111/tmi.70036\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives: Diagnosing dengue accurately, especially in resource-limited settings, remains challenging due to overlapping symptoms with other febrile illnesses and limitations of current diagnostic methods. This study aimed to develop machine learning models that leverage readily available clinical data to improve diagnostic accuracy for dengue, potentially offering a more accessible and rapid diagnostic tool for healthcare providers.Methods: We used data from the Sentinel Enhanced Dengue Surveillance System in Puerto Rico (May 2012-June 2024). The Sentinel Enhanced Dengue Surveillance System primarily targets acute febrile illness but also includes cases with other symptoms during outbreaks (e.g., Zika and COVID-19). Machine learning models (logistic regression, random forest, support vector machine, artificial neural network, adaptive boosting, light gradient boosting machine [LightGBM] and extreme gradient boosting [XGBoost]) were evaluated across different feature sets, including demographic, clinical, laboratory and epidemiological variables. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), where higher AUC values indicate better performance in distinguishing dengue cases from non-dengue cases.Results: Among 49,679 patients in SEDSS, 1640 laboratory-confirmed dengue cases were identified. The XGBoost and LightGBM models achieved the highest diagnostic accuracy, with AUCs exceeding 90%, particularly with comprehensive feature sets. Incorporating predictors such as monthly dengue incidence, leukopenia, thrombocytopenia, rash, age and absence of nasal discharge significantly enhanced model sensitivity and specificity for diagnosing dengue. Adding more relevant clinical and epidemiological features consistently improved the models' ability to correctly identify dengue cases.Conclusions: Machine learning models, especially XGBoost and LightGBM, show promise for improving diagnostic accuracy for dengue using widely accessible clinical data, even in resource-limited settings. Future research should focus on developing user-friendly tools, such as mobile apps, web-based platforms, or clinical decision systems integrated into electronic health records, to implement these models in clinical practice and exploring their application for predicting dengue.\",\"PeriodicalId\":23962,\"journal\":{\"name\":\"Tropical Medicine & International Health\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tropical Medicine & International Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/tmi.70036\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tropical Medicine & International Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/tmi.70036","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

摘要

目的：准确诊断登革热，特别是在资源有限的情况下，由于与其他发热性疾病的症状重叠以及当前诊断方法的局限性，仍然具有挑战性。本研究旨在开发机器学习模型，利用现成的临床数据来提高登革热的诊断准确性，为医疗保健提供者提供更容易获得和快速的诊断工具。方法：我们使用波多黎各哨兵加强登革热监测系统（2012年5月- 2024年6月）的数据。哨点强化登革热监测系统主要针对急性发热性疾病，但也包括疫情期间出现其他症状的病例（如寨卡和COVID-19）。机器学习模型（逻辑回归、随机森林、支持向量机、人工神经网络、自适应增强、光梯度增强机[LightGBM]和极端梯度增强[XGBoost]）在不同的特征集上进行了评估，包括人口统计学、临床、实验室和流行病学变量。使用受试者工作特征曲线（AUC）下的面积来评估模型的性能，AUC值越高表明在区分登革热病例和非登革热病例方面的性能越好。结果：在49,679例SEDSS患者中，发现实验室确诊登革热病例1640例。XGBoost和LightGBM模型实现了最高的诊断准确性，auc超过90%，特别是在综合功能集方面。纳入登革热月发病率、白细胞减少、血小板减少、皮疹、年龄和无鼻分泌物等预测因素显著提高了模型诊断登革热的敏感性和特异性。增加更多相关的临床和流行病学特征不断提高模型正确识别登革热病例的能力。结论：机器学习模型，特别是XGBoost和LightGBM，即使在资源有限的情况下，也有望利用广泛可获得的临床数据提高登革热诊断的准确性。未来的研究应侧重于开发用户友好的工具，如移动应用程序、基于网络的平台或集成到电子健康记录中的临床决策系统，以便在临床实践中实施这些模型，并探索它们在预测登革热方面的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Machine Learning for Improved Dengue Diagnosis in Puerto Rico.

Objectives: Diagnosing dengue accurately, especially in resource-limited settings, remains challenging due to overlapping symptoms with other febrile illnesses and limitations of current diagnostic methods. This study aimed to develop machine learning models that leverage readily available clinical data to improve diagnostic accuracy for dengue, potentially offering a more accessible and rapid diagnostic tool for healthcare providers.

Methods: We used data from the Sentinel Enhanced Dengue Surveillance System in Puerto Rico (May 2012-June 2024). The Sentinel Enhanced Dengue Surveillance System primarily targets acute febrile illness but also includes cases with other symptoms during outbreaks (e.g., Zika and COVID-19). Machine learning models (logistic regression, random forest, support vector machine, artificial neural network, adaptive boosting, light gradient boosting machine [LightGBM] and extreme gradient boosting [XGBoost]) were evaluated across different feature sets, including demographic, clinical, laboratory and epidemiological variables. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), where higher AUC values indicate better performance in distinguishing dengue cases from non-dengue cases.

Results: Among 49,679 patients in SEDSS, 1640 laboratory-confirmed dengue cases were identified. The XGBoost and LightGBM models achieved the highest diagnostic accuracy, with AUCs exceeding 90%, particularly with comprehensive feature sets. Incorporating predictors such as monthly dengue incidence, leukopenia, thrombocytopenia, rash, age and absence of nasal discharge significantly enhanced model sensitivity and specificity for diagnosing dengue. Adding more relevant clinical and epidemiological features consistently improved the models' ability to correctly identify dengue cases.

Conclusions: Machine learning models, especially XGBoost and LightGBM, show promise for improving diagnostic accuracy for dengue using widely accessible clinical data, even in resource-limited settings. Future research should focus on developing user-friendly tools, such as mobile apps, web-based platforms, or clinical decision systems integrated into electronic health records, to implement these models in clinical practice and exploring their application for predicting dengue.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tropical Medicine & International Health 医学-公共卫生、环境卫生与职业卫生

CiteScore

4.80

自引率

0.00%

发文量

129

审稿时长

6 months

期刊介绍： Tropical Medicine & International Health is published on behalf of the London School of Hygiene and Tropical Medicine, Swiss Tropical and Public Health Institute, Foundation Tropical Medicine and International Health, Belgian Institute of Tropical Medicine and Bernhard-Nocht-Institute for Tropical Medicine. Tropical Medicine & International Health is the official journal of the Federation of European Societies for Tropical Medicine and International Health (FESTMIH).