A multicenter study on developing a prognostic model for severe fever with thrombocytopenia syndrome using machine learning.

IF 4 2区 生物学 Q2 MICROBIOLOGY
Frontiers in Microbiology Pub Date : 2025-03-19 eCollection Date: 2025-01-01 DOI:10.3389/fmicb.2025.1557922
Jian-She Xu, Kai Yang, Bin Quan, Jing Xie, Yi-Shan Zheng
{"title":"A multicenter study on developing a prognostic model for severe fever with thrombocytopenia syndrome using machine learning.","authors":"Jian-She Xu, Kai Yang, Bin Quan, Jing Xie, Yi-Shan Zheng","doi":"10.3389/fmicb.2025.1557922","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Severe Fever with Thrombocytopenia Syndrome (SFTS) is a disease caused by infection with the Severe Fever with Thrombocytopenia Syndrome virus (SFTSV), a novel Bunyavirus. Accurate prognostic assessment is crucial for developing individualized prevention and treatment strategies. However, machine learning prognostic models for SFTS are rare and need further improvement and clinical validation.</p><p><strong>Objective: </strong>This study aims to develop and validate an interpretable prognostic model based on machine learning (ML) methods to enhance the understanding of SFTS progression.</p><p><strong>Methods: </strong>This multicenter retrospective study analyzed patient data from two provinces in China. The derivation cohort included 292 patients treated at The Second Hospital of Nanjing from January 2022 to December 2023, with a 7:3 split for model training and internal validation. The external validation cohort consisted of 104 patients from The First Affiliated Hospital of Wannan Medical College during the same period. Twenty-four commonly available clinical features were selected, and the Boruta algorithm identified 12 candidate predictors, ranked by Z-scores, which were progressively incorporated into 10 machine learning models to develop prognostic models. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), accuracy, recall, and F1 score. The clinical utility of the best-performing model was evaluated through decision curve analysis (DCA) based on net benefit. Robustness was tested with 10-fold cross-validation, and feature importance was explained using SHapley Additive exPlanation (SHAP) both globally and locally.</p><p><strong>Results: </strong>Among the 10 machine learning models, the XGBoost model demonstrated the best overall discriminatory ability. Considering both AUC index and feature simplicity, a final interpretable XGBoost model with 7 key features was constructed. The model showed high predictive accuracy for patient outcomes in both internal (AUC = 0.911, 95% CI: 0.842-0.967) and external validations (AUC = 0.891, 95% CI: 0.786-0.977). A clinical tool based on this model has been developed and implemented using the Streamlit framework.</p><p><strong>Conclusion: </strong>The interpretable XGBoost-based prognostic model for SFTS shows high predictive accuracy and has been translated into a clinical tool. The model's 7 key features serve as valuable indicators for early prognosis of SFTS, warranting close attention from healthcare professionals in clinical practice.</p>","PeriodicalId":12466,"journal":{"name":"Frontiers in Microbiology","volume":"16 ","pages":"1557922"},"PeriodicalIF":4.0000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11962041/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Microbiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmicb.2025.1557922","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Severe Fever with Thrombocytopenia Syndrome (SFTS) is a disease caused by infection with the Severe Fever with Thrombocytopenia Syndrome virus (SFTSV), a novel Bunyavirus. Accurate prognostic assessment is crucial for developing individualized prevention and treatment strategies. However, machine learning prognostic models for SFTS are rare and need further improvement and clinical validation.

Objective: This study aims to develop and validate an interpretable prognostic model based on machine learning (ML) methods to enhance the understanding of SFTS progression.

Methods: This multicenter retrospective study analyzed patient data from two provinces in China. The derivation cohort included 292 patients treated at The Second Hospital of Nanjing from January 2022 to December 2023, with a 7:3 split for model training and internal validation. The external validation cohort consisted of 104 patients from The First Affiliated Hospital of Wannan Medical College during the same period. Twenty-four commonly available clinical features were selected, and the Boruta algorithm identified 12 candidate predictors, ranked by Z-scores, which were progressively incorporated into 10 machine learning models to develop prognostic models. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), accuracy, recall, and F1 score. The clinical utility of the best-performing model was evaluated through decision curve analysis (DCA) based on net benefit. Robustness was tested with 10-fold cross-validation, and feature importance was explained using SHapley Additive exPlanation (SHAP) both globally and locally.

Results: Among the 10 machine learning models, the XGBoost model demonstrated the best overall discriminatory ability. Considering both AUC index and feature simplicity, a final interpretable XGBoost model with 7 key features was constructed. The model showed high predictive accuracy for patient outcomes in both internal (AUC = 0.911, 95% CI: 0.842-0.967) and external validations (AUC = 0.891, 95% CI: 0.786-0.977). A clinical tool based on this model has been developed and implemented using the Streamlit framework.

Conclusion: The interpretable XGBoost-based prognostic model for SFTS shows high predictive accuracy and has been translated into a clinical tool. The model's 7 key features serve as valuable indicators for early prognosis of SFTS, warranting close attention from healthcare professionals in clinical practice.

利用机器学习建立发热伴血小板减少综合征预后模型的多中心研究。
背景:发热伴血小板减少综合征(SFTS)是由感染发热伴血小板减少综合征病毒(SFTSV)引起的疾病,SFTSV是一种新型布尼亚病毒。准确的预后评估对于制定个性化的预防和治疗策略至关重要。然而,SFTS的机器学习预后模型很少,需要进一步改进和临床验证。目的:本研究旨在开发和验证基于机器学习(ML)方法的可解释预后模型,以增强对SFTS进展的理解。方法:本多中心回顾性研究分析了中国两个省份的患者资料。衍生队列包括从2022年1月至2023年12月在南京第二医院治疗的292例患者,以7:3的比例进行模型训练和内部验证。外部验证队列包括同期皖南医学院第一附属医院的104例患者。选择了24个常见的临床特征,Boruta算法确定了12个候选预测因子,按z分数排序,逐步将其纳入10个机器学习模型以开发预后模型。模型的性能是用接受者-工作特征曲线下面积(AUC)、准确率、召回率和F1分数来评估的。通过基于净收益的决策曲线分析(DCA)评估最佳模型的临床效用。鲁棒性测试采用10倍交叉验证,并使用SHapley加性解释(SHAP)在全局和局部解释特征的重要性。结果:在10个机器学习模型中,XGBoost模型表现出最好的综合判别能力。考虑到AUC指数和特征简单性,最终构建了包含7个关键特征的可解释XGBoost模型。该模型在内部验证(AUC = 0.911, 95% CI: 0.842-0.967)和外部验证(AUC = 0.891, 95% CI: 0.786-0.977)中均显示出较高的预测准确性。基于该模型的临床工具已经开发并使用Streamlit框架实现。结论:可解释的基于xgboost的SFTS预后模型具有较高的预测准确性,并已转化为临床工具。该模型的7个关键特征为SFTS的早期预后提供了有价值的指标,值得医疗专业人员在临床实践中密切关注。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
9.60%
发文量
4837
审稿时长
14 weeks
期刊介绍: Frontiers in Microbiology is a leading journal in its field, publishing rigorously peer-reviewed research across the entire spectrum of microbiology. Field Chief Editor Martin G. Klotz at Washington State University is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信