Using machine learning to improve the diagnostic accuracy of the modified Duke/ESC 2015 criteria in patients with suspected prosthetic valve endocarditis - a proof of concept study.

IF 8.6 1区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
D Ten Hove, R H J A Slart, A W J M Glaudemans, D F Postma, A Gomes, L E Swart, W Tanis, P P van Geel, G Mecozzi, R P J Budde, K Mouridsen, B Sinha
{"title":"Using machine learning to improve the diagnostic accuracy of the modified Duke/ESC 2015 criteria in patients with suspected prosthetic valve endocarditis - a proof of concept study.","authors":"D Ten Hove, R H J A Slart, A W J M Glaudemans, D F Postma, A Gomes, L E Swart, W Tanis, P P van Geel, G Mecozzi, R P J Budde, K Mouridsen, B Sinha","doi":"10.1007/s00259-024-06774-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Prosthetic valve endocarditis (PVE) is a serious complication of prosthetic valve implantation, with an estimated yearly incidence of at least 0.4-1.0%. The Duke criteria and subsequent modifications have been developed as a diagnostic framework for infective endocarditis (IE) in clinical studies. However, their sensitivity and specificity are limited, especially for PVE. Furthermore, their most recent versions (ESC2015 and ESC2023) include advanced imaging modalities, e.g., cardiac CTA and [<sup>18</sup>F]FDG PET/CT as major criteria. However, despite these significant changes, the weighing system using major and minor criteria has remained unchanged. This may have introduced bias to the diagnostic set of criteria. Here, we aimed to evaluate and improve the predictive value of the modified Duke/ESC 2015 (MDE2015) criteria by using machine learning algorithms.</p><p><strong>Methods: </strong>In this proof-of-concept study, we used data of a well-defined retrospective multicentre cohort of 160 patients evaluated for suspected PVE. Four machine learning algorithms were compared to the prediction of the diagnosis according to the MDE2015 criteria: Lasso logistic regression, decision tree with gradient boosting (XGBoost), decision tree without gradient boosting, and a model combining predictions of these (ensemble learning). All models used the same features that also constitute the MDE2015 criteria. The final diagnosis of PVE, based on endocarditis team consensus using all available clinical information, including surgical findings whenever performed, and with at least 1 year follow up, was used as the composite gold standard.</p><p><strong>Results: </strong>The diagnostic performance of the MDE2015 criteria varied depending on how the category of 'possible' PVE cases were handled. Considering these cases as positive for PVE, sensitivity and specificity were 0.96 and 0.60, respectively. Whereas treating these cases as negative, sensitivity and specificity were 0.74 and 0.98, respectively. Combining the approaches of considering possible endocarditis as positive and as negative for ROC-analysis resulted in an excellent AUC of 0.917. For the machine learning models, the sensitivity and specificity were as follows: logistic regression, 0.92 and 0.85; XGBoost, 0.90 and 0.85; decision trees, 0.88 and 0.86; and ensemble learning, 0.91 and 0.85, respectively. The resulting AUCs were, in the same order: 0.938, 0.937, 0.930, and 0.941, respectively.</p><p><strong>Discussion: </strong>In this proof-of-concept study, machine learning algorithms achieved improved diagnostic performance compared to the major/minor weighing system as used in the MDE2015 criteria. Moreover, these models provide quantifiable certainty levels of the diagnosis, potentially enhancing interpretability for clinicians. Additionally, they allow for easy incorporation of new and/or refined criteria, such as the individual weight of advanced imaging modalities such as CTA or [<sup>18</sup>F]FDG PET/CT. These promising preliminary findings warrant further studies for validation, ideally in a prospective cohort encompassing the full spectrum of patients with suspected IE.</p>","PeriodicalId":11909,"journal":{"name":"European Journal of Nuclear Medicine and Molecular Imaging","volume":null,"pages":null},"PeriodicalIF":8.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527948/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Nuclear Medicine and Molecular Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00259-024-06774-y","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/21 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Prosthetic valve endocarditis (PVE) is a serious complication of prosthetic valve implantation, with an estimated yearly incidence of at least 0.4-1.0%. The Duke criteria and subsequent modifications have been developed as a diagnostic framework for infective endocarditis (IE) in clinical studies. However, their sensitivity and specificity are limited, especially for PVE. Furthermore, their most recent versions (ESC2015 and ESC2023) include advanced imaging modalities, e.g., cardiac CTA and [18F]FDG PET/CT as major criteria. However, despite these significant changes, the weighing system using major and minor criteria has remained unchanged. This may have introduced bias to the diagnostic set of criteria. Here, we aimed to evaluate and improve the predictive value of the modified Duke/ESC 2015 (MDE2015) criteria by using machine learning algorithms.

Methods: In this proof-of-concept study, we used data of a well-defined retrospective multicentre cohort of 160 patients evaluated for suspected PVE. Four machine learning algorithms were compared to the prediction of the diagnosis according to the MDE2015 criteria: Lasso logistic regression, decision tree with gradient boosting (XGBoost), decision tree without gradient boosting, and a model combining predictions of these (ensemble learning). All models used the same features that also constitute the MDE2015 criteria. The final diagnosis of PVE, based on endocarditis team consensus using all available clinical information, including surgical findings whenever performed, and with at least 1 year follow up, was used as the composite gold standard.

Results: The diagnostic performance of the MDE2015 criteria varied depending on how the category of 'possible' PVE cases were handled. Considering these cases as positive for PVE, sensitivity and specificity were 0.96 and 0.60, respectively. Whereas treating these cases as negative, sensitivity and specificity were 0.74 and 0.98, respectively. Combining the approaches of considering possible endocarditis as positive and as negative for ROC-analysis resulted in an excellent AUC of 0.917. For the machine learning models, the sensitivity and specificity were as follows: logistic regression, 0.92 and 0.85; XGBoost, 0.90 and 0.85; decision trees, 0.88 and 0.86; and ensemble learning, 0.91 and 0.85, respectively. The resulting AUCs were, in the same order: 0.938, 0.937, 0.930, and 0.941, respectively.

Discussion: In this proof-of-concept study, machine learning algorithms achieved improved diagnostic performance compared to the major/minor weighing system as used in the MDE2015 criteria. Moreover, these models provide quantifiable certainty levels of the diagnosis, potentially enhancing interpretability for clinicians. Additionally, they allow for easy incorporation of new and/or refined criteria, such as the individual weight of advanced imaging modalities such as CTA or [18F]FDG PET/CT. These promising preliminary findings warrant further studies for validation, ideally in a prospective cohort encompassing the full spectrum of patients with suspected IE.

Abstract Image

在疑似人工瓣膜心内膜炎患者中使用机器学习提高改良的 Duke/ESC 2015 标准的诊断准确性--一项概念验证研究。
导言:人工瓣膜心内膜炎(PVE)是人工瓣膜植入术的一种严重并发症,估计每年的发病率至少为 0.4-1.0%。在临床研究中,杜克标准和随后的修订版已被开发为感染性心内膜炎(IE)的诊断框架。然而,它们的灵敏度和特异性都很有限,尤其是对 PVE 而言。此外,其最新版本(ESC2015 和ESC2023)将先进的成像模式(如心脏CTA和[18F]FDG PET/CT)作为主要标准。然而,尽管发生了这些重大变化,使用主要标准和次要标准的权衡系统却一直未变。这可能会给诊断标准集带来偏差。在此,我们旨在通过使用机器学习算法来评估和提高修改后的杜克/ESC 2015(MDE2015)标准的预测价值:在这项概念验证研究中,我们使用了一个定义明确的回顾性多中心队列的数据,该队列包括 160 名接受评估的疑似 PVE 患者。根据 MDE2015 标准,对四种机器学习算法进行了诊断预测比较:拉索逻辑回归(Lasso logistic regression)、梯度提升决策树(XGBoost)、无梯度提升决策树,以及综合这些算法预测结果的模型(集合学习)。所有模型都使用了构成 MDE2015 标准的相同特征。PVE的最终诊断以心内膜炎小组的共识为基础,使用所有可用的临床信息,包括外科手术结果,并进行至少1年的随访,作为综合金标准:MDE2015标准的诊断效果因如何处理 "可能 "PVE病例而异。将这些病例视为PVE阳性病例,敏感性和特异性分别为0.96和0.60。而将这些病例视为阴性,敏感性和特异性分别为 0.74 和 0.98。结合将可能的心内膜炎视为阳性和阴性的方法进行 ROC 分析,得出的 AUC 高达 0.917。机器学习模型的灵敏度和特异性如下:逻辑回归分别为 0.92 和 0.85;XGBoost 分别为 0.90 和 0.85;决策树分别为 0.88 和 0.86;集合学习分别为 0.91 和 0.85。得出的 AUC 依次为讨论:在这项概念验证研究中,与 MDE2015 标准中使用的主要/次要权衡系统相比,机器学习算法提高了诊断性能。此外,这些模型提供了可量化的诊断确定性水平,可能会提高临床医生的可解释性。此外,这些模型还便于纳入新的和/或改进的标准,如 CTA 或 [18F]FDG PET/CT 等先进成像模式的个体权重。这些令人鼓舞的初步研究结果需要进一步研究验证,最好是在包括所有疑似 IE 患者的前瞻性队列中进行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
15.60
自引率
9.90%
发文量
392
审稿时长
3 months
期刊介绍: The European Journal of Nuclear Medicine and Molecular Imaging serves as a platform for the exchange of clinical and scientific information within nuclear medicine and related professions. It welcomes international submissions from professionals involved in the functional, metabolic, and molecular investigation of diseases. The journal's coverage spans physics, dosimetry, radiation biology, radiochemistry, and pharmacy, providing high-quality peer review by experts in the field. Known for highly cited and downloaded articles, it ensures global visibility for research work and is part of the EJNMMI journal family.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信