Development and validation of supervised machine learning multivariable prediction models for the diagnosis of Pneumocystis jirovecii pneumonia using nasopharyngeal swab PCR in adults in a low-HIV prevalence setting.

IF 2.3 4区 医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Rusheng Chew, Marion L Woods, David L Paterson
{"title":"Development and validation of supervised machine learning multivariable prediction models for the diagnosis of Pneumocystis jirovecii pneumonia using nasopharyngeal swab PCR in adults in a low-HIV prevalence setting.","authors":"Rusheng Chew, Marion L Woods, David L Paterson","doi":"10.1093/inthealth/ihae052","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The global burden of the opportunistic fungal disease Pneumocystis jirovecii pneumonia (PJP) remains substantial. Polymerase chain reaction (PCR) on nasopharyngeal swabs (NPS) has high specificity and may be a viable alternative to the gold standard diagnostic of PCR on invasively collected lower respiratory tract specimens, but has low sensitivity. Sensitivity may be improved by incorporating NPS PCR results into machine learning models.</p><p><strong>Methods: </strong>Three supervised multivariable diagnostic models (random forest, logistic regression and extreme gradient boosting) were constructed and validated using a 111-person Australian dataset. The predictors were age, gender, immunosuppression type and NPS PCR result. Model performance metrics such as accuracy, sensitivity, specificity and predictive values were compared to select the best-performing model.</p><p><strong>Results: </strong>The logistic regression model performed best, with 80% accuracy, improving sensitivity to 86% and maintaining acceptable specificity of 70%. Using this model, positive and negative NPS PCR results indicated post-test probabilities of 84% (likely PJP) and 26% (unlikely PJP), respectively.</p><p><strong>Conclusions: </strong>The logistic regression model should be externally validated in a wider range of settings. As the predictors are simple, routinely collected patient variables, this model may represent a diagnostic advance suitable for settings where collection of lower respiratory tract specimens is difficult but PCR is available.</p>","PeriodicalId":49060,"journal":{"name":"International Health","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/inthealth/ihae052","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The global burden of the opportunistic fungal disease Pneumocystis jirovecii pneumonia (PJP) remains substantial. Polymerase chain reaction (PCR) on nasopharyngeal swabs (NPS) has high specificity and may be a viable alternative to the gold standard diagnostic of PCR on invasively collected lower respiratory tract specimens, but has low sensitivity. Sensitivity may be improved by incorporating NPS PCR results into machine learning models.

Methods: Three supervised multivariable diagnostic models (random forest, logistic regression and extreme gradient boosting) were constructed and validated using a 111-person Australian dataset. The predictors were age, gender, immunosuppression type and NPS PCR result. Model performance metrics such as accuracy, sensitivity, specificity and predictive values were compared to select the best-performing model.

Results: The logistic regression model performed best, with 80% accuracy, improving sensitivity to 86% and maintaining acceptable specificity of 70%. Using this model, positive and negative NPS PCR results indicated post-test probabilities of 84% (likely PJP) and 26% (unlikely PJP), respectively.

Conclusions: The logistic regression model should be externally validated in a wider range of settings. As the predictors are simple, routinely collected patient variables, this model may represent a diagnostic advance suitable for settings where collection of lower respiratory tract specimens is difficult but PCR is available.

利用鼻咽拭子聚合酶链式反应(PCR)对低艾滋病毒感染率环境中的成人进行肺孢子虫肺炎诊断的监督机器学习多变量预测模型的开发与验证。
背景:机会性真菌疾病肺孢子菌肺炎(PJP)在全球造成的负担仍然很重。鼻咽拭子上的聚合酶链反应(PCR)特异性高,可以替代有创采集的下呼吸道标本上的聚合酶链反应这一金标准诊断方法,但灵敏度较低。将 NPS PCR 结果纳入机器学习模型可提高灵敏度:方法: 使用 111 人的澳大利亚数据集构建并验证了三个有监督的多变量诊断模型(随机森林、逻辑回归和极端梯度提升)。预测因素包括年龄、性别、免疫抑制类型和新农合 PCR 结果。比较了准确性、灵敏度、特异性和预测值等模型性能指标,以选出性能最佳的模型:结果:逻辑回归模型表现最佳,准确率达 80%,灵敏度提高到 86%,特异性维持在 70% 的可接受水平。使用该模型,NPS PCR 阳性和阴性结果显示的检测后概率分别为 84%(可能为 PJP)和 26%(不可能为 PJP):逻辑回归模型应在更广泛的环境中进行外部验证。由于预测因素都是简单的、常规收集的患者变量,该模型可能代表了一种诊断上的进步,适用于难以收集下呼吸道标本但可进行 PCR 的环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Health
International Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-
CiteScore
4.50
自引率
0.00%
发文量
83
审稿时长
>12 weeks
期刊介绍: International Health is an official journal of the Royal Society of Tropical Medicine and Hygiene. It publishes original, peer-reviewed articles and reviews on all aspects of global health including the social and economic aspects of communicable and non-communicable diseases, health systems research, policy and implementation, and the evaluation of disease control programmes and healthcare delivery solutions. It aims to stimulate scientific and policy debate and provide a forum for analysis and opinion sharing for individuals and organisations engaged in all areas of global health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信