23S rRNA A2063G突变致大环内酯耐药肺炎支原体肺炎多叶性肺实变的机器学习预测模型

IF 3.8 2区 生物学 Q2 MICROBIOLOGY
Yan Guo, Yonghan Luo
{"title":"23S rRNA A2063G突变致大环内酯耐药肺炎支原体肺炎多叶性肺实变的机器学习预测模型","authors":"Yan Guo, Yonghan Luo","doi":"10.1128/spectrum.02458-25","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to develop a machine learning (ML)-based predictive model for assessing the risk of multilobar pulmonary consolidation in children with macrolide-resistant <i>Mycoplasma pneumoniae</i> pneumonia (MRMP) caused by the 23S rRNA A2063G mutation, a subgroup underrepresented in prior studies. A total of 404 MRMP cases diagnosed between October 2024 and February 2025 were included in this study. Key clinical characteristics, including laboratory test results, symptoms, and treatment outcomes, were extracted from electronic medical records. Six ML models, including Logistic Regression, Naive Bayes, K-Nearest Neighbors, Multilayer Perceptron, Random Forest, and XG-Boost, were developed to predict multilobar pulmonary consolidation. Least absolute shrinkage and selection operator (LASSO) regression was used to select relevant variables. Model performance was then evaluated using receiver operating characteristic (ROC) curves and decision curve analysis (DCA). Finally, Sharpley Additive Explanations was used for model interpretability. XG-Boost demonstrated the highest predictive performance with an area under the ROC curve of 0.976 and 0.904 in the training and validation sets, respectively, showing a high sensitivity of 0.97, specificity of 0.81, accuracy of 0.94, and an F1 score of 0.95. Key predictors identified for multilobar pulmonary consolidation included the top 10 variables: C-reactive protein, lactate dehydrogenase, fibrinogen, platelet count, albumin, hemoglobin, creatinine, aspartate aminotransferase, interleukin-6, and oxygen therapy. DCA showed that the model also exhibited strong clinical utility. The XG-Boost predictive model offers a robust tool for identifying high-risk children with MRMP caused by the 23S rRNA A2063G mutation. By integrating clinical features, the model enhances early risk stratification and can support clinical decision-making, improving the accuracy and efficiency of treatment plans.IMPORTANCEMacrolide-resistant <i>Mycoplasma pneumoniae</i> pneumonia caused by the 23S rRNA A2063G mutation poses a significant threat to pediatric health, often leading to severe multilobar pulmonary consolidation. This study develops a high-performance machine learning model (XG-Boost) that accurately predicts this complication using key clinical indicators such as C-reactive protein, lactate dehydrogenase, and IL-6. With an area under the ROC curve of 0.976, the model enables early risk stratification, guiding clinicians in optimizing treatment for high-risk children. By improving diagnostic precision and intervention timing, this tool can reduce disease severity, minimize hospital stays, and enhance patient outcomes. The interpretability of the model via Sharpley Additive Explanations analysis further ensures its clinical applicability, making it a valuable advancement in managing antibiotic-resistant pediatric pneumonia.</p>","PeriodicalId":18670,"journal":{"name":"Microbiology spectrum","volume":" ","pages":"e0245825"},"PeriodicalIF":3.8000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A machine learning-based predictive model for multilobar pulmonary consolidation induced by macrolide-resistant <i>Mycoplasma pneumoniae</i> pneumonia caused by the 23S rRNA A2063G mutation.\",\"authors\":\"Yan Guo, Yonghan Luo\",\"doi\":\"10.1128/spectrum.02458-25\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This study aims to develop a machine learning (ML)-based predictive model for assessing the risk of multilobar pulmonary consolidation in children with macrolide-resistant <i>Mycoplasma pneumoniae</i> pneumonia (MRMP) caused by the 23S rRNA A2063G mutation, a subgroup underrepresented in prior studies. A total of 404 MRMP cases diagnosed between October 2024 and February 2025 were included in this study. Key clinical characteristics, including laboratory test results, symptoms, and treatment outcomes, were extracted from electronic medical records. Six ML models, including Logistic Regression, Naive Bayes, K-Nearest Neighbors, Multilayer Perceptron, Random Forest, and XG-Boost, were developed to predict multilobar pulmonary consolidation. Least absolute shrinkage and selection operator (LASSO) regression was used to select relevant variables. Model performance was then evaluated using receiver operating characteristic (ROC) curves and decision curve analysis (DCA). Finally, Sharpley Additive Explanations was used for model interpretability. XG-Boost demonstrated the highest predictive performance with an area under the ROC curve of 0.976 and 0.904 in the training and validation sets, respectively, showing a high sensitivity of 0.97, specificity of 0.81, accuracy of 0.94, and an F1 score of 0.95. Key predictors identified for multilobar pulmonary consolidation included the top 10 variables: C-reactive protein, lactate dehydrogenase, fibrinogen, platelet count, albumin, hemoglobin, creatinine, aspartate aminotransferase, interleukin-6, and oxygen therapy. DCA showed that the model also exhibited strong clinical utility. The XG-Boost predictive model offers a robust tool for identifying high-risk children with MRMP caused by the 23S rRNA A2063G mutation. By integrating clinical features, the model enhances early risk stratification and can support clinical decision-making, improving the accuracy and efficiency of treatment plans.IMPORTANCEMacrolide-resistant <i>Mycoplasma pneumoniae</i> pneumonia caused by the 23S rRNA A2063G mutation poses a significant threat to pediatric health, often leading to severe multilobar pulmonary consolidation. This study develops a high-performance machine learning model (XG-Boost) that accurately predicts this complication using key clinical indicators such as C-reactive protein, lactate dehydrogenase, and IL-6. With an area under the ROC curve of 0.976, the model enables early risk stratification, guiding clinicians in optimizing treatment for high-risk children. By improving diagnostic precision and intervention timing, this tool can reduce disease severity, minimize hospital stays, and enhance patient outcomes. The interpretability of the model via Sharpley Additive Explanations analysis further ensures its clinical applicability, making it a valuable advancement in managing antibiotic-resistant pediatric pneumonia.</p>\",\"PeriodicalId\":18670,\"journal\":{\"name\":\"Microbiology spectrum\",\"volume\":\" \",\"pages\":\"e0245825\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbiology spectrum\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/spectrum.02458-25\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiology spectrum","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/spectrum.02458-25","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

本研究旨在开发一种基于机器学习(ML)的预测模型,用于评估由23S rRNA A2063G突变引起的大环内酯耐药肺炎支原体肺炎(MRMP)儿童多叶性肺实变的风险,该亚组在先前的研究中未被充分代表。2024年10月至2025年2月诊断的404例MRMP病例纳入本研究。从电子病历中提取关键临床特征,包括实验室检查结果、症状和治疗结果。六种ML模型,包括逻辑回归、朴素贝叶斯、k近邻、多层感知器、随机森林和XG-Boost,被开发用于预测多叶肺实变。最小绝对收缩和选择算子(LASSO)回归选择相关变量。然后使用受试者工作特征(ROC)曲线和决策曲线分析(DCA)评估模型的性能。最后,模型可解释性采用Sharpley加性解释。XG-Boost在训练集和验证集的ROC曲线下面积分别为0.976和0.904,预测性能最高,灵敏度为0.97,特异度为0.81,准确度为0.94,F1评分为0.95。确定的多叶肺实变的关键预测因子包括前10个变量:c反应蛋白、乳酸脱氢酶、纤维蛋白原、血小板计数、白蛋白、血红蛋白、肌酐、天冬氨酸转氨酶、白细胞介素-6和氧治疗。DCA结果表明,该模型具有较强的临床应用价值。XG-Boost预测模型为识别由23S rRNA A2063G突变引起的MRMP高危儿童提供了一个强大的工具。该模型结合临床特点,加强早期风险分层,支持临床决策,提高治疗方案的准确性和效率。由23S rRNA A2063G突变引起的耐药肺炎支原体肺炎对儿童健康构成重大威胁,常导致严重的多叶性肺实变。本研究开发了一种高性能机器学习模型(XG-Boost),可以使用c反应蛋白、乳酸脱氢酶和IL-6等关键临床指标准确预测这种并发症。该模型的ROC曲线下面积为0.976,可以进行早期风险分层,指导临床医生对高危儿童进行优化治疗。通过提高诊断精度和干预时机,该工具可以降低疾病严重程度,最大限度地减少住院时间,并提高患者的预后。该模型通过Sharpley加性解释分析的可解释性进一步确保了其临床适用性,使其成为治疗耐药儿童肺炎的宝贵进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A machine learning-based predictive model for multilobar pulmonary consolidation induced by macrolide-resistant Mycoplasma pneumoniae pneumonia caused by the 23S rRNA A2063G mutation.

This study aims to develop a machine learning (ML)-based predictive model for assessing the risk of multilobar pulmonary consolidation in children with macrolide-resistant Mycoplasma pneumoniae pneumonia (MRMP) caused by the 23S rRNA A2063G mutation, a subgroup underrepresented in prior studies. A total of 404 MRMP cases diagnosed between October 2024 and February 2025 were included in this study. Key clinical characteristics, including laboratory test results, symptoms, and treatment outcomes, were extracted from electronic medical records. Six ML models, including Logistic Regression, Naive Bayes, K-Nearest Neighbors, Multilayer Perceptron, Random Forest, and XG-Boost, were developed to predict multilobar pulmonary consolidation. Least absolute shrinkage and selection operator (LASSO) regression was used to select relevant variables. Model performance was then evaluated using receiver operating characteristic (ROC) curves and decision curve analysis (DCA). Finally, Sharpley Additive Explanations was used for model interpretability. XG-Boost demonstrated the highest predictive performance with an area under the ROC curve of 0.976 and 0.904 in the training and validation sets, respectively, showing a high sensitivity of 0.97, specificity of 0.81, accuracy of 0.94, and an F1 score of 0.95. Key predictors identified for multilobar pulmonary consolidation included the top 10 variables: C-reactive protein, lactate dehydrogenase, fibrinogen, platelet count, albumin, hemoglobin, creatinine, aspartate aminotransferase, interleukin-6, and oxygen therapy. DCA showed that the model also exhibited strong clinical utility. The XG-Boost predictive model offers a robust tool for identifying high-risk children with MRMP caused by the 23S rRNA A2063G mutation. By integrating clinical features, the model enhances early risk stratification and can support clinical decision-making, improving the accuracy and efficiency of treatment plans.IMPORTANCEMacrolide-resistant Mycoplasma pneumoniae pneumonia caused by the 23S rRNA A2063G mutation poses a significant threat to pediatric health, often leading to severe multilobar pulmonary consolidation. This study develops a high-performance machine learning model (XG-Boost) that accurately predicts this complication using key clinical indicators such as C-reactive protein, lactate dehydrogenase, and IL-6. With an area under the ROC curve of 0.976, the model enables early risk stratification, guiding clinicians in optimizing treatment for high-risk children. By improving diagnostic precision and intervention timing, this tool can reduce disease severity, minimize hospital stays, and enhance patient outcomes. The interpretability of the model via Sharpley Additive Explanations analysis further ensures its clinical applicability, making it a valuable advancement in managing antibiotic-resistant pediatric pneumonia.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Microbiology spectrum
Microbiology spectrum Biochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.20
自引率
5.40%
发文量
1800
期刊介绍: Microbiology Spectrum publishes commissioned review articles on topics in microbiology representing ten content areas: Archaea; Food Microbiology; Bacterial Genetics, Cell Biology, and Physiology; Clinical Microbiology; Environmental Microbiology and Ecology; Eukaryotic Microbes; Genomics, Computational, and Synthetic Microbiology; Immunology; Pathogenesis; and Virology. Reviews are interrelated, with each review linking to other related content. A large board of Microbiology Spectrum editors aids in the development of topics for potential reviews and in the identification of an editor, or editors, who shepherd each collection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信