小儿阑尾炎的机器学习与特征选择。

IF 2.2 4区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Tomography Pub Date : 2025-08-13 DOI:10.3390/tomography11080090

John Kendall, Gabriel Gaspar, Derek Berger, Jacob Levman

{"title":"小儿阑尾炎的机器学习与特征选择。","authors":"John Kendall, Gabriel Gaspar, Derek Berger, Jacob Levman","doi":"10.3390/tomography11080090","DOIUrl":null,"url":null,"abstract":"Background/objectives: Accurate prediction of pediatric appendicitis diagnosis, management, and severity is critical for clinical decision-making. We aimed to evaluate the predictive performance of a wide range of machine learning models, combined with various feature selection techniques, on a pediatric appendicitis dataset. A particular focus was placed on the role of ultrasound (US) image-descriptive features in model performance and explainability.Methods: We conducted a retrospective cohort study on a dataset of 781 pediatric patients aged 0-18 presenting to Children's Hospital St. Hedwig in Regensburg, Germany, between January 2016 and February 2023. We developed and validated predictive models; machine learning algorithms included the random forest, logistic regression, stochastic gradient descent, and the light gradient boosting machine (LGBM). These were paired exhaustively with feature selection methods spanning filter-based (association and prediction), embedded (LGBM and linear), and a novel redundancy-aware step-up wrapper approach. We employed a machine learning benchmarking study design where AI models were trained to predict diagnosis, management, and severity outcomes, both with and without US image-descriptive features, and evaluated on held-out testing samples. Model performance was assessed using overall accuracy and area under the receiver operating characteristic curve (AUROC). A deep learner optimized for tabular data, GANDALF, was also evaluated in these applications.Results: US features significantly improved diagnostic accuracy, supporting their use in reducing model bias. However, they were not essential for maximizing accuracy in predicting management or severity. In summary, our best-performing models were, for diagnosis, the random forest with embedded LGBM feature selection (98.1% accuracy, AUROC: 0.993), for management, the random forest without feature selection (93.9% accuracy, AUROC: 0.980), and for severity, the LGBM with filter-based association feature selection (90.1% accuracy, AUROC: 0.931).Conclusions: Our results demonstrate that high-performing, interpretable machine learning models can predict key clinical outcomes in pediatric appendicitis. US image features improve diagnostic accuracy but are not critical for predicting management or severity.","PeriodicalId":51330,"journal":{"name":"Tomography","volume":"11 8","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12390108/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine Learning and Feature Selection in Pediatric Appendicitis.\",\"authors\":\"John Kendall, Gabriel Gaspar, Derek Berger, Jacob Levman\",\"doi\":\"10.3390/tomography11080090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background/objectives: Accurate prediction of pediatric appendicitis diagnosis, management, and severity is critical for clinical decision-making. We aimed to evaluate the predictive performance of a wide range of machine learning models, combined with various feature selection techniques, on a pediatric appendicitis dataset. A particular focus was placed on the role of ultrasound (US) image-descriptive features in model performance and explainability.Methods: We conducted a retrospective cohort study on a dataset of 781 pediatric patients aged 0-18 presenting to Children's Hospital St. Hedwig in Regensburg, Germany, between January 2016 and February 2023. We developed and validated predictive models; machine learning algorithms included the random forest, logistic regression, stochastic gradient descent, and the light gradient boosting machine (LGBM). These were paired exhaustively with feature selection methods spanning filter-based (association and prediction), embedded (LGBM and linear), and a novel redundancy-aware step-up wrapper approach. We employed a machine learning benchmarking study design where AI models were trained to predict diagnosis, management, and severity outcomes, both with and without US image-descriptive features, and evaluated on held-out testing samples. Model performance was assessed using overall accuracy and area under the receiver operating characteristic curve (AUROC). A deep learner optimized for tabular data, GANDALF, was also evaluated in these applications.Results: US features significantly improved diagnostic accuracy, supporting their use in reducing model bias. However, they were not essential for maximizing accuracy in predicting management or severity. In summary, our best-performing models were, for diagnosis, the random forest with embedded LGBM feature selection (98.1% accuracy, AUROC: 0.993), for management, the random forest without feature selection (93.9% accuracy, AUROC: 0.980), and for severity, the LGBM with filter-based association feature selection (90.1% accuracy, AUROC: 0.931).Conclusions: Our results demonstrate that high-performing, interpretable machine learning models can predict key clinical outcomes in pediatric appendicitis. US image features improve diagnostic accuracy but are not critical for predicting management or severity.\",\"PeriodicalId\":51330,\"journal\":{\"name\":\"Tomography\",\"volume\":\"11 8\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12390108/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tomography\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3390/tomography11080090\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tomography","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/tomography11080090","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

背景/目的：准确预测小儿阑尾炎的诊断、处理和严重程度对临床决策至关重要。我们的目标是评估广泛的机器学习模型的预测性能，结合各种特征选择技术，在儿童阑尾炎数据集上。特别关注超声（US）图像描述特征在模型性能和可解释性中的作用。方法：我们对2016年1月至2023年2月期间在德国雷根斯堡圣海德维格儿童医院就诊的781名0-18岁儿童患者的数据集进行了回顾性队列研究。我们开发并验证了预测模型；机器学习算法包括随机森林、逻辑回归、随机梯度下降和光梯度增强机（LGBM）。这些方法与基于过滤器的特征选择方法（关联和预测）、嵌入式（LGBM和线性）以及一种新颖的冗余感知升级包装方法进行了详尽的配对。我们采用了机器学习基准研究设计，其中人工智能模型被训练来预测诊断、管理和严重程度结果，包括有无美国图像描述性特征，并在测试样本上进行评估。使用总体精度和受试者工作特征曲线下面积（AUROC）评估模型性能。针对表格数据优化的深度学习器GANDALF也在这些应用中进行了评估。结果：美国特征显著提高了诊断准确性，支持其用于减少模型偏差。然而，它们并不是预测管理或严重程度的最大准确性所必需的。综上所述，我们表现最好的模型是，在诊断方面，带有嵌入LGBM特征选择的随机森林（准确率98.1%,AUROC: 0.993），在管理方面，没有特征选择的随机森林（准确率93.9%,AUROC: 0.980），在严重性方面，带有基于过滤器的关联特征选择的LGBM（准确率90.1%,AUROC: 0.931）。结论：我们的研究结果表明，高性能、可解释的机器学习模型可以预测小儿阑尾炎的关键临床结果。超声影像特征提高了诊断的准确性，但对预测病情或严重程度并不重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Machine Learning and Feature Selection in Pediatric Appendicitis.

查看原文本刊更多论文

Machine Learning and Feature Selection in Pediatric Appendicitis.

Background/objectives: Accurate prediction of pediatric appendicitis diagnosis, management, and severity is critical for clinical decision-making. We aimed to evaluate the predictive performance of a wide range of machine learning models, combined with various feature selection techniques, on a pediatric appendicitis dataset. A particular focus was placed on the role of ultrasound (US) image-descriptive features in model performance and explainability.

Methods: We conducted a retrospective cohort study on a dataset of 781 pediatric patients aged 0-18 presenting to Children's Hospital St. Hedwig in Regensburg, Germany, between January 2016 and February 2023. We developed and validated predictive models; machine learning algorithms included the random forest, logistic regression, stochastic gradient descent, and the light gradient boosting machine (LGBM). These were paired exhaustively with feature selection methods spanning filter-based (association and prediction), embedded (LGBM and linear), and a novel redundancy-aware step-up wrapper approach. We employed a machine learning benchmarking study design where AI models were trained to predict diagnosis, management, and severity outcomes, both with and without US image-descriptive features, and evaluated on held-out testing samples. Model performance was assessed using overall accuracy and area under the receiver operating characteristic curve (AUROC). A deep learner optimized for tabular data, GANDALF, was also evaluated in these applications.

Results: US features significantly improved diagnostic accuracy, supporting their use in reducing model bias. However, they were not essential for maximizing accuracy in predicting management or severity. In summary, our best-performing models were, for diagnosis, the random forest with embedded LGBM feature selection (98.1% accuracy, AUROC: 0.993), for management, the random forest without feature selection (93.9% accuracy, AUROC: 0.980), and for severity, the LGBM with filter-based association feature selection (90.1% accuracy, AUROC: 0.931).

Conclusions: Our results demonstrate that high-performing, interpretable machine learning models can predict key clinical outcomes in pediatric appendicitis. US image features improve diagnostic accuracy but are not critical for predicting management or severity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tomography Medicine-Radiology, Nuclear Medicine and Imaging

CiteScore

2.70

自引率

10.50%

发文量

222

期刊介绍： TomographyTM publishes basic (technical and pre-clinical) and clinical scientific articles which involve the advancement of imaging technologies. Tomography encompasses studies that use single or multiple imaging modalities including for example CT, US, PET, SPECT, MR and hyperpolarization technologies, as well as optical modalities (i.e. bioluminescence, photoacoustic, endomicroscopy, fiber optic imaging and optical computed tomography) in basic sciences, engineering, preclinical and clinical medicine. Tomography also welcomes studies involving exploration and refinement of contrast mechanisms and image-derived metrics within and across modalities toward the development of novel imaging probes for image-based feedback and intervention. The use of imaging in biology and medicine provides unparalleled opportunities to noninvasively interrogate tissues to obtain real-time dynamic and quantitative information required for diagnosis and response to interventions and to follow evolving pathological conditions. As multi-modal studies and the complexities of imaging technologies themselves are ever increasing to provide advanced information to scientists and clinicians. Tomography provides a unique publication venue allowing investigators the opportunity to more precisely communicate integrated findings related to the diverse and heterogeneous features associated with underlying anatomical, physiological, functional, metabolic and molecular genetic activities of normal and diseased tissue. Thus Tomography publishes peer-reviewed articles which involve the broad use of imaging of any tissue and disease type including both preclinical and clinical investigations. In addition, hardware/software along with chemical and molecular probe advances are welcome as they are deemed to significantly contribute towards the long-term goal of improving the overall impact of imaging on scientific and clinical discovery.