用于预测早产儿视网膜病变治疗的机器学习分类器的开发和验证。

IF 3.3 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-07-01 DOI:10.1186/s12911-025-03057-w

Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati

{"title":"用于预测早产儿视网膜病变治疗的机器学习分类器的开发和验证。","authors":"Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati","doi":"10.1186/s12911-025-03057-w","DOIUrl":null,"url":null,"abstract":"Background: This study aims to design and evaluate various supervised machine-learning models for identifying premature infants who require treatment based on demographic data and clinical findings from screening examinations.Methods: We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.Results: Our analysis included data from 9,692 infants. Among the machine learning models evaluated, the XGBoost and ANN models achieved the highest accuracy at 96%. In terms of sensitivity (recall), the NB model exhibited the lowest false negative rate, indicating the highest sensitivity (0.99). In the context of premature neonates, accurately diagnosing those who require treatment is crucial. Therefore, from a clinical perspective, prioritizing a model with the lowest false negative rate may be more beneficial than selecting one based solely on the highest accuracy.Conclusion: While AI can enhance decision-making processes by providing real-time risk assessments, these tools must be used to augment-not replace-clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.Clinical trial number: Not applicable.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"221"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211727/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of machine learning classifiers for predicting treatment-needed retinopathy of prematurity.\",\"authors\":\"Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati\",\"doi\":\"10.1186/s12911-025-03057-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: This study aims to design and evaluate various supervised machine-learning models for identifying premature infants who require treatment based on demographic data and clinical findings from screening examinations.Methods: We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.Results: Our analysis included data from 9,692 infants. Among the machine learning models evaluated, the XGBoost and ANN models achieved the highest accuracy at 96%. In terms of sensitivity (recall), the NB model exhibited the lowest false negative rate, indicating the highest sensitivity (0.99). In the context of premature neonates, accurately diagnosing those who require treatment is crucial. Therefore, from a clinical perspective, prioritizing a model with the lowest false negative rate may be more beneficial than selecting one based solely on the highest accuracy.Conclusion: While AI can enhance decision-making processes by providing real-time risk assessments, these tools must be used to augment-not replace-clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.Clinical trial number: Not applicable.\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"221\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211727/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03057-w\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03057-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

摘要

背景：本研究旨在设计和评估各种监督机器学习模型，用于根据人口统计数据和筛查检查的临床结果识别需要治疗的早产儿。方法：我们对过去十年来在我们诊所接受早产儿视网膜病变（ROP）筛查的婴儿病历进行了回顾性分析。我们提取了人口统计学和临床数据，包括11个特征：性别、母亲教育程度、父亲教育程度、出生体重、胎龄、ROP分期、视网膜受累区、检查时年龄、检查时体重和心肺复苏术。我们开发并评估了几种分类器：逻辑回归（LR）、决策树（DT）、支持向量机（SVM）、naïve贝叶斯（NB）、k近邻（KNN）、XGBoost、人工神经网络（ANN）和随机森林（RF）。目标变量定义为新生儿在随访期间是否接受任何治疗。结果：我们的分析包括9692名婴儿的数据。在评估的机器学习模型中，XGBoost和ANN模型的准确率最高，达到96%。在灵敏度（召回率）方面，NB模型的假阴性率最低，灵敏度最高（0.99）。在早产儿的情况下，准确诊断那些需要治疗的人是至关重要的。因此，从临床角度来看，优先选择假阴性率最低的模型可能比仅根据最高准确率选择模型更有益。结论：虽然人工智能可以通过提供实时风险评估来增强决策过程，但这些工具必须用于增强而不是取代临床判断。临床医生必须继续参与解释模型输出，并根据对每个患者独特情况的全面了解做出最终的治疗决定。临床试验号：不适用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development and validation of machine learning classifiers for predicting treatment-needed retinopathy of prematurity.

Background: This study aims to design and evaluate various supervised machine-learning models for identifying premature infants who require treatment based on demographic data and clinical findings from screening examinations.

Methods: We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.

Results: Our analysis included data from 9,692 infants. Among the machine learning models evaluated, the XGBoost and ANN models achieved the highest accuracy at 96%. In terms of sensitivity (recall), the NB model exhibited the lowest false negative rate, indicating the highest sensitivity (0.99). In the context of premature neonates, accurately diagnosing those who require treatment is crucial. Therefore, from a clinical perspective, prioritizing a model with the lowest false negative rate may be more beneficial than selecting one based solely on the highest accuracy.

Conclusion: While AI can enhance decision-making processes by providing real-time risk assessments, these tools must be used to augment-not replace-clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.

Clinical trial number: Not applicable.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.