Development and validation of machine learning classifiers for predicting treatment-needed retinopathy of prematurity.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati
{"title":"Development and validation of machine learning classifiers for predicting treatment-needed retinopathy of prematurity.","authors":"Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati","doi":"10.1186/s12911-025-03057-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aims to design and evaluate various supervised machine-learning models for identifying premature infants who require treatment based on demographic data and clinical findings from screening examinations.</p><p><strong>Methods: </strong>We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.</p><p><strong>Results: </strong>Our analysis included data from 9,692 infants. Among the machine learning models evaluated, the XGBoost and ANN models achieved the highest accuracy at 96%. In terms of sensitivity (recall), the NB model exhibited the lowest false negative rate, indicating the highest sensitivity (0.99). In the context of premature neonates, accurately diagnosing those who require treatment is crucial. Therefore, from a clinical perspective, prioritizing a model with the lowest false negative rate may be more beneficial than selecting one based solely on the highest accuracy.</p><p><strong>Conclusion: </strong>While AI can enhance decision-making processes by providing real-time risk assessments, these tools must be used to augment-not replace-clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"221"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211727/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03057-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: This study aims to design and evaluate various supervised machine-learning models for identifying premature infants who require treatment based on demographic data and clinical findings from screening examinations.

Methods: We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.

Results: Our analysis included data from 9,692 infants. Among the machine learning models evaluated, the XGBoost and ANN models achieved the highest accuracy at 96%. In terms of sensitivity (recall), the NB model exhibited the lowest false negative rate, indicating the highest sensitivity (0.99). In the context of premature neonates, accurately diagnosing those who require treatment is crucial. Therefore, from a clinical perspective, prioritizing a model with the lowest false negative rate may be more beneficial than selecting one based solely on the highest accuracy.

Conclusion: While AI can enhance decision-making processes by providing real-time risk assessments, these tools must be used to augment-not replace-clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.

Clinical trial number: Not applicable.

用于预测早产儿视网膜病变治疗的机器学习分类器的开发和验证。
背景:本研究旨在设计和评估各种监督机器学习模型,用于根据人口统计数据和筛查检查的临床结果识别需要治疗的早产儿。方法:我们对过去十年来在我们诊所接受早产儿视网膜病变(ROP)筛查的婴儿病历进行了回顾性分析。我们提取了人口统计学和临床数据,包括11个特征:性别、母亲教育程度、父亲教育程度、出生体重、胎龄、ROP分期、视网膜受累区、检查时年龄、检查时体重和心肺复苏术。我们开发并评估了几种分类器:逻辑回归(LR)、决策树(DT)、支持向量机(SVM)、naïve贝叶斯(NB)、k近邻(KNN)、XGBoost、人工神经网络(ANN)和随机森林(RF)。目标变量定义为新生儿在随访期间是否接受任何治疗。结果:我们的分析包括9692名婴儿的数据。在评估的机器学习模型中,XGBoost和ANN模型的准确率最高,达到96%。在灵敏度(召回率)方面,NB模型的假阴性率最低,灵敏度最高(0.99)。在早产儿的情况下,准确诊断那些需要治疗的人是至关重要的。因此,从临床角度来看,优先选择假阴性率最低的模型可能比仅根据最高准确率选择模型更有益。结论:虽然人工智能可以通过提供实时风险评估来增强决策过程,但这些工具必须用于增强而不是取代临床判断。临床医生必须继续参与解释模型输出,并根据对每个患者独特情况的全面了解做出最终的治疗决定。临床试验号:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信