Development and validation of machine learning classifiers for predicting treatment-needed retinopathy of prematurity.
Authors: Nasser Shoeibi, Majid Abrishami, Seyedeh Maryam Hosseini, Mohammad-Reza Ansari-Astaneh, Razieh Farrahi, Bahareh Gharib, Fatemeh Neghabi, Mojtaba Abrishami, Mehdi Sakhaee, Mehrdad Motamed Shariati
Journal: BMC Medical Informatics and Decision Making 25(1):221, published 2025-07-01 (impact factor 3.3; JCR Q2, Medical Informatics)
DOI: https://doi.org/10.1186/s12911-025-03057-w
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211727/pdf/
Background: This study aims to design and evaluate supervised machine-learning models that use demographic data and clinical findings from screening examinations to identify premature infants who require treatment.
Methods: We conducted a retrospective review of medical records for infants screened for retinopathy of prematurity (ROP) at our clinic over the past decade. We extracted demographic and clinical data, including eleven features: sex, maternal education, paternal education, birth weight, gestational age, ROP stage, zone of retinal involvement, age at examination, weight at examination, and CPR. We developed and assessed several classifiers: logistic regression (LR), decision tree (DT), support vector machine (SVM), naïve Bayes (NB), K-nearest neighbors (KNN), XGBoost, artificial neural networks (ANN), and random forest (RF). The target variable was defined as whether the neonate received any treatment during the follow-up period.
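As a minimal sketch of how one screening record might be turned into a model input, the Python below encodes the features named above into a numeric vector together with the binary target. The class name `ScreeningRecord`, the field names, the units, and the integer encodings are all illustrative assumptions, and the example values are invented; the abstract does not describe the study's actual preprocessing.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ScreeningRecord:
    """One screening examination (hypothetical field names and units)."""
    sex: str                   # "F" or "M"
    maternal_education: int    # ordinal level, assumed 0 = lowest
    paternal_education: int    # ordinal level, assumed 0 = lowest
    birth_weight_g: float
    gestational_age_wk: float
    rop_stage: int             # assumed coding: 0 = no ROP observed
    zone: int                  # zone of retinal involvement, assumed 1 to 3
    age_at_exam_wk: float
    weight_at_exam_g: float
    cpr: bool                  # CPR recorded in the chart
    treated: bool              # target: any treatment during follow-up


def to_features(rec: ScreeningRecord) -> Tuple[List[float], int]:
    """Return (feature vector, label) for one record."""
    x = [
        1.0 if rec.sex == "M" else 0.0,
        float(rec.maternal_education),
        float(rec.paternal_education),
        rec.birth_weight_g,
        rec.gestational_age_wk,
        float(rec.rop_stage),
        float(rec.zone),
        rec.age_at_exam_wk,
        rec.weight_at_exam_g,
        1.0 if rec.cpr else 0.0,
    ]
    y = 1 if rec.treated else 0
    return x, y


# Invented example record, not taken from the study data.
example = ScreeningRecord("F", 2, 3, 1150.0, 28.0, 3, 1, 34.0, 1600.0, False, True)
x, y = to_features(example)
print(len(x), y)
```

Scale-sensitive models such as KNN, SVM, and ANN would usually need these raw values standardized before fitting, whereas tree-based models such as DT, RF, and XGBoost generally would not.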
Results: Our analysis included data from 9,692 infants. Among the machine learning models evaluated, XGBoost and ANN achieved the highest accuracy (96%). The NB model had the lowest false negative rate and therefore the highest sensitivity (recall), at 0.99. For premature neonates, correctly identifying those who require treatment is crucial. From a clinical perspective, therefore, choosing the model with the lowest false negative rate may be more beneficial than choosing solely on the highest accuracy.
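The trade-off between accuracy and sensitivity can be made concrete with a small calculation. The sketch below computes accuracy, sensitivity, and false negative rate from confusion-matrix counts; the counts are invented for illustration and are not the study's results, and the function name `summarize` is hypothetical.

```python
def summarize(tp: int, fn: int, fp: int, tn: int) -> dict:
    """Compute accuracy, sensitivity (recall), and false negative rate."""
    total = tp + fn + fp + tn
    positives = tp + fn  # infants who truly need treatment
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / positives,
        "false_negative_rate": fn / positives,
    }


# Invented counts: 1,000 infants, 100 of whom need treatment.
model_a = summarize(tp=80, fn=20, fp=10, tn=890)  # more accurate overall
model_b = summarize(tp=99, fn=1, fp=90, tn=810)   # misses far fewer cases

for name, metrics in (("A", model_a), ("B", model_b)):
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```

With these made-up counts, model A is more accurate (0.97 versus 0.909) yet misses 20 infants who need treatment, while model B misses only one. That is the situation in which a lower false negative rate may matter more than headline accuracy.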
Conclusion: While AI can enhance decision-making by providing real-time risk assessments, these tools must be used to augment, not replace, clinical judgment. Clinicians must remain involved in interpreting model outputs and making final treatment decisions based on a holistic understanding of each patient's unique circumstances.
About the journal:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.