Flavio Leandro de Morais, Elisson da Silva Rocha, Gabriel Masson, Dimas Cassimiro do Nascimento Filho, Katia Maria Mendes, Raphael Augusto de Sousa Dourado, Waldemar Brandao Neto, Patricia Takako Endo
{"title":"Utilization of tree-based machine learning models for predicting low birth weight cases.","authors":"Flavio Leandro de Morais, Elisson da Silva Rocha, Gabriel Masson, Dimas Cassimiro do Nascimento Filho, Katia Maria Mendes, Raphael Augusto de Sousa Dourado, Waldemar Brandao Neto, Patricia Takako Endo","doi":"10.1186/s12884-025-07303-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Low birth weight (LBW) is a health condition that affects over 20 million gestational outcomes worldwide. The current literature indicates that machine learning models have the potential to assist healthcare professionals in predicting LBW and giving them the opportunity to intervene earlier in the pregnancy, which might include adjusting medical treatments or suggesting changes in diet.</p><p><strong>Purpose: </strong>This study proposes the evaluation of machine learning models to predict which pregnant women are at risk of neonatal outcomes with LBW.</p><p><strong>Methods: </strong>The methodology involves six phases, including data analysis and attribute selection through different techniques, which generated four distinct scenarios. We used five machine learning models and validated them through cross-validation and hyper-parameter optimization and evaluated their performance considering seven distinct metrics and statistical analysis, focusing on the effectiveness of the models in predicting LBW.</p><p><strong>Results: </strong>The results revealed that the models achieved varying levels of performance across the scenarios, with the removal of duplicate data resulting in improvements in recall (0.83) and f1-score (0.64). Statistical analysis confirmed significant differences (p < 0.05) among most models.</p><p><strong>Conclusions: </strong>The conclusions of this study indicate that the removal of duplicate data and careful attribute selection positively influenced the performance of the machine learning models in predicting low birth weight. Additionally, the analysis of attribute importance highlighted socio-demographic characteristics and gestational history as the most influential in the training of the models.</p>","PeriodicalId":9033,"journal":{"name":"BMC Pregnancy and Childbirth","volume":"25 1","pages":"207"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863662/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pregnancy and Childbirth","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12884-025-07303-x","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Low birth weight (LBW) is a health condition that affects over 20 million gestational outcomes worldwide. The current literature indicates that machine learning models have the potential to assist healthcare professionals in predicting LBW and giving them the opportunity to intervene earlier in the pregnancy, which might include adjusting medical treatments or suggesting changes in diet.
Purpose: This study proposes the evaluation of machine learning models to predict which pregnant women are at risk of neonatal outcomes with LBW.
Methods: The methodology involves six phases, including data analysis and attribute selection through different techniques, which generated four distinct scenarios. We used five machine learning models and validated them through cross-validation and hyper-parameter optimization and evaluated their performance considering seven distinct metrics and statistical analysis, focusing on the effectiveness of the models in predicting LBW.
Results: The results revealed that the models achieved varying levels of performance across the scenarios, with the removal of duplicate data resulting in improvements in recall (0.83) and f1-score (0.64). Statistical analysis confirmed significant differences (p < 0.05) among most models.
Conclusions: The conclusions of this study indicate that the removal of duplicate data and careful attribute selection positively influenced the performance of the machine learning models in predicting low birth weight. Additionally, the analysis of attribute importance highlighted socio-demographic characteristics and gestational history as the most influential in the training of the models.
期刊介绍:
BMC Pregnancy & Childbirth is an open access, peer-reviewed journal that considers articles on all aspects of pregnancy and childbirth. The journal welcomes submissions on the biomedical aspects of pregnancy, breastfeeding, labor, maternal health, maternity care, trends and sociological aspects of pregnancy and childbirth.