Exploring parental factors influencing low birth weight on the 2022 CDC natality dataset
Authors: Sumaiya Sultana Dola, Camilo E Valderrama
Journal: BMC Medical Informatics and Decision Making, 24(1):367
Published: 2024-11-30
DOI: https://doi.org/10.1186/s12911-024-02783-x
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11608488/pdf/
Abstract
Background and aims: Low birth weight (LBW), defined as a birth weight below 2500 g, is a growing concern in the United States (US). Previous studies have identified several contributing factors, but many analyzed these variables in isolation, which limits their ability to capture the combined influence of multiple factors. Moreover, past research has focused predominantly on maternal health, demographics, and socioeconomic conditions, often neglecting paternal factors such as age, educational level, and ethnicity. Additionally, most studies have used localized datasets, which may not reflect the diversity of the US population. To address these gaps, this study applies machine learning to the 2022 Centers for Disease Control and Prevention's National Natality Dataset to identify the most significant factors contributing to LBW across the US.
Methods: We combined anthropometric, socioeconomic, maternal, and paternal factors to train logistic regression, random forest, XGBoost, conditional inference tree, and attention mechanism models to predict LBW and normal birth weight (NBW) outcomes. These models were interpreted using odds ratio analysis, feature importance, partial dependence plots (PDP), and Shapley Additive Explanations (SHAP) to identify the factors most strongly associated with LBW.
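The odds ratio step mentioned above can be illustrated with a minimal sketch. It makes several assumptions that are not from the study: the data are synthetic, the two feature names (`height_z`, `smoker`) and the generating coefficients are hypothetical, and plain gradient descent stands in for whatever fitting routine the authors used. It shows only how a fitted logistic regression coefficient is turned into an odds ratio by exponentiation.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def make_synthetic(n, seed=0):
    # Synthetic stand-in data; NOT the CDC natality dataset.
    # The generating coefficients are arbitrary illustrative values.
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        height_z = rng.gauss(0.0, 1.0)             # standardized maternal height (hypothetical)
        smoker = 1.0 if rng.random() < 0.2 else 0.0  # smoking indicator (hypothetical)
        logit = -2.0 - 0.5 * height_z + 0.8 * smoker
        labels.append(1 if rng.random() < sigmoid(logit) else 0)  # 1 = LBW, 0 = NBW
        rows.append([height_z, smoker])
    return rows, labels


def fit_logistic(X, y, lr=2.0, epochs=300):
    # Batch gradient descent on the mean logistic loss.
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


feature_names = ["height_z", "smoker"]
X, y = make_synthetic(3000)
weights, intercept = fit_logistic(X, y)

# Odds ratio for a one-unit increase in each feature: exp(coefficient).
odds_ratios = {name: math.exp(c) for name, c in zip(feature_names, weights)}
for name, ratio in odds_ratios.items():
    print(f"{name}: odds ratio = {ratio:.2f}")
```

An odds ratio below 1 means higher values of the feature go with lower odds of LBW in the fitted model, and a ratio above 1 means higher odds. A full analysis would also report confidence intervals, which this sketch omits.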
Results: Across all five models, the factors most consistently associated with birth weight were maternal height, pre-pregnancy weight, weight gain during pregnancy, and parental ethnicity. Other pregnancy-related factors, such as prenatal visits and avoiding smoking, also significantly influenced birth weight.
Conclusion: The relevance of maternal anthropometric factors, pregnancy weight gain, and parental ethnicity can help explain the current differences in LBW and NBW rates among ethnic groups in the US. Ethnic groups with shorter average stature, such as Asians and Hispanics, are more likely to have newborns below the World Health Organization's 2500-gram threshold. Additionally, ethnic groups that have historically faced challenges in accessing nutrition and perinatal care are at higher risk of delivering LBW infants.
About the journal:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.