利用机器学习和可解释的人工智能，用妊娠早期和中期产前标记预测出生体重。

IF 3.9 2区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Scientific Reports Pub Date : 2025-07-19 DOI:10.1038/s41598-025-11837-7

Manohar Pavanya, Krishnaraj Chadaga, Vennila J, Akhila Vasudeva, Bhamini Krishna Rao, Srikanth Prabhu, Shashikala K Bhat

{"title":"利用机器学习和可解释的人工智能，用妊娠早期和中期产前标记预测出生体重。","authors":"Manohar Pavanya, Krishnaraj Chadaga, Vennila J, Akhila Vasudeva, Bhamini Krishna Rao, Srikanth Prabhu, Shashikala K Bhat","doi":"10.1038/s41598-025-11837-7","DOIUrl":null,"url":null,"abstract":"Low birthweight (LBW) is a significant health challenge worldwide, as these neonates experience both short- and long-term disabilities. Factors affecting maternal and fetal health during early to mid-pregnancy can greatly influence fetal development. Prediction of birthweight using machine learning (ML) models with antenatal data may help in better clinical management. However, the lack of explainability in these models has raised concerns within the medical community. To address this issue, our study aims to develop a more practical ML model by incorporating explainable artificial intelligence (XAI). We prospectively collected real-world clinical data of 19 maternal and fetal clinical features from 237 singleton pregnancies. Statistical analyses were conducted using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version: 0.18.3). Multiple ML classifiers were employed. We developed a stacked ensemble model that integrated various algorithms, including a custom-stacked ensemble approach and three XAI methodologies: Shapley Additive Explanations (SHAP), Local Interpretable Model-agnostic Explanations (LIME), and Anchor. These methods provided meaningful explanations to help construct reliable and optimal clinical predictive models. Among the ML classifiers evaluated, the AdaBoost model achieved the highest performance, with a maximum accuracy of 77%, a precision of 73%, a recall of 77%, and an F1 score of 72%. The stacked model demonstrated an accuracy of 75%, indicating its possibility in clinical application. However, the accuracy of these models might be affected by the limited dataset, which included pregnant women undergoing treatment for thyroid abnormalities, diabetes, and hypertension. Our developed model identified several key attributes that influence birthweight, such as maternal height, nuchal translucency thickness, parity, crown-rump length, glycated hemoglobin, hypertensive disorders of pregnancy, and pregnancy-associated plasma protein A. This model can assist medical professionals in making more precise birthweight predictions using routinely collected antenatal parameters, enabling timely medical decisions and treatments.","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"26223"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12276256/pdf/","citationCount":"0","resultStr":"{\"title\":\"Prediction of birthweight with early and mid-pregnancy antenatal markers utilising machine learning and explainable artificial intelligence.\",\"authors\":\"Manohar Pavanya, Krishnaraj Chadaga, Vennila J, Akhila Vasudeva, Bhamini Krishna Rao, Srikanth Prabhu, Shashikala K Bhat\",\"doi\":\"10.1038/s41598-025-11837-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Low birthweight (LBW) is a significant health challenge worldwide, as these neonates experience both short- and long-term disabilities. Factors affecting maternal and fetal health during early to mid-pregnancy can greatly influence fetal development. Prediction of birthweight using machine learning (ML) models with antenatal data may help in better clinical management. However, the lack of explainability in these models has raised concerns within the medical community. To address this issue, our study aims to develop a more practical ML model by incorporating explainable artificial intelligence (XAI). We prospectively collected real-world clinical data of 19 maternal and fetal clinical features from 237 singleton pregnancies. Statistical analyses were conducted using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version: 0.18.3). Multiple ML classifiers were employed. We developed a stacked ensemble model that integrated various algorithms, including a custom-stacked ensemble approach and three XAI methodologies: Shapley Additive Explanations (SHAP), Local Interpretable Model-agnostic Explanations (LIME), and Anchor. These methods provided meaningful explanations to help construct reliable and optimal clinical predictive models. Among the ML classifiers evaluated, the AdaBoost model achieved the highest performance, with a maximum accuracy of 77%, a precision of 73%, a recall of 77%, and an F1 score of 72%. The stacked model demonstrated an accuracy of 75%, indicating its possibility in clinical application. However, the accuracy of these models might be affected by the limited dataset, which included pregnant women undergoing treatment for thyroid abnormalities, diabetes, and hypertension. Our developed model identified several key attributes that influence birthweight, such as maternal height, nuchal translucency thickness, parity, crown-rump length, glycated hemoglobin, hypertensive disorders of pregnancy, and pregnancy-associated plasma protein A. This model can assist medical professionals in making more precise birthweight predictions using routinely collected antenatal parameters, enabling timely medical decisions and treatments.\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"26223\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12276256/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-11837-7\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-11837-7","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

低出生体重（LBW）是世界范围内的一个重大健康挑战，因为这些新生儿经历短期和长期残疾。在怀孕早期到中期影响母体和胎儿健康的因素可以极大地影响胎儿的发育。使用产前数据的机器学习（ML）模型预测出生体重可能有助于更好的临床管理。然而，这些模型缺乏可解释性引起了医学界的关注。为了解决这个问题，我们的研究旨在通过结合可解释的人工智能（XAI）来开发一个更实用的ML模型。我们前瞻性地收集了237例单胎妊娠的19个母胎临床特征的真实临床数据。采用Jamovi（版本：2.6.26）和JASP team (2024) JASP（版本：0.18.3）进行统计分析。使用多个ML分类器。我们开发了一个集成了各种算法的堆叠集成模型，包括自定义堆叠集成方法和三种XAI方法：Shapley加性解释（SHAP）、局部可解释模型不可知论解释（LIME）和锚。这些方法为构建可靠、优化的临床预测模型提供了有意义的解释。在评估的ML分类器中，AdaBoost模型的性能最高，最大准确率为77%，精度为73%，召回率为77%，F1得分为72%。叠加模型的准确率达到75%，具有临床应用的可能性。然而，这些模型的准确性可能会受到有限数据集的影响，其中包括正在接受甲状腺异常、糖尿病和高血压治疗的孕妇。我们开发的模型确定了影响出生体重的几个关键属性，如产妇身高、颈部半透明厚度、胎次、冠臀长、糖化血红蛋白、妊娠高血压疾病和妊娠相关血浆蛋白a。该模型可以帮助医疗专业人员使用常规收集的产前参数做出更精确的出生体重预测，从而及时做出医疗决策和治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Prediction of birthweight with early and mid-pregnancy antenatal markers utilising machine learning and explainable artificial intelligence.

查看原文本刊更多论文

Prediction of birthweight with early and mid-pregnancy antenatal markers utilising machine learning and explainable artificial intelligence.

Low birthweight (LBW) is a significant health challenge worldwide, as these neonates experience both short- and long-term disabilities. Factors affecting maternal and fetal health during early to mid-pregnancy can greatly influence fetal development. Prediction of birthweight using machine learning (ML) models with antenatal data may help in better clinical management. However, the lack of explainability in these models has raised concerns within the medical community. To address this issue, our study aims to develop a more practical ML model by incorporating explainable artificial intelligence (XAI). We prospectively collected real-world clinical data of 19 maternal and fetal clinical features from 237 singleton pregnancies. Statistical analyses were conducted using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version: 0.18.3). Multiple ML classifiers were employed. We developed a stacked ensemble model that integrated various algorithms, including a custom-stacked ensemble approach and three XAI methodologies: Shapley Additive Explanations (SHAP), Local Interpretable Model-agnostic Explanations (LIME), and Anchor. These methods provided meaningful explanations to help construct reliable and optimal clinical predictive models. Among the ML classifiers evaluated, the AdaBoost model achieved the highest performance, with a maximum accuracy of 77%, a precision of 73%, a recall of 77%, and an F1 score of 72%. The stacked model demonstrated an accuracy of 75%, indicating its possibility in clinical application. However, the accuracy of these models might be affected by the limited dataset, which included pregnant women undergoing treatment for thyroid abnormalities, diabetes, and hypertension. Our developed model identified several key attributes that influence birthweight, such as maternal height, nuchal translucency thickness, parity, crown-rump length, glycated hemoglobin, hypertensive disorders of pregnancy, and pregnancy-associated plasma protein A. This model can assist medical professionals in making more precise birthweight predictions using routinely collected antenatal parameters, enabling timely medical decisions and treatments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Scientific Reports Natural Science Disciplines-

CiteScore

7.50

自引率

4.30%

发文量

19567

审稿时长

3.9 months

期刊介绍： We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections. Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021). •Engineering Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live. •Physical sciences Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics. •Earth and environmental sciences Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems. •Biological sciences Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants. •Health sciences The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.