Using explainable machine learning to better understand the environmental and socioeconomic contributions to preterm birth

IF 7.3 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Shuyu Li , Yuqing Dai , Ying Chen , Fang Liu , Jun Zhang
{"title":"Using explainable machine learning to better understand the environmental and socioeconomic contributions to preterm birth","authors":"Shuyu Li ,&nbsp;Yuqing Dai ,&nbsp;Ying Chen ,&nbsp;Fang Liu ,&nbsp;Jun Zhang","doi":"10.1016/j.envpol.2025.127183","DOIUrl":null,"url":null,"abstract":"<div><div>Preterm birth (PTB) remains a leading cause of child mortality, yet the role of ambient air pollution remains disputed. In a cohort of 52,642 singleton births in Southwest China (2020–2023), we combined an automated machine-learning (AutoML) method with SHapley Additive exPlanations (SHAP) to rank and quantify 12 environmental, clinical, and sociodemographic predictors of PTB. Environmental factors collectively explained 48.5 % of the importance of the model features, with residential ambient PM<sub>2.5</sub> (20.7 %), elevation (17.3 %), and the normalized difference vegetation index (NDVI, 10.5 %) emerging as the top three contributors. The exposure–response curve demonstrated a marked increase in PTB risk above a PM<sub>2.5</sub> threshold of 50 μg/m<sup>3</sup>, with a mean SHAP value of 1.81 (CI: 1.75–1.87). The adverse effects of PM<sub>2.5</sub> were amplified among mothers with low educational levels (mean SHAP value 1.85 vs. 1.78 in the high education group) and varied by infant sex, with female infants exhibiting greater susceptibility when PM<sub>2.5</sub> concentrations were in the highest exposure bin (50–80 μg/m<sup>3</sup>). This study introduces a reproducible AutoML–SHAP framework for comprehensive PTB risk quantification and highlights that stringent air-quality control, coupled with targeted interventions, could substantially reduce prematurity.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"386 ","pages":"Article 127183"},"PeriodicalIF":7.3000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026974912501557X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Preterm birth (PTB) remains a leading cause of child mortality, yet the role of ambient air pollution remains disputed. In a cohort of 52,642 singleton births in Southwest China (2020–2023), we combined an automated machine-learning (AutoML) method with SHapley Additive exPlanations (SHAP) to rank and quantify 12 environmental, clinical, and sociodemographic predictors of PTB. Environmental factors collectively explained 48.5 % of the importance of the model features, with residential ambient PM2.5 (20.7 %), elevation (17.3 %), and the normalized difference vegetation index (NDVI, 10.5 %) emerging as the top three contributors. The exposure–response curve demonstrated a marked increase in PTB risk above a PM2.5 threshold of 50 μg/m3, with a mean SHAP value of 1.81 (CI: 1.75–1.87). The adverse effects of PM2.5 were amplified among mothers with low educational levels (mean SHAP value 1.85 vs. 1.78 in the high education group) and varied by infant sex, with female infants exhibiting greater susceptibility when PM2.5 concentrations were in the highest exposure bin (50–80 μg/m3). This study introduces a reproducible AutoML–SHAP framework for comprehensive PTB risk quantification and highlights that stringent air-quality control, coupled with targeted interventions, could substantially reduce prematurity.

Abstract Image

Abstract Image

使用可解释的机器学习来更好地理解环境和社会经济对早产的贡献
早产(PTB)仍然是儿童死亡的主要原因,但环境空气污染的作用仍然存在争议。在中国西南地区(2020-2023)的52,642例单胎分娩队列中,我们将自动机器学习(AutoML)方法与SHapley加性解释(SHAP)相结合,对12种PTB的环境、临床和社会人口学预测因素进行了排名和量化。环境因素共同解释了48.5%的模型特征的重要性,其中住宅环境PM2.5(20.7%),海拔(17.3%)和归一化植被指数(NDVI, 10.5%)成为前三位贡献者。暴露-响应曲线显示,PM2.5阈值超过50 μg/m3时,PTB风险显著增加,平均SHAP值为1.81 (CI: 1.75-1.87)。PM2.5的不良影响在受教育程度低的母亲中被放大(平均SHAP值为1.85,而受教育程度高的母亲为1.78),并因婴儿性别而异,当PM2.5浓度处于最高暴露区(50-80 μg/m3)时,女婴表现出更大的易感性。本研究引入了一个可重复的AutoML-SHAP框架,用于全面量化肺结核风险,并强调严格的空气质量控制,加上有针对性的干预措施,可以大大减少早产。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Pollution
Environmental Pollution 环境科学-环境科学
CiteScore
16.00
自引率
6.70%
发文量
2082
审稿时长
2.9 months
期刊介绍: Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health. Subject areas include, but are not limited to: • Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies; • Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change; • Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects; • Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects; • Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest; • New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信