Shuyu Li , Yuqing Dai , Ying Chen , Fang Liu , Jun Zhang
{"title":"使用可解释的机器学习来更好地理解环境和社会经济对早产的贡献","authors":"Shuyu Li , Yuqing Dai , Ying Chen , Fang Liu , Jun Zhang","doi":"10.1016/j.envpol.2025.127183","DOIUrl":null,"url":null,"abstract":"<div><div>Preterm birth (PTB) remains a leading cause of child mortality, yet the role of ambient air pollution remains disputed. In a cohort of 52,642 singleton births in Southwest China (2020–2023), we combined an automated machine-learning (AutoML) method with SHapley Additive exPlanations (SHAP) to rank and quantify 12 environmental, clinical, and sociodemographic predictors of PTB. Environmental factors collectively explained 48.5 % of the importance of the model features, with residential ambient PM<sub>2.5</sub> (20.7 %), elevation (17.3 %), and the normalized difference vegetation index (NDVI, 10.5 %) emerging as the top three contributors. The exposure–response curve demonstrated a marked increase in PTB risk above a PM<sub>2.5</sub> threshold of 50 μg/m<sup>3</sup>, with a mean SHAP value of 1.81 (CI: 1.75–1.87). The adverse effects of PM<sub>2.5</sub> were amplified among mothers with low educational levels (mean SHAP value 1.85 vs. 1.78 in the high education group) and varied by infant sex, with female infants exhibiting greater susceptibility when PM<sub>2.5</sub> concentrations were in the highest exposure bin (50–80 μg/m<sup>3</sup>). This study introduces a reproducible AutoML–SHAP framework for comprehensive PTB risk quantification and highlights that stringent air-quality control, coupled with targeted interventions, could substantially reduce prematurity.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"386 ","pages":"Article 127183"},"PeriodicalIF":7.3000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using explainable machine learning to better understand the environmental and socioeconomic contributions to preterm birth\",\"authors\":\"Shuyu Li , Yuqing Dai , Ying Chen , Fang Liu , Jun Zhang\",\"doi\":\"10.1016/j.envpol.2025.127183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Preterm birth (PTB) remains a leading cause of child mortality, yet the role of ambient air pollution remains disputed. In a cohort of 52,642 singleton births in Southwest China (2020–2023), we combined an automated machine-learning (AutoML) method with SHapley Additive exPlanations (SHAP) to rank and quantify 12 environmental, clinical, and sociodemographic predictors of PTB. Environmental factors collectively explained 48.5 % of the importance of the model features, with residential ambient PM<sub>2.5</sub> (20.7 %), elevation (17.3 %), and the normalized difference vegetation index (NDVI, 10.5 %) emerging as the top three contributors. The exposure–response curve demonstrated a marked increase in PTB risk above a PM<sub>2.5</sub> threshold of 50 μg/m<sup>3</sup>, with a mean SHAP value of 1.81 (CI: 1.75–1.87). The adverse effects of PM<sub>2.5</sub> were amplified among mothers with low educational levels (mean SHAP value 1.85 vs. 1.78 in the high education group) and varied by infant sex, with female infants exhibiting greater susceptibility when PM<sub>2.5</sub> concentrations were in the highest exposure bin (50–80 μg/m<sup>3</sup>). This study introduces a reproducible AutoML–SHAP framework for comprehensive PTB risk quantification and highlights that stringent air-quality control, coupled with targeted interventions, could substantially reduce prematurity.</div></div>\",\"PeriodicalId\":311,\"journal\":{\"name\":\"Environmental Pollution\",\"volume\":\"386 \",\"pages\":\"Article 127183\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Pollution\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S026974912501557X\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026974912501557X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Using explainable machine learning to better understand the environmental and socioeconomic contributions to preterm birth
Preterm birth (PTB) remains a leading cause of child mortality, yet the role of ambient air pollution remains disputed. In a cohort of 52,642 singleton births in Southwest China (2020–2023), we combined an automated machine-learning (AutoML) method with SHapley Additive exPlanations (SHAP) to rank and quantify 12 environmental, clinical, and sociodemographic predictors of PTB. Environmental factors collectively explained 48.5 % of the importance of the model features, with residential ambient PM2.5 (20.7 %), elevation (17.3 %), and the normalized difference vegetation index (NDVI, 10.5 %) emerging as the top three contributors. The exposure–response curve demonstrated a marked increase in PTB risk above a PM2.5 threshold of 50 μg/m3, with a mean SHAP value of 1.81 (CI: 1.75–1.87). The adverse effects of PM2.5 were amplified among mothers with low educational levels (mean SHAP value 1.85 vs. 1.78 in the high education group) and varied by infant sex, with female infants exhibiting greater susceptibility when PM2.5 concentrations were in the highest exposure bin (50–80 μg/m3). This study introduces a reproducible AutoML–SHAP framework for comprehensive PTB risk quantification and highlights that stringent air-quality control, coupled with targeted interventions, could substantially reduce prematurity.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.