Yu Ding, Jiaxin Dong, Mengfan Teng, Shiyao Meng, Jie Yang, Siwei Li
{"title":"一个特征级集成框架,用于改善美国相邻地区的每日PM2.5估算(2000-2023)","authors":"Yu Ding, Jiaxin Dong, Mengfan Teng, Shiyao Meng, Jie Yang, Siwei Li","doi":"10.1016/j.envsoft.2025.106733","DOIUrl":null,"url":null,"abstract":"Accurate PM<ce:inf loc=\"post\">2.5</ce:inf> surface concentration estimates are vital for air quality management and exposure assessment. This study introduces a novel feature-level ensemble framework to enhance daily PM<ce:inf loc=\"post\">2.5</ce:inf> estimation across the contiguous United States from 2000 to 2023. The framework integrates multiple XGBoost models trained with diverse temporal features, including calendar encodings and physically derived indicators like rolling averages and change rates from reanalysis PM<ce:inf loc=\"post\">2.5</ce:inf>. By capturing complementary pollution dynamics, the ensemble outperforms models using only calendar features. Under spatial cross-validation, R<ce:sup loc=\"post\">2</ce:sup> increases from 0.64 to 0.70 and RMSE drops from 7.87 to 7.31 μg/m<ce:sup loc=\"post\">3</ce:sup>. Temporal extrapolation also improves, with △R<ce:sup loc=\"post\">2</ce:sup> gains of 0.08 (historical backcasting) and 0.07 (future forecasting). These results demonstrate the framework's robustness, generalizability, and value for long-term PM<ce:inf loc=\"post\">2.5</ce:inf> monitoring, epidemiological research, and data-driven air quality policymaking.","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"116 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A feature-level ensemble framework for improving daily PM2.5 estimation across the contiguous United States (2000–2023)\",\"authors\":\"Yu Ding, Jiaxin Dong, Mengfan Teng, Shiyao Meng, Jie Yang, Siwei Li\",\"doi\":\"10.1016/j.envsoft.2025.106733\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate PM<ce:inf loc=\\\"post\\\">2.5</ce:inf> surface concentration estimates are vital for air quality management and exposure assessment. This study introduces a novel feature-level ensemble framework to enhance daily PM<ce:inf loc=\\\"post\\\">2.5</ce:inf> estimation across the contiguous United States from 2000 to 2023. The framework integrates multiple XGBoost models trained with diverse temporal features, including calendar encodings and physically derived indicators like rolling averages and change rates from reanalysis PM<ce:inf loc=\\\"post\\\">2.5</ce:inf>. By capturing complementary pollution dynamics, the ensemble outperforms models using only calendar features. Under spatial cross-validation, R<ce:sup loc=\\\"post\\\">2</ce:sup> increases from 0.64 to 0.70 and RMSE drops from 7.87 to 7.31 μg/m<ce:sup loc=\\\"post\\\">3</ce:sup>. Temporal extrapolation also improves, with △R<ce:sup loc=\\\"post\\\">2</ce:sup> gains of 0.08 (historical backcasting) and 0.07 (future forecasting). These results demonstrate the framework's robustness, generalizability, and value for long-term PM<ce:inf loc=\\\"post\\\">2.5</ce:inf> monitoring, epidemiological research, and data-driven air quality policymaking.\",\"PeriodicalId\":310,\"journal\":{\"name\":\"Environmental Modelling & Software\",\"volume\":\"116 1\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Modelling & Software\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.envsoft.2025.106733\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.envsoft.2025.106733","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A feature-level ensemble framework for improving daily PM2.5 estimation across the contiguous United States (2000–2023)
Accurate PM2.5 surface concentration estimates are vital for air quality management and exposure assessment. This study introduces a novel feature-level ensemble framework to enhance daily PM2.5 estimation across the contiguous United States from 2000 to 2023. The framework integrates multiple XGBoost models trained with diverse temporal features, including calendar encodings and physically derived indicators like rolling averages and change rates from reanalysis PM2.5. By capturing complementary pollution dynamics, the ensemble outperforms models using only calendar features. Under spatial cross-validation, R2 increases from 0.64 to 0.70 and RMSE drops from 7.87 to 7.31 μg/m3. Temporal extrapolation also improves, with △R2 gains of 0.08 (historical backcasting) and 0.07 (future forecasting). These results demonstrate the framework's robustness, generalizability, and value for long-term PM2.5 monitoring, epidemiological research, and data-driven air quality policymaking.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.