基于健康风险评估和高时空分辨率的韩国全国机器学习-集成PM2.5映射预测和预测模型

IF 6.3
Seoyeong Ahn, Ayoung Kim, Yeonseung Chung, Cinoo Kang, Sooyoung Kim, Dohoon Kwon, Jiwoo Park, Jieun Oh, Jinah Park, Jeongmin Moon, Insung Song, Jieun Min, Hyung Joo Lee, Ho Kim and Whanhee Lee*, 
{"title":"基于健康风险评估和高时空分辨率的韩国全国机器学习-集成PM2.5映射预测和预测模型","authors":"Seoyeong Ahn,&nbsp;Ayoung Kim,&nbsp;Yeonseung Chung,&nbsp;Cinoo Kang,&nbsp;Sooyoung Kim,&nbsp;Dohoon Kwon,&nbsp;Jiwoo Park,&nbsp;Jieun Oh,&nbsp;Jinah Park,&nbsp;Jeongmin Moon,&nbsp;Insung Song,&nbsp;Jieun Min,&nbsp;Hyung Joo Lee,&nbsp;Ho Kim and Whanhee Lee*,&nbsp;","doi":"10.1021/envhealth.4c00201","DOIUrl":null,"url":null,"abstract":"<p >Several <b>s</b>tudies developed machine learning-based PM<sub>2.5</sub> prediction models; however, nationwide models addressing both mapping prediction and forecasting were limited. Further, although the prediction accuracy is different from PM<sub>2.5</sub>-related health risk estimation, previous studies solely examined the prediction accuracy. This study suggests a method to assess the statistical properties of PM<sub>2.5</sub>-health risk estimation, which also can be used as a model selection. We used three machine learning algorithms and an ensemble method to construct PM<sub>2.5</sub> mapping prediction (1 km<sup>2</sup>) and two-day forecasting models majorly using satellite-driven data in South Korea (2015–2022). We performed a simulation study to examine the statistical properties of short-term PM<sub>2.5</sub> risk estimation using prediction models. Our ensemble spatial prediction model showed better performance than single algorithms (0.956 test <i>R</i><sup>2</sup>). The range of the <i>R</i><sup>2</sup> values was 0.78–0.98 across the monitoring sites. The average % bias was from 1.403%–1.787% when our mapping models for PM<sub>2.5</sub>-mortality risk estimation, compared to the estimates from monitored PM<sub>2.5</sub>. The best <i>R</i><sup>2</sup> of our forecasting models was 0.904. This study developed machine learning models for spatial PM<sub>2.5</sub> predictions and forecasting in Korea. This study also suggested a method to address risk estimation and model selection concurrently when multiple prediction models were used.</p>","PeriodicalId":29795,"journal":{"name":"Environment & Health","volume":"3 8","pages":"878–887"},"PeriodicalIF":6.3000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/envhealth.4c00201","citationCount":"0","resultStr":"{\"title\":\"Nationwide Machine Learning-Ensemble PM2.5 Mapping Prediction and Forecasting Models in South Korea with High Spatiotemporal Resolution and Health Risk Estimation-Based Evaluations\",\"authors\":\"Seoyeong Ahn,&nbsp;Ayoung Kim,&nbsp;Yeonseung Chung,&nbsp;Cinoo Kang,&nbsp;Sooyoung Kim,&nbsp;Dohoon Kwon,&nbsp;Jiwoo Park,&nbsp;Jieun Oh,&nbsp;Jinah Park,&nbsp;Jeongmin Moon,&nbsp;Insung Song,&nbsp;Jieun Min,&nbsp;Hyung Joo Lee,&nbsp;Ho Kim and Whanhee Lee*,&nbsp;\",\"doi\":\"10.1021/envhealth.4c00201\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Several <b>s</b>tudies developed machine learning-based PM<sub>2.5</sub> prediction models; however, nationwide models addressing both mapping prediction and forecasting were limited. Further, although the prediction accuracy is different from PM<sub>2.5</sub>-related health risk estimation, previous studies solely examined the prediction accuracy. This study suggests a method to assess the statistical properties of PM<sub>2.5</sub>-health risk estimation, which also can be used as a model selection. We used three machine learning algorithms and an ensemble method to construct PM<sub>2.5</sub> mapping prediction (1 km<sup>2</sup>) and two-day forecasting models majorly using satellite-driven data in South Korea (2015–2022). We performed a simulation study to examine the statistical properties of short-term PM<sub>2.5</sub> risk estimation using prediction models. Our ensemble spatial prediction model showed better performance than single algorithms (0.956 test <i>R</i><sup>2</sup>). The range of the <i>R</i><sup>2</sup> values was 0.78–0.98 across the monitoring sites. The average % bias was from 1.403%–1.787% when our mapping models for PM<sub>2.5</sub>-mortality risk estimation, compared to the estimates from monitored PM<sub>2.5</sub>. The best <i>R</i><sup>2</sup> of our forecasting models was 0.904. This study developed machine learning models for spatial PM<sub>2.5</sub> predictions and forecasting in Korea. This study also suggested a method to address risk estimation and model selection concurrently when multiple prediction models were used.</p>\",\"PeriodicalId\":29795,\"journal\":{\"name\":\"Environment & Health\",\"volume\":\"3 8\",\"pages\":\"878–887\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/pdf/10.1021/envhealth.4c00201\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment & Health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/envhealth.4c00201\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment & Health","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/envhealth.4c00201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

一些研究开发了基于机器学习的PM2.5预测模型;然而,同时处理制图预测和预测的全国性模型是有限的。此外,虽然预测精度与pm2.5相关的健康风险估计不同,但以往的研究只考察了预测精度。本研究提出了一种评估pm2.5健康风险估计统计特性的方法,该方法也可用于模型选择。我们使用三种机器学习算法和集成方法构建PM2.5映射预测(1 km2)和主要使用韩国卫星驱动数据的两天预测模型(2015-2022)。我们进行了一项模拟研究,以检验使用预测模型估算PM2.5短期风险的统计特性。集成空间预测模型的预测效果优于单一算法(0.956检验R2)。各监测点的R2范围为0.78 ~ 0.98。与监测PM2.5的估算相比,我们的PM2.5死亡率风险估算映射模型的平均偏差为1.403%-1.787%。预测模型的最佳R2为0.904。该研究开发了用于韩国空间PM2.5预测和预报的机器学习模型。本研究还提出了一种在使用多个预测模型时同时处理风险估计和模型选择的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Nationwide Machine Learning-Ensemble PM2.5 Mapping Prediction and Forecasting Models in South Korea with High Spatiotemporal Resolution and Health Risk Estimation-Based Evaluations

Several studies developed machine learning-based PM2.5 prediction models; however, nationwide models addressing both mapping prediction and forecasting were limited. Further, although the prediction accuracy is different from PM2.5-related health risk estimation, previous studies solely examined the prediction accuracy. This study suggests a method to assess the statistical properties of PM2.5-health risk estimation, which also can be used as a model selection. We used three machine learning algorithms and an ensemble method to construct PM2.5 mapping prediction (1 km2) and two-day forecasting models majorly using satellite-driven data in South Korea (2015–2022). We performed a simulation study to examine the statistical properties of short-term PM2.5 risk estimation using prediction models. Our ensemble spatial prediction model showed better performance than single algorithms (0.956 test R2). The range of the R2 values was 0.78–0.98 across the monitoring sites. The average % bias was from 1.403%–1.787% when our mapping models for PM2.5-mortality risk estimation, compared to the estimates from monitored PM2.5. The best R2 of our forecasting models was 0.904. This study developed machine learning models for spatial PM2.5 predictions and forecasting in Korea. This study also suggested a method to address risk estimation and model selection concurrently when multiple prediction models were used.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environment & Health
Environment & Health 环境科学、健康科学-
自引率
0.00%
发文量
0
期刊介绍: Environment & Health a peer-reviewed open access journal is committed to exploring the relationship between the environment and human health.As a premier journal for multidisciplinary research Environment & Health reports the health consequences for individuals and communities of changing and hazardous environmental factors. In supporting the UN Sustainable Development Goals the journal aims to help formulate policies to create a healthier world.Topics of interest include but are not limited to:Air water and soil pollutionExposomicsEnvironmental epidemiologyInnovative analytical methodology and instrumentation (multi-omics non-target analysis effect-directed analysis high-throughput screening etc.)Environmental toxicology (endocrine disrupting effect neurotoxicity alternative toxicology computational toxicology epigenetic toxicology etc.)Environmental microbiology pathogen and environmental transmission mechanisms of diseasesEnvironmental modeling bioinformatics and artificial intelligenceEmerging contaminants (including plastics engineered nanomaterials etc.)Climate change and related health effectHealth impacts of energy evolution and carbon neutralizationFood and drinking water safetyOccupational exposure and medicineInnovations in environmental technologies for better healthPolicies and international relations concerned with environmental health
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信