Nishit Aman, Kasemsan Manomaiphiboon, Di Xian, Ling Gao, Lin Tian, Natchanok Pala-En, Yangjun Wang, Komsilp Wangyao
{"title":"Spatiotemporal estimation of hourly PM2.5 using AOD derived from geostationary satellite Fengyun-4A and machine learning models for Greater Bangkok","authors":"Nishit Aman, Kasemsan Manomaiphiboon, Di Xian, Ling Gao, Lin Tian, Natchanok Pala-En, Yangjun Wang, Komsilp Wangyao","doi":"10.1007/s11869-024-01524-3","DOIUrl":null,"url":null,"abstract":"<div><p>This study used four individual machine learning (ML) models (random forest, adaptive boosting, gradient boosting, and extreme gradient boosting), and a stacked ensemble model (SEM) for PM<sub>2.5</sub> estimation over Greater Bangkok (GBK) during the dry season for 2018–2022. Aerosol optical depth (AOD) from Fengyun-4A satellite was used as the main predictor variable. The other predictor variables include meteorological variables, fire hotspots, vegetation index, terrain elevation, and population density. Surface PM<sub>2.5</sub> from 17 air quality monitoring stations was used for model development and evaluation. Satellite AOD aligns reasonably well with AOD from two AERONET stations in the study area in terms of correlation coefficient (<i>r</i>), mean bias (MB), mean error (ME), and root mean square error (RMSE). Among the individual models, adaptive boosting performed the best with <i>r</i> = 0.75, MB = 0.55 µg m<sup>−3</sup>, ME = 9.1 µg m<sup>−3</sup>, and RMSE = 12.9 µg m<sup>−3</sup>. As for SEM which comprises all individual models, it outperformed every individual model, with <i>r</i> = 0.84, zero MB, ME = 7.2 µg m<sup>−3</sup>, and RMSE = 10.4 µg m<sup>−3</sup>. In two additional cases of haze hours and clean hours, SEM is best overall while adaptive boosting is superior to the other individual ML models. The case of haze hours has lower model predictability, suggesting elevated PM<sub>2.5</sub> is difficult to predict. SEM was thus chosen to map PM<sub>2.5</sub> as well as exposure intensity over GBK. Good agreement between the observed and predicted diurnal and monthly patterns is achieved by every model. PM<sub>2.5</sub> tends to be relatively high at 08–10 LT and declines in later hours, corresponding to higher traffic emissions in the morning and daytime meteorological conditions more favorable to dilute air pollutants, respectively. PM<sub>2.5</sub> intensifies in the winter but decreases in March and April. During these two months, the areas outside Bangkok tend to have higher PM<sub>2.5</sub> than within Bangkok, possibly linked to active summertime biomass burning in those areas that are less urbanized with more agricultural lands. Relatively high exposure intensity is constrained to Bangkok due likely to its much denser population. The findings indicate a significant potential for leveraging the Fengyun-4A satellite data and ML to advance space-based air quality monitoring for Thailand and other data-scare regions in Southeast Asia. A satellite-based PM<sub>2.5</sub> dataset could support the formulation of effective air quality management strategies in GBK.</p></div>","PeriodicalId":49109,"journal":{"name":"Air Quality Atmosphere and Health","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Air Quality Atmosphere and Health","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s11869-024-01524-3","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study used four individual machine learning (ML) models (random forest, adaptive boosting, gradient boosting, and extreme gradient boosting), and a stacked ensemble model (SEM) for PM2.5 estimation over Greater Bangkok (GBK) during the dry season for 2018–2022. Aerosol optical depth (AOD) from Fengyun-4A satellite was used as the main predictor variable. The other predictor variables include meteorological variables, fire hotspots, vegetation index, terrain elevation, and population density. Surface PM2.5 from 17 air quality monitoring stations was used for model development and evaluation. Satellite AOD aligns reasonably well with AOD from two AERONET stations in the study area in terms of correlation coefficient (r), mean bias (MB), mean error (ME), and root mean square error (RMSE). Among the individual models, adaptive boosting performed the best with r = 0.75, MB = 0.55 µg m−3, ME = 9.1 µg m−3, and RMSE = 12.9 µg m−3. As for SEM which comprises all individual models, it outperformed every individual model, with r = 0.84, zero MB, ME = 7.2 µg m−3, and RMSE = 10.4 µg m−3. In two additional cases of haze hours and clean hours, SEM is best overall while adaptive boosting is superior to the other individual ML models. The case of haze hours has lower model predictability, suggesting elevated PM2.5 is difficult to predict. SEM was thus chosen to map PM2.5 as well as exposure intensity over GBK. Good agreement between the observed and predicted diurnal and monthly patterns is achieved by every model. PM2.5 tends to be relatively high at 08–10 LT and declines in later hours, corresponding to higher traffic emissions in the morning and daytime meteorological conditions more favorable to dilute air pollutants, respectively. PM2.5 intensifies in the winter but decreases in March and April. During these two months, the areas outside Bangkok tend to have higher PM2.5 than within Bangkok, possibly linked to active summertime biomass burning in those areas that are less urbanized with more agricultural lands. Relatively high exposure intensity is constrained to Bangkok due likely to its much denser population. The findings indicate a significant potential for leveraging the Fengyun-4A satellite data and ML to advance space-based air quality monitoring for Thailand and other data-scare regions in Southeast Asia. A satellite-based PM2.5 dataset could support the formulation of effective air quality management strategies in GBK.
期刊介绍:
Air Quality, Atmosphere, and Health is a multidisciplinary journal which, by its very name, illustrates the broad range of work it publishes and which focuses on atmospheric consequences of human activities and their implications for human and ecological health.
It offers research papers, critical literature reviews and commentaries, as well as special issues devoted to topical subjects or themes.
International in scope, the journal presents papers that inform and stimulate a global readership, as the topic addressed are global in their import. Consequently, we do not encourage submission of papers involving local data that relate to local problems. Unless they demonstrate wide applicability, these are better submitted to national or regional journals.
Air Quality, Atmosphere & Health addresses such topics as acid precipitation; airborne particulate matter; air quality monitoring and management; exposure assessment; risk assessment; indoor air quality; atmospheric chemistry; atmospheric modeling and prediction; air pollution climatology; climate change and air quality; air pollution measurement; atmospheric impact assessment; forest-fire emissions; atmospheric science; greenhouse gases; health and ecological effects; clean air technology; regional and global change and satellite measurements.
This journal benefits a diverse audience of researchers, public health officials and policy makers addressing problems that call for solutions based in evidence from atmospheric and exposure assessment scientists, epidemiologists, and risk assessors. Publication in the journal affords the opportunity to reach beyond defined disciplinary niches to this broader readership.