Obaidullah Salehie, Mohamad Hidayat Bin Jamal, Shamsuddin Shahid
{"title":"Characterization and prediction of PM2.5 levels in Afghanistan using machine learning techniques","authors":"Obaidullah Salehie, Mohamad Hidayat Bin Jamal, Shamsuddin Shahid","doi":"10.1007/s00704-024-05172-6","DOIUrl":null,"url":null,"abstract":"<p>Afghanistan faces severe air quality issues in major cities due to various sources like transportation, domestic energy use, and industrial activity. This study investigates PM2.5 spatiotemporal variability and its future relationship with six meteorological variables: precipitation, temperature, dewpoint temperature, wind speed, boundary layer height and surface pressure. This study aims to assess the spatiotemporal variability of PM2.5 concentrations in Afghanistan and derive models for predicting PM2.5 from the six variables. Satellite-measured PM2.5 and six reanalyses (ERA5) meteorological datasets for 1998–2020 were used as predictors. Three machine learning models, AdaBoost, Random Forest (RF), and Support Vector Machine (SVM), were used to develop the annual and seasonal PM2.5 concentration prediction model. Results suggest PM2.5 levels ranging from 60–80 µg/m<sup>3</sup> in northern, southern, and western regions, while other areas experience lower levels (12–50 µg/m<sup>3</sup>). The lowest PM2.5 concentrations are in the Hindu Kush mountain range. Summer exhibited the highest PM2.5 concentrations, reaching a maximum of 137.4 µg/m<sup>3</sup> and an average of 48.5 µg/m<sup>3</sup>. Among the prediction models, RF performed best in predicting PM2.5 across Afghanistan, as evidenced by the evaluation metrics: NRMSE (59.2), RSR (0.59), rSD (0.75), and higher values of NSE (0.65), R<sup>2</sup> (0.65), and KGE (0.68). The geographical and seasonal distribution of observed PM2.5 distribution was very similar to the PM2.5 estimated using RF compared to the other two models. The analysis showed that air temperature, precipitation, wind speeds, and boundary layer heights play significant roles in PM2.5 distribution. However, the relationship between precipitation and PM2.5 was more pronounced than other meteorological variables.</p>","PeriodicalId":22945,"journal":{"name":"Theoretical and Applied Climatology","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Applied Climatology","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s00704-024-05172-6","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Afghanistan faces severe air quality issues in major cities due to various sources like transportation, domestic energy use, and industrial activity. This study investigates PM2.5 spatiotemporal variability and its future relationship with six meteorological variables: precipitation, temperature, dewpoint temperature, wind speed, boundary layer height and surface pressure. This study aims to assess the spatiotemporal variability of PM2.5 concentrations in Afghanistan and derive models for predicting PM2.5 from the six variables. Satellite-measured PM2.5 and six reanalyses (ERA5) meteorological datasets for 1998–2020 were used as predictors. Three machine learning models, AdaBoost, Random Forest (RF), and Support Vector Machine (SVM), were used to develop the annual and seasonal PM2.5 concentration prediction model. Results suggest PM2.5 levels ranging from 60–80 µg/m3 in northern, southern, and western regions, while other areas experience lower levels (12–50 µg/m3). The lowest PM2.5 concentrations are in the Hindu Kush mountain range. Summer exhibited the highest PM2.5 concentrations, reaching a maximum of 137.4 µg/m3 and an average of 48.5 µg/m3. Among the prediction models, RF performed best in predicting PM2.5 across Afghanistan, as evidenced by the evaluation metrics: NRMSE (59.2), RSR (0.59), rSD (0.75), and higher values of NSE (0.65), R2 (0.65), and KGE (0.68). The geographical and seasonal distribution of observed PM2.5 distribution was very similar to the PM2.5 estimated using RF compared to the other two models. The analysis showed that air temperature, precipitation, wind speeds, and boundary layer heights play significant roles in PM2.5 distribution. However, the relationship between precipitation and PM2.5 was more pronounced than other meteorological variables.
期刊介绍:
Theoretical and Applied Climatology covers the following topics:
- climate modeling, climatic changes and climate forecasting, micro- to mesoclimate, applied meteorology as in agro- and forestmeteorology, biometeorology, building meteorology and atmospheric radiation problems as they relate to the biosphere
- effects of anthropogenic and natural aerosols or gaseous trace constituents
- hardware and software elements of meteorological measurements, including techniques of remote sensing