{"title":"Long-Term Retrospective Predicted Concentration of PM<sub>2.5</sub> in Upper Northern Thailand Using Machine Learning Models.","authors":"Sawaeng Kawichai, Patumrat Sripan, Amaraporn Rerkasem, Kittipan Rerkasem, Worawut Srisukkham","doi":"10.3390/toxics13030170","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to build, for the first time, a model that uses a machine learning (ML) approach to predict long-term retrospective PM<sub>2.5</sub> concentrations in upper northern Thailand, a region impacted by biomass burning and transboundary pollution. The dataset includes PM<sub>10</sub> levels, fire hotspots, and critical meteorological data from 1 January 2011 to 31 December 2020. ML techniques, namely multi-layer perceptron neural network (MLP), support vector machine (SVM), multiple linear regression (MLR), decision tree (DT), and random forests (RF), were used to construct the prediction models. The best ML prediction model was selected considering root mean square error (RMSE), mean prediction error (MPE), relative prediction error (RPE) (the lower, the better), and coefficient of determination (R<sup>2</sup>) (the bigger, the better). Our study found that the ML model-based RF technique using PM<sub>10</sub>, CO<sub>2</sub>, O<sub>3</sub>, fire hotspots, air pressure, rainfall, relative humidity, temperature, wind direction, and wind speed performs the best when predicting the concentration of PM<sub>2.5</sub> with an RMSE of 6.82 µg/m<sup>3</sup>, MPE of 4.33 µg/m<sup>3</sup>, RPE of 22.50%, and R<sup>2</sup> of 0.93. The RF prediction model of PM<sub>2.5</sub> used in this research could support further studies of the long-term effects of PM<sub>2.5</sub> concentration on human health and related issues.</p>","PeriodicalId":23195,"journal":{"name":"Toxics","volume":"13 3","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11946178/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Toxics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.3390/toxics13030170","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to build, for the first time, a model that uses a machine learning (ML) approach to predict long-term retrospective PM2.5 concentrations in upper northern Thailand, a region impacted by biomass burning and transboundary pollution. The dataset includes PM10 levels, fire hotspots, and critical meteorological data from 1 January 2011 to 31 December 2020. ML techniques, namely multi-layer perceptron neural network (MLP), support vector machine (SVM), multiple linear regression (MLR), decision tree (DT), and random forests (RF), were used to construct the prediction models. The best ML prediction model was selected considering root mean square error (RMSE), mean prediction error (MPE), relative prediction error (RPE) (the lower, the better), and coefficient of determination (R2) (the bigger, the better). Our study found that the ML model-based RF technique using PM10, CO2, O3, fire hotspots, air pressure, rainfall, relative humidity, temperature, wind direction, and wind speed performs the best when predicting the concentration of PM2.5 with an RMSE of 6.82 µg/m3, MPE of 4.33 µg/m3, RPE of 22.50%, and R2 of 0.93. The RF prediction model of PM2.5 used in this research could support further studies of the long-term effects of PM2.5 concentration on human health and related issues.
ToxicsChemical Engineering-Chemical Health and Safety
CiteScore
4.50
自引率
10.90%
发文量
681
审稿时长
6 weeks
期刊介绍:
Toxics (ISSN 2305-6304) is an international, peer-reviewed, open access journal which provides an advanced forum for studies related to all aspects of toxic chemicals and materials. It publishes reviews, regular research papers, and short communications. Our aim is to encourage scientists to publish their experimental and theoretical results in detail. There is, therefore, no restriction on the maximum length of the papers, although authors should write their papers in a clear and concise way. The full experimental details must be provided so that the results can be reproduced. Electronic files or software regarding the full details of calculations and experimental procedure can be deposited as supplementary material, if it is not possible to publish them along with the text.