Proposal of a methodology for prediction of heavy metals concentration based on PM2.5 concentration and meteorological variables using machine learning
Shin-Young Park, Hye-Won Lee, Jaymin Kwon, Sung-Won Yoon, Cheol-Min Lee
{"title":"Proposal of a methodology for prediction of heavy metals concentration based on PM2.5 concentration and meteorological variables using machine learning","authors":"Shin-Young Park, Hye-Won Lee, Jaymin Kwon, Sung-Won Yoon, Cheol-Min Lee","doi":"10.1007/s44273-024-00029-w","DOIUrl":null,"url":null,"abstract":"<div><p>In this study, we developed a prediction model for heavy metal concentrations using PM<sub>2.5</sub> concentrations and meteorological variables. Data was collected from five sites, encompassing meteorological factors, PM<sub>2.5</sub>, and 18 metals over 2 years. The study employed four analytical methods: multiple linear regression (MLR), random forest regression (RFR), gradient boosting, and artificial neural networks (ANN). RFR was the best predictor for most metals, and gradient boosting and ANN were optimal for certain metals like Al, Cu, As, Mo, Zn, and Cd. Upon evaluating the final model’s predicted values against the actual measurements, differences in the concentration distribution between measurement locations were observed for Mn, Fe, Cu, Ba, and Pb, indicating varying prediction performances among sites. Additionally, Al, As, Cd, and Ba showed significant differences in prediction performance across seasons. The developed model is expected to overcome the technical limitations involved in measuring and analyzing heavy metal concentrations. It could further be utilized to obtain fundamental data for studying the health effects of exposure to hazardous substances such as heavy metals.</p></div>","PeriodicalId":45358,"journal":{"name":"Asian Journal of Atmospheric Environment","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s44273-024-00029-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Atmospheric Environment","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s44273-024-00029-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, we developed a prediction model for heavy metal concentrations using PM2.5 concentrations and meteorological variables. Data was collected from five sites, encompassing meteorological factors, PM2.5, and 18 metals over 2 years. The study employed four analytical methods: multiple linear regression (MLR), random forest regression (RFR), gradient boosting, and artificial neural networks (ANN). RFR was the best predictor for most metals, and gradient boosting and ANN were optimal for certain metals like Al, Cu, As, Mo, Zn, and Cd. Upon evaluating the final model’s predicted values against the actual measurements, differences in the concentration distribution between measurement locations were observed for Mn, Fe, Cu, Ba, and Pb, indicating varying prediction performances among sites. Additionally, Al, As, Cd, and Ba showed significant differences in prediction performance across seasons. The developed model is expected to overcome the technical limitations involved in measuring and analyzing heavy metal concentrations. It could further be utilized to obtain fundamental data for studying the health effects of exposure to hazardous substances such as heavy metals.