{"title":"利用梯度树增强模型预测韩国PM10浓度","authors":"Khaula Qadeer, M. Jeon","doi":"10.1145/3387168.3387234","DOIUrl":null,"url":null,"abstract":"Particulate matter (PM) is a term generally used for very small particles and liquid droplets in the atmosphere. PM10 is the particle pollution with diameter less than or equal to 10 micrometers. Exposure to particle pollution is a public health hazard which leads to serious diseases such as asthma, bronchitis and even cancer; especially in elderly, children and sensitive people. It is crucial to predict the concentration of PM before-hand so that people can take precautionary measures and avoid the hazardous impact of pollution. These days the gradient boosting is one of popular methods in regression and classification tasks. In this study, we predict the PM10 concentration using Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) algorithms after combining meteorological, emission rate data and output features of Community Multi-Scale Air Quality (CMAQ) model. All the missing values are removed because handling them is quite challenging and requires feature engineering. The results show that XGBoost performs better than LightGBM in terms of prediction estimation with the RMSE of 12.846; but takes longer to train and tune the model's parameters. RMSE of LightGBM is 12.9066, which is slightly higher; but on the contrary, it is 29 times faster than XGBoost.","PeriodicalId":346739,"journal":{"name":"Proceedings of the 3rd International Conference on Vision, Image and Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Prediction of PM10 Concentration in South Korea Using Gradient Tree Boosting Models\",\"authors\":\"Khaula Qadeer, M. Jeon\",\"doi\":\"10.1145/3387168.3387234\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Particulate matter (PM) is a term generally used for very small particles and liquid droplets in the atmosphere. PM10 is the particle pollution with diameter less than or equal to 10 micrometers. Exposure to particle pollution is a public health hazard which leads to serious diseases such as asthma, bronchitis and even cancer; especially in elderly, children and sensitive people. It is crucial to predict the concentration of PM before-hand so that people can take precautionary measures and avoid the hazardous impact of pollution. These days the gradient boosting is one of popular methods in regression and classification tasks. In this study, we predict the PM10 concentration using Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) algorithms after combining meteorological, emission rate data and output features of Community Multi-Scale Air Quality (CMAQ) model. All the missing values are removed because handling them is quite challenging and requires feature engineering. The results show that XGBoost performs better than LightGBM in terms of prediction estimation with the RMSE of 12.846; but takes longer to train and tune the model's parameters. RMSE of LightGBM is 12.9066, which is slightly higher; but on the contrary, it is 29 times faster than XGBoost.\",\"PeriodicalId\":346739,\"journal\":{\"name\":\"Proceedings of the 3rd International Conference on Vision, Image and Signal Processing\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd International Conference on Vision, Image and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3387168.3387234\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd International Conference on Vision, Image and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3387168.3387234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of PM10 Concentration in South Korea Using Gradient Tree Boosting Models
Particulate matter (PM) is a term generally used for very small particles and liquid droplets in the atmosphere. PM10 is the particle pollution with diameter less than or equal to 10 micrometers. Exposure to particle pollution is a public health hazard which leads to serious diseases such as asthma, bronchitis and even cancer; especially in elderly, children and sensitive people. It is crucial to predict the concentration of PM before-hand so that people can take precautionary measures and avoid the hazardous impact of pollution. These days the gradient boosting is one of popular methods in regression and classification tasks. In this study, we predict the PM10 concentration using Extreme Gradient Boosting (XGBoost) and Light Gradient Boosting Machine (LightGBM) algorithms after combining meteorological, emission rate data and output features of Community Multi-Scale Air Quality (CMAQ) model. All the missing values are removed because handling them is quite challenging and requires feature engineering. The results show that XGBoost performs better than LightGBM in terms of prediction estimation with the RMSE of 12.846; but takes longer to train and tune the model's parameters. RMSE of LightGBM is 12.9066, which is slightly higher; but on the contrary, it is 29 times faster than XGBoost.