{"title":"基于叠加的集合模型预测PM2.5浓度","authors":"Haoyuan Zhang, Yilun Jin, Jiaxuan Shi, Shuai Zhang","doi":"10.11648/j.acm.20211006.14","DOIUrl":null,"url":null,"abstract":": With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.","PeriodicalId":55503,"journal":{"name":"Applied and Computational Mathematics","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting PM2.5 Concentrations Using Stacking-based Ensemble Model\",\"authors\":\"Haoyuan Zhang, Yilun Jin, Jiaxuan Shi, Shuai Zhang\",\"doi\":\"10.11648/j.acm.20211006.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.\",\"PeriodicalId\":55503,\"journal\":{\"name\":\"Applied and Computational Mathematics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied and Computational Mathematics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.11648/j.acm.20211006.14\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied and Computational Mathematics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.11648/j.acm.20211006.14","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
Predicting PM2.5 Concentrations Using Stacking-based Ensemble Model
: With the increasingly serious air pollution problem, PM2.5 concentration, as an effective indicator to evaluate air quality, has attracted extensive attention from all sectors of society. Accurate prediction of PM2.5 concentrations is of great significance in providing the public with early air pollution warning information to protect public health. With a decade of development, artificial intelligence technology has given birth to various prediction models with high-performance, in particular, brought new impetus to the prediction of PM2.5 concentrations. In this study, a stacking-based ensemble model with self-adaptive hyper-parameter optimization is proposed to solve the PM2.5 concentrations prediction problem. First, the raw data are preprocessed with the normalization method to reduce the influence of the different orders of magnitude of input variables on model performance. Second, the Bayesian optimization method is used to optimize the hyper-parameters of the base predictors to improve their performance. Finally, a stacking ensemble method is applied to integrate the optimized base predictors into an ensemble model for final prediction. In the experiments, two datasets from the air quality stations in different areas are tested with four metrics to evaluate the performance of the proposed model in PM2.5 concentration prediction. The experimental results show that the proposed model outperforms other baseline models in solving the PM2.5 concentrations prediction problem.
期刊介绍:
Applied and Computational Mathematics (ISSN Online: 2328-5613, ISSN Print: 2328-5605) is a prestigious journal that focuses on the field of applied and computational mathematics. It is driven by the computational revolution and places a strong emphasis on innovative applied mathematics with potential for real-world applicability and practicality.
The journal caters to a broad audience of applied mathematicians and scientists who are interested in the advancement of mathematical principles and practical aspects of computational mathematics. Researchers from various disciplines can benefit from the diverse range of topics covered in ACM. To ensure the publication of high-quality content, all research articles undergo a rigorous peer review process. This process includes an initial screening by the editors and anonymous evaluation by expert reviewers. This guarantees that only the most valuable and accurate research is published in ACM.