{"title":"基于时间分层混合MLR-RNN方法的大气污染预测","authors":"Khetam Alzubaidy, O. Hannon","doi":"10.33899/iqjoss.2021.169962","DOIUrl":null,"url":null,"abstract":"studying and forecasting Particular matter (PM10) is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution may effect on PM10 variable. This type of dataset can be classified as anonlinear. Studied datasets have been taken from climate station in Malaysia. Multiple Linear Regression (MLR) is used as alinear statistical method for PM10 forecasting through its influencing by corresponding climate variables, therefore it may reflect inaccurate results when used with nonlinear datasets. Time stratified (TS) method in different styles is implemental for satisfying more homogeneity of datasets. It includes ordering similar seasons in different years together to formulate anew variable smoother than their original. To improve the results of forecasting, Recurrent Neural Network (RNN) has been suggested to be used after combining with MLR in hybrid MLR-RNN method in this study. In general, the results of forecasting were the best with using time stratified approach. In addition, the results of hybrid method were outperformed comparing to MLR model. As conclusion in this study, RNN and TS can be used as active approaches to obtain better forecasting results with nonlinear datasets in which PM10 is to dependent variable.","PeriodicalId":351789,"journal":{"name":"IRAQI JOURNAL OF STATISTICAL SCIENCES","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Air Pollution Forecasting using Hybrid MLR-RNN Method with Time-Stratified Method\",\"authors\":\"Khetam Alzubaidy, O. Hannon\",\"doi\":\"10.33899/iqjoss.2021.169962\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"studying and forecasting Particular matter (PM10) is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution may effect on PM10 variable. This type of dataset can be classified as anonlinear. Studied datasets have been taken from climate station in Malaysia. Multiple Linear Regression (MLR) is used as alinear statistical method for PM10 forecasting through its influencing by corresponding climate variables, therefore it may reflect inaccurate results when used with nonlinear datasets. Time stratified (TS) method in different styles is implemental for satisfying more homogeneity of datasets. It includes ordering similar seasons in different years together to formulate anew variable smoother than their original. To improve the results of forecasting, Recurrent Neural Network (RNN) has been suggested to be used after combining with MLR in hybrid MLR-RNN method in this study. In general, the results of forecasting were the best with using time stratified approach. In addition, the results of hybrid method were outperformed comparing to MLR model. As conclusion in this study, RNN and TS can be used as active approaches to obtain better forecasting results with nonlinear datasets in which PM10 is to dependent variable.\",\"PeriodicalId\":351789,\"journal\":{\"name\":\"IRAQI JOURNAL OF STATISTICAL SCIENCES\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IRAQI JOURNAL OF STATISTICAL SCIENCES\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33899/iqjoss.2021.169962\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IRAQI JOURNAL OF STATISTICAL SCIENCES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33899/iqjoss.2021.169962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
研究和预报可吸入颗粒物(PM10)是控制和减少其对环境和人体健康危害的必要条件。有许多污染物作为空气污染源可能影响PM10变量。这种类型的数据集可以归类为非线性。所研究的数据集取自马来西亚气象站。多元线性回归(Multiple Linear Regression, MLR)是利用相应气候变量对PM10进行预测的线性统计方法,在非线性数据集上可能反映出不准确的结果。为满足数据集的同质性,实现了不同风格的时间分层方法。它包括将不同年份的相似季节排列在一起,以形成比原始变量更平滑的新变量。为了提高预测效果,本研究建议将递归神经网络(Recurrent Neural Network, RNN)与MLR结合,在混合MLR-RNN方法中使用。总体而言,采用时间分层方法预测效果最好。此外,混合方法的结果优于MLR模型。本研究的结论是,在PM10为因变量的非线性数据集上,RNN和TS可以作为主动方法获得更好的预测结果。
Air Pollution Forecasting using Hybrid MLR-RNN Method with Time-Stratified Method
studying and forecasting Particular matter (PM10) is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution may effect on PM10 variable. This type of dataset can be classified as anonlinear. Studied datasets have been taken from climate station in Malaysia. Multiple Linear Regression (MLR) is used as alinear statistical method for PM10 forecasting through its influencing by corresponding climate variables, therefore it may reflect inaccurate results when used with nonlinear datasets. Time stratified (TS) method in different styles is implemental for satisfying more homogeneity of datasets. It includes ordering similar seasons in different years together to formulate anew variable smoother than their original. To improve the results of forecasting, Recurrent Neural Network (RNN) has been suggested to be used after combining with MLR in hybrid MLR-RNN method in this study. In general, the results of forecasting were the best with using time stratified approach. In addition, the results of hybrid method were outperformed comparing to MLR model. As conclusion in this study, RNN and TS can be used as active approaches to obtain better forecasting results with nonlinear datasets in which PM10 is to dependent variable.