Anibal Flores, José Valeriano-Zapana, Victor Yana-Mamani, Hugo Tito-Chura
{"title":"PM2.5 prediction with Recurrent Neural Networks and Data Augmentation","authors":"Anibal Flores, José Valeriano-Zapana, Victor Yana-Mamani, Hugo Tito-Chura","doi":"10.1109/LA-CCI48322.2021.9769784","DOIUrl":null,"url":null,"abstract":"This paper presents three novel models based on recurrent neural networks (RNN) such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) for PM2.5 prediction with data augmentation (DA). The data augmentation technique is based on linear interpolation, it allows to find a linear function with each pair of items from the original time series. A space parameter allows to define the number of synthetic items to be generated, with this is possible to enlarge the original time series and improve the precision of the regression models. The baseline models as GRU, LSTM and GRU+LSTM got regular and bad prediction results, while the same ones with data augmentation as DA+GRU, DA+LSTM and DA+GRU+LSTM got excellent predictions showing the superiority of the proposals models. Likewise, according to the Mean Absolute Percentage Error (MAPE), the data augmentation allows to improve a regular GRU model by 18.6288% and bad models as LSTM and GRU+LSTM by 21.7683% and 31.0092% respectively.","PeriodicalId":431041,"journal":{"name":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI48322.2021.9769784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents three novel models based on recurrent neural networks (RNN) such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) for PM2.5 prediction with data augmentation (DA). The data augmentation technique is based on linear interpolation, it allows to find a linear function with each pair of items from the original time series. A space parameter allows to define the number of synthetic items to be generated, with this is possible to enlarge the original time series and improve the precision of the regression models. The baseline models as GRU, LSTM and GRU+LSTM got regular and bad prediction results, while the same ones with data augmentation as DA+GRU, DA+LSTM and DA+GRU+LSTM got excellent predictions showing the superiority of the proposals models. Likewise, according to the Mean Absolute Percentage Error (MAPE), the data augmentation allows to improve a regular GRU model by 18.6288% and bad models as LSTM and GRU+LSTM by 21.7683% and 31.0092% respectively.