Short-Term Hourly Ozone Concentration Forecasting Using Functional Data Approach

IF 1.4 Q3 ECONOMICS

Econometrics Pub Date : 2024-05-05 DOI:10.3390/econometrics12020012

Ismail Shah, Naveed Gul, Sajid Ali, Hassan Houmani

{"title":"Short-Term Hourly Ozone Concentration Forecasting Using Functional Data Approach","authors":"Ismail Shah, Naveed Gul, Sajid Ali, Hassan Houmani","doi":"10.3390/econometrics12020012","DOIUrl":null,"url":null,"abstract":"Air pollution, especially ground-level ozone, poses severe threats to human health and ecosystems. Accurate forecasting of ozone concentrations is essential for reducing its adverse effects. This study aims to use the functional time series approach to model ozone concentrations, a method less explored in the literature, and compare it with traditional time series and machine learning models. To this end, the ozone concentration hourly time series is first filtered for yearly seasonality using smoothing splines that lead us to the stochastic (residual) component. The stochastic component is modeled and forecast using a functional autoregressive model (FAR), where each daily ozone concentration profile is considered a single functional datum. For comparison purposes, different traditional and machine learning techniques, such as autoregressive integrated moving average (ARIMA), vector autoregressive (VAR), neural network autoregressive (NNAR), random forest (RF), and support vector machine (SVM), are also used to model and forecast the stochastic component. Once the forecast from the yearly seasonality component and stochastic component are obtained, both are added to obtain the final forecast. For empirical investigation, data consisting of hourly ozone measurements from Los Angeles from 2013 to 2017 are used, and one-day-ahead out-of-sample forecasts are obtained for a complete year. Based on the evaluation metrics, such as R2, root mean squared error (RMSE), and mean absolute error (MAE), the forecasting results indicate that the FAR outperforms the competitors in most scenarios, with the SVM model performing the least favorably across all cases.","PeriodicalId":11499,"journal":{"name":"Econometrics","volume":"13 1","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2024-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/econometrics12020012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

Abstract

Air pollution, especially ground-level ozone, poses severe threats to human health and ecosystems. Accurate forecasting of ozone concentrations is essential for reducing its adverse effects. This study aims to use the functional time series approach to model ozone concentrations, a method less explored in the literature, and compare it with traditional time series and machine learning models. To this end, the ozone concentration hourly time series is first filtered for yearly seasonality using smoothing splines that lead us to the stochastic (residual) component. The stochastic component is modeled and forecast using a functional autoregressive model (FAR), where each daily ozone concentration profile is considered a single functional datum. For comparison purposes, different traditional and machine learning techniques, such as autoregressive integrated moving average (ARIMA), vector autoregressive (VAR), neural network autoregressive (NNAR), random forest (RF), and support vector machine (SVM), are also used to model and forecast the stochastic component. Once the forecast from the yearly seasonality component and stochastic component are obtained, both are added to obtain the final forecast. For empirical investigation, data consisting of hourly ozone measurements from Los Angeles from 2013 to 2017 are used, and one-day-ahead out-of-sample forecasts are obtained for a complete year. Based on the evaluation metrics, such as R2, root mean squared error (RMSE), and mean absolute error (MAE), the forecasting results indicate that the FAR outperforms the competitors in most scenarios, with the SVM model performing the least favorably across all cases.

查看原文本刊更多论文

利用功能数据法预测短期每小时臭氧浓度

空气污染，尤其是地面臭氧，对人类健康和生态系统构成严重威胁。准确预测臭氧浓度对减少其不利影响至关重要。本研究旨在使用文献中较少探讨的函数时间序列方法来模拟臭氧浓度，并将其与传统的时间序列和机器学习模型进行比较。为此，首先使用平滑样条对臭氧浓度小时时间序列进行年度季节性过滤，从而得出随机（残差）成分。使用函数自回归模型（FAR）对随机部分进行建模和预测，其中每个日臭氧浓度曲线都被视为单一的函数基准。为了便于比较，还使用了不同的传统和机器学习技术，如自回归综合移动平均（ARIMA）、向量自回归（VAR）、神经网络自回归（NNAR）、随机森林（RF）和支持向量机（SVM），来模拟和预测随机成分。在获得年度季节性分量和随机分量的预测结果后，将二者相加以获得最终预测结果。在实证研究中，使用了 2013 年至 2017 年洛杉矶的臭氧小时测量数据，并获得了完整一年的提前一天样本外预测。根据 R2、均方根误差（RMSE）和平均绝对误差（MAE）等评价指标，预测结果表明，FAR 在大多数情况下都优于竞争对手，而 SVM 模型在所有情况下表现最差。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊