Online Sequential Learning Based on Extreme Learning Machines for Particulate Matter Forecasting

Andres Bueno, G. P. Coelho, J. R. Bertini
{"title":"Online Sequential Learning Based on Extreme Learning Machines for Particulate Matter Forecasting","authors":"Andres Bueno, G. P. Coelho, J. R. Bertini","doi":"10.1109/BRACIS.2017.25","DOIUrl":null,"url":null,"abstract":"Microscopically small solid particles and liquid droplets suspended in the air, known as particulate matter (PM), may significantly affect not only human health but also urban, natural and agricultural systems. Therefore, it is imperative to keep the concentration levels of these pollutants below harmful thresholds. Forecasting tools based on machine learning have been used to estimate the concentration of PM and other pollutants in the atmosphere. However, PM data are uninterruptedly collected over time, thus producing a stream of data whose distribution may evolve over time. As traditional machine learning techniques do not have mechanisms to handle changes on data distribution at running time, they usually present limited prediction accuracy when facing such scenario. The overall goal of this work is to evaluate whether online sequential learning can improve the estimation accuracy of PM forecasting. To do so, online and offline algorithms based on Extreme Learning Machines (ELM) were compared, in order to evaluate their performance when applied to forecast hourly concentrations of PM. The experiments were performed using real-world data streams of PM concentration from different cities of the State of São Paulo, Brazil. The obtained results show not only that online sequential learning approaches lead to smaller mean squared errors but also that the stability of the results is enhanced when such approaches are combined in ensembles.","PeriodicalId":202240,"journal":{"name":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2017.25","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Microscopically small solid particles and liquid droplets suspended in the air, known as particulate matter (PM), may significantly affect not only human health but also urban, natural and agricultural systems. Therefore, it is imperative to keep the concentration levels of these pollutants below harmful thresholds. Forecasting tools based on machine learning have been used to estimate the concentration of PM and other pollutants in the atmosphere. However, PM data are uninterruptedly collected over time, thus producing a stream of data whose distribution may evolve over time. As traditional machine learning techniques do not have mechanisms to handle changes on data distribution at running time, they usually present limited prediction accuracy when facing such scenario. The overall goal of this work is to evaluate whether online sequential learning can improve the estimation accuracy of PM forecasting. To do so, online and offline algorithms based on Extreme Learning Machines (ELM) were compared, in order to evaluate their performance when applied to forecast hourly concentrations of PM. The experiments were performed using real-world data streams of PM concentration from different cities of the State of São Paulo, Brazil. The obtained results show not only that online sequential learning approaches lead to smaller mean squared errors but also that the stability of the results is enhanced when such approaches are combined in ensembles.
基于极限学习机的在线顺序学习在颗粒物预测中的应用
悬浮在空气中的微小固体颗粒和液滴,被称为颗粒物(PM),不仅会对人类健康产生重大影响,还会对城市、自然和农业系统产生重大影响。因此,必须将这些污染物的浓度水平控制在有害阈值以下。基于机器学习的预测工具已被用于估计大气中PM和其他污染物的浓度。然而,随着时间的推移,PM数据是不间断地收集的,因此产生了数据流,其分布可能随着时间的推移而变化。由于传统的机器学习技术不具备在运行时处理数据分布变化的机制,因此在面对这种情况时,它们的预测精度通常有限。这项工作的总体目标是评估在线顺序学习是否可以提高PM预测的估计精度。为此,比较了基于极限学习机(ELM)的在线和离线算法,以评估它们在预测PM每小时浓度时的性能。实验使用来自巴西圣保罗州不同城市的PM浓度的真实数据流进行。研究结果表明,在线顺序学习方法不仅可以减小均方误差,而且当这些方法在集成中组合时,结果的稳定性也得到了提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信