Combining random forest and graph wavenet for spatial-temporal data prediction

Chong Chen;Yanbo Xu;Jixuan Zhao;Lulu Chen;Yaru Xue
{"title":"Combining random forest and graph wavenet for spatial-temporal data prediction","authors":"Chong Chen;Yanbo Xu;Jixuan Zhao;Lulu Chen;Yaru Xue","doi":"10.23919/ICN.2022.0024","DOIUrl":null,"url":null,"abstract":"The prosperity of deep learning has revolutionized many machine learning tasks (such as image recognition, natural language processing, etc.). With the widespread use of autonomous sensor networks, the Internet of Things, and crowd sourcing to monitor real-world processes, the volume, diversity, and veracity of spatial-temporal data are expanding rapidly. However, traditional methods have their limitation in coping with spatial-temporal dependencies, which either incorporate too much data from weakly connected locations or ignore the relationships between those interrelated but geographically separated regions. In this paper, a novel deep learning model (termed RF-GWN) is proposed by combining Random Forest (RF) and Graph WaveNet (GWN). In RF-GWN, a new adaptive weight matrix is formulated by combining Variable Importance Measure (VIM) of RF with the long time series feature extraction ability of GWN in order to capture potential spatial dependencies and extract long-term dependencies from the input data. Furthermore, two experiments are conducted on two real-world datasets with the purpose of predicting traffic flow and groundwater level. Baseline models are implemented by Diffusion Convolutional Recurrent Neural Network (DCRNN), Spatial-Temporal GCN (ST-GCN), and GWN to verify the effectiveness of the RF-GWN. The Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) are selected as performance criteria. The results show that the proposed model can better capture the spatial-temporal relationships, the prediction performance on the METR-LA dataset is slightly improved, and the index of the prediction task on the PEMS-BAY dataset is significantly improved. These improvements are extended to the groundwater dataset, which can effectively improve the prediction accuracy. Thus, the applicability and effectiveness of the proposed model RF-GWN in both traffic flow and groundwater level prediction are demonstrated.","PeriodicalId":100681,"journal":{"name":"Intelligent and Converged Networks","volume":"3 4","pages":"364-377"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9195266/10026509/10026523.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent and Converged Networks","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10026523/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The prosperity of deep learning has revolutionized many machine learning tasks (such as image recognition, natural language processing, etc.). With the widespread use of autonomous sensor networks, the Internet of Things, and crowd sourcing to monitor real-world processes, the volume, diversity, and veracity of spatial-temporal data are expanding rapidly. However, traditional methods have their limitation in coping with spatial-temporal dependencies, which either incorporate too much data from weakly connected locations or ignore the relationships between those interrelated but geographically separated regions. In this paper, a novel deep learning model (termed RF-GWN) is proposed by combining Random Forest (RF) and Graph WaveNet (GWN). In RF-GWN, a new adaptive weight matrix is formulated by combining Variable Importance Measure (VIM) of RF with the long time series feature extraction ability of GWN in order to capture potential spatial dependencies and extract long-term dependencies from the input data. Furthermore, two experiments are conducted on two real-world datasets with the purpose of predicting traffic flow and groundwater level. Baseline models are implemented by Diffusion Convolutional Recurrent Neural Network (DCRNN), Spatial-Temporal GCN (ST-GCN), and GWN to verify the effectiveness of the RF-GWN. The Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) are selected as performance criteria. The results show that the proposed model can better capture the spatial-temporal relationships, the prediction performance on the METR-LA dataset is slightly improved, and the index of the prediction task on the PEMS-BAY dataset is significantly improved. These improvements are extended to the groundwater dataset, which can effectively improve the prediction accuracy. Thus, the applicability and effectiveness of the proposed model RF-GWN in both traffic flow and groundwater level prediction are demonstrated.
随机森林与图波网络相结合的时空数据预测
深度学习的繁荣彻底改变了许多机器学习任务(如图像识别、自然语言处理等)。随着自主传感器网络、物联网和众包的广泛使用来监控现实世界的过程,时空数据的数量、多样性和准确性正在迅速扩大。然而,传统的方法在处理时空依赖关系时存在局限性,这些方法要么包含了太多来自弱连接位置的数据,要么忽略了地理上相互关联但又分离的区域之间的关系。本文将随机森林(Random Forest, RF)和图波网(Graph WaveNet, GWN)相结合,提出了一种新的深度学习模型RF-GWN。在RF-GWN中,将RF的变重要度量(VIM)与GWN的长时间序列特征提取能力相结合,建立了一种新的自适应权重矩阵,以捕获潜在的空间依赖关系,并从输入数据中提取长期依赖关系。此外,为了预测交通流量和地下水位,在两个真实数据集上进行了两个实验。通过扩散卷积递归神经网络(DCRNN)、时空GCN (ST-GCN)和GWN实现基线模型,验证了RF-GWN的有效性。选择均方根误差(RMSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)作为性能标准。结果表明,该模型能更好地捕捉时空关系,在met - la数据集上的预测性能略有提高,在PEMS-BAY数据集上的预测任务指标有显著提高。将这些改进推广到地下水数据集,可以有效提高预测精度。验证了该模型在交通流预测和地下水位预测中的适用性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信