不同机器学习方法对近时信息对微观经济平衡时间序列数据预测的影响

ERN: Neural Networks & Related Topics (Topic) Pub Date : 2020-03-23 DOI:10.2139/ssrn.3559645

Frederik Collin, M. Kies

{"title":"不同机器学习方法对近时信息对微观经济平衡时间序列数据预测的影响","authors":"Frederik Collin, M. Kies","doi":"10.2139/ssrn.3559645","DOIUrl":null,"url":null,"abstract":"Instead of relying solely on data of a single time series it is possible to use information of parallel, similar time series to improve prediction quality. Our data set consists of microeconomic data of daily store deposits from a large number of different stores. We analyze how prediction performance regarding a given store can be increased by using data from other stores. First we compare several machine learning methods, including Elastic Nets, Partial Least Squares, Generalized Additive Models, Random Forests, Gradient Boosting and Neural Networks using only data of a single time series. Afterwards we show that Random Forests are able to better utilize parallel time series data compared to Partial Least Squares. Using near-time data of parallel time series is highly beneficial for prediction performance. To allow a fair comparison between different machine learning methods, we present a novel hyper-parameter optimization technique using a regression tree. It enables a fast and flexible determination of optimal parameters for a given method.","PeriodicalId":114865,"journal":{"name":"ERN: Neural Networks & Related Topics (Topic)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Impact of Near-Time Information for Prediction on Microeconomic Balanced Time Series Data using Different Machine Learning Methods\",\"authors\":\"Frederik Collin, M. Kies\",\"doi\":\"10.2139/ssrn.3559645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Instead of relying solely on data of a single time series it is possible to use information of parallel, similar time series to improve prediction quality. Our data set consists of microeconomic data of daily store deposits from a large number of different stores. We analyze how prediction performance regarding a given store can be increased by using data from other stores. First we compare several machine learning methods, including Elastic Nets, Partial Least Squares, Generalized Additive Models, Random Forests, Gradient Boosting and Neural Networks using only data of a single time series. Afterwards we show that Random Forests are able to better utilize parallel time series data compared to Partial Least Squares. Using near-time data of parallel time series is highly beneficial for prediction performance. To allow a fair comparison between different machine learning methods, we present a novel hyper-parameter optimization technique using a regression tree. It enables a fast and flexible determination of optimal parameters for a given method.\",\"PeriodicalId\":114865,\"journal\":{\"name\":\"ERN: Neural Networks & Related Topics (Topic)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Neural Networks & Related Topics (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3559645\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Neural Networks & Related Topics (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3559645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

可以利用平行的、相似的时间序列信息来提高预测质量，而不是仅仅依赖于单个时间序列的数据。我们的数据集由来自大量不同商店的日常商店存款的微观经济数据组成。我们分析了如何通过使用来自其他存储的数据来提高给定存储的预测性能。首先，我们比较了几种机器学习方法，包括弹性网络、偏最小二乘、广义可加模型、随机森林、梯度增强和仅使用单个时间序列数据的神经网络。之后，我们证明了随机森林与偏最小二乘法相比能够更好地利用平行时间序列数据。利用平行时间序列的近时数据，对提高预测性能非常有利。为了在不同的机器学习方法之间进行公平的比较，我们提出了一种使用回归树的新型超参数优化技术。它可以快速灵活地确定给定方法的最佳参数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Impact of Near-Time Information for Prediction on Microeconomic Balanced Time Series Data using Different Machine Learning Methods

Instead of relying solely on data of a single time series it is possible to use information of parallel, similar time series to improve prediction quality. Our data set consists of microeconomic data of daily store deposits from a large number of different stores. We analyze how prediction performance regarding a given store can be increased by using data from other stores. First we compare several machine learning methods, including Elastic Nets, Partial Least Squares, Generalized Additive Models, Random Forests, Gradient Boosting and Neural Networks using only data of a single time series. Afterwards we show that Random Forests are able to better utilize parallel time series data compared to Partial Least Squares. Using near-time data of parallel time series is highly beneficial for prediction performance. To allow a fair comparison between different machine learning methods, we present a novel hyper-parameter optimization technique using a regression tree. It enables a fast and flexible determination of optimal parameters for a given method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ERN: Neural Networks & Related Topics (Topic)

自引率

0.00%

发文量