时间序列预测的统计，机器学习和深度学习模型的综合评估

2022 7th International Conference on Data Science and Machine Learning Applications (CDMA) Pub Date : 2022-03-01 DOI:10.1109/CDMA54072.2022.00014

A. Xuan, Mengmeng Yin, Yupei Li, Xiyu Chen, Zhenliang Ma

{"title":"时间序列预测的统计，机器学习和深度学习模型的综合评估","authors":"A. Xuan, Mengmeng Yin, Yupei Li, Xiyu Chen, Zhenliang Ma","doi":"10.1109/CDMA54072.2022.00014","DOIUrl":null,"url":null,"abstract":"How to choose the appropriate model to predict the time series is one of the most prominent activities of temporal data analysis. Empirical evidence is often adopted to select the most suitable model since there is no unified standard for matching data and models. Data characteristics affect model performance to a certain extent and maybe where the factors that determine the balance between prediction accuracy and model complexity are. In this article, Multi-Criteria Performance Measure method considering Mean of Absolute Value of the Residual Autocorrelation was adopted to address this problem. Case studies apply Time-Series Analysis decomposing datasets into trend, seasonality and residue and summarize the limitations and recommendations from the stochasticity of the residue. The results show that the statistical models perform best for datasets with low stochasticity, deep learning models specialize in forecasting fluctuant and long-term time series data, machine learning models could be candidates for datasets that possess numerical characters between the previous two categories. Conclusions could provide suggestions in selecting appropriate models and guide the research community in focusing the effort on more feasible or promising directions.","PeriodicalId":313042,"journal":{"name":"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A comprehensive evaluation of statistical, machine learning and deep learning models for time series prediction\",\"authors\":\"A. Xuan, Mengmeng Yin, Yupei Li, Xiyu Chen, Zhenliang Ma\",\"doi\":\"10.1109/CDMA54072.2022.00014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"How to choose the appropriate model to predict the time series is one of the most prominent activities of temporal data analysis. Empirical evidence is often adopted to select the most suitable model since there is no unified standard for matching data and models. Data characteristics affect model performance to a certain extent and maybe where the factors that determine the balance between prediction accuracy and model complexity are. In this article, Multi-Criteria Performance Measure method considering Mean of Absolute Value of the Residual Autocorrelation was adopted to address this problem. Case studies apply Time-Series Analysis decomposing datasets into trend, seasonality and residue and summarize the limitations and recommendations from the stochasticity of the residue. The results show that the statistical models perform best for datasets with low stochasticity, deep learning models specialize in forecasting fluctuant and long-term time series data, machine learning models could be candidates for datasets that possess numerical characters between the previous two categories. Conclusions could provide suggestions in selecting appropriate models and guide the research community in focusing the effort on more feasible or promising directions.\",\"PeriodicalId\":313042,\"journal\":{\"name\":\"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CDMA54072.2022.00014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDMA54072.2022.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

如何选择合适的模型进行时间序列预测是时间数据分析的重要内容之一。由于数据和模型的匹配没有统一的标准，因此通常采用经验证据来选择最合适的模型。数据特征在一定程度上影响模型的性能，也可能是决定预测精度和模型复杂性之间平衡的因素所在。本文采用考虑残差自相关绝对值均值的多准则性能度量方法来解决这一问题。案例研究应用时间序列分析将数据集分解为趋势、季节性和残差，并从残差的随机性中总结出局限性和建议。结果表明，统计模型对低随机性的数据集表现最好，深度学习模型专门用于预测波动和长期时间序列数据，机器学习模型可以用于预测介于前两类之间的数值特征的数据集。结论可以为选择合适的模型提供建议，并指导研究界将精力集中在更可行或更有前景的方向上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A comprehensive evaluation of statistical, machine learning and deep learning models for time series prediction

How to choose the appropriate model to predict the time series is one of the most prominent activities of temporal data analysis. Empirical evidence is often adopted to select the most suitable model since there is no unified standard for matching data and models. Data characteristics affect model performance to a certain extent and maybe where the factors that determine the balance between prediction accuracy and model complexity are. In this article, Multi-Criteria Performance Measure method considering Mean of Absolute Value of the Residual Autocorrelation was adopted to address this problem. Case studies apply Time-Series Analysis decomposing datasets into trend, seasonality and residue and summarize the limitations and recommendations from the stochasticity of the residue. The results show that the statistical models perform best for datasets with low stochasticity, deep learning models specialize in forecasting fluctuant and long-term time series data, machine learning models could be candidates for datasets that possess numerical characters between the previous two categories. Conclusions could provide suggestions in selecting appropriate models and guide the research community in focusing the effort on more feasible or promising directions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 7th International Conference on Data Science and Machine Learning Applications (CDMA)

自引率

0.00%

发文量