研究改变arima模型参数对短数据集预报质量的影响

M. Fel’ker, V. Chesnov
{"title":"研究改变arima模型参数对短数据集预报质量的影响","authors":"M. Fel’ker, V. Chesnov","doi":"10.14529/ctcr210304","DOIUrl":null,"url":null,"abstract":"Time series, i.e. data collected at various times. The data collection segments may differ de-pending on the task. Time series are used for decision making. Time series analysis allows you to get some result that will determine the format of the decision. Time series analysis was carried out in very ancient times, for example, various calendars became a consequence of the analysis. Later, time series analysis was applied to study and forecast economic, social and other systems. Time se-ries appeared a long time ago. Once upon a time, ancient Babylonian astronomers, studying the po-sition of the stars, discovered the frequency of eclipses, which allowed them to predict their appearance in the future. Later, the analysis of time series, in a similar way, led to the creation of various calen-dars, for example, harvest calendars. In the future, in addition to natural areas, social and economic ones were added. Aim. Search for classification patterns of time series, allowing to understand whether it is possible to apply the ARIMA model for their short-term (3 counts) forecast. Materials and methods. Special software with ARIMA implementation and all need services is made. We examined 59 data sets with a short length and step equal a year, less than 20 values in the paper. The data was processed using Python libraries: Statsmodels and Pandas. The Dickey – Fuller test was used to de-termine the stationarity of the series. The stationarity of the time series allows for better forecasting. The Akaike information criterion was used to select the best model. Recommendations for a rea-sonable selection of parameters for adjusting ARIMA models are obtained. The dependence of the settings on the category of annual data set is shown. Conclusion. After processing the data, four categories (patterns) of year data sets were identified. Depending on the category ranges of parame-ters were selected for tuning ARIMA models. The suggested ranges will allow to determine the starting parameters for exploring similar datasets. Recommendations for improving the quality of post-forecast and forecast using the ARIMA model by adjusting the settings are given.","PeriodicalId":338904,"journal":{"name":"Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"STUDY OF THE INFLUENCE OF CHANGING THE PARAMETERS OF THE ARIMA MODEL ON THE QUALITY OF THE FORECAST FOR SHORT DATA SETS\",\"authors\":\"M. Fel’ker, V. Chesnov\",\"doi\":\"10.14529/ctcr210304\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Time series, i.e. data collected at various times. The data collection segments may differ de-pending on the task. Time series are used for decision making. Time series analysis allows you to get some result that will determine the format of the decision. Time series analysis was carried out in very ancient times, for example, various calendars became a consequence of the analysis. Later, time series analysis was applied to study and forecast economic, social and other systems. Time se-ries appeared a long time ago. Once upon a time, ancient Babylonian astronomers, studying the po-sition of the stars, discovered the frequency of eclipses, which allowed them to predict their appearance in the future. Later, the analysis of time series, in a similar way, led to the creation of various calen-dars, for example, harvest calendars. In the future, in addition to natural areas, social and economic ones were added. Aim. Search for classification patterns of time series, allowing to understand whether it is possible to apply the ARIMA model for their short-term (3 counts) forecast. Materials and methods. Special software with ARIMA implementation and all need services is made. We examined 59 data sets with a short length and step equal a year, less than 20 values in the paper. The data was processed using Python libraries: Statsmodels and Pandas. The Dickey – Fuller test was used to de-termine the stationarity of the series. The stationarity of the time series allows for better forecasting. The Akaike information criterion was used to select the best model. Recommendations for a rea-sonable selection of parameters for adjusting ARIMA models are obtained. The dependence of the settings on the category of annual data set is shown. Conclusion. After processing the data, four categories (patterns) of year data sets were identified. Depending on the category ranges of parame-ters were selected for tuning ARIMA models. The suggested ranges will allow to determine the starting parameters for exploring similar datasets. Recommendations for improving the quality of post-forecast and forecast using the ARIMA model by adjusting the settings are given.\",\"PeriodicalId\":338904,\"journal\":{\"name\":\"Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics\",\"volume\":\"78 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14529/ctcr210304\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14529/ctcr210304","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

时间序列,即在不同时间收集的数据。数据收集段可能因任务而异。时间序列用于决策。时间序列分析允许您获得一些将决定决策格式的结果。时间序列分析是在非常古老的时代进行的,例如,各种日历成为分析的结果。后来,时间序列分析被应用于经济、社会等系统的研究和预测。时间序列很久以前就出现了。从前,古巴比伦天文学家在研究恒星的位置时,发现了日食的频率,这使他们能够预测未来日食的出现。后来,对时间序列的分析,以类似的方式,导致了各种日历的创建,例如,收获日历。未来,除了自然区域,还将增加社会和经济区域。的目标。搜索时间序列的分类模式,以便了解是否有可能将ARIMA模型应用于他们的短期(3计数)预测。材料和方法。专门的软件与ARIMA实现和所有需要的服务。我们检查了59个短长度和步长等于一年的数据集,论文中不到20个值。数据使用Python库:statmodels和Pandas进行处理。Dickey - Fuller检验用于确定该系列的平稳性。时间序列的平稳性有利于更好的预测。采用赤池信息准则选择最佳模型。对合理选择ARIMA模型的参数提出了建议。显示了设置对年度数据集类别的依赖关系。结论。对数据进行处理后,确定了年份数据集的四类(模式)。根据类别选择参数范围进行ARIMA模型的调优。建议的范围将允许确定探索类似数据集的起始参数。提出了通过调整设置来提高ARIMA模型后预报和预报质量的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
STUDY OF THE INFLUENCE OF CHANGING THE PARAMETERS OF THE ARIMA MODEL ON THE QUALITY OF THE FORECAST FOR SHORT DATA SETS
Time series, i.e. data collected at various times. The data collection segments may differ de-pending on the task. Time series are used for decision making. Time series analysis allows you to get some result that will determine the format of the decision. Time series analysis was carried out in very ancient times, for example, various calendars became a consequence of the analysis. Later, time series analysis was applied to study and forecast economic, social and other systems. Time se-ries appeared a long time ago. Once upon a time, ancient Babylonian astronomers, studying the po-sition of the stars, discovered the frequency of eclipses, which allowed them to predict their appearance in the future. Later, the analysis of time series, in a similar way, led to the creation of various calen-dars, for example, harvest calendars. In the future, in addition to natural areas, social and economic ones were added. Aim. Search for classification patterns of time series, allowing to understand whether it is possible to apply the ARIMA model for their short-term (3 counts) forecast. Materials and methods. Special software with ARIMA implementation and all need services is made. We examined 59 data sets with a short length and step equal a year, less than 20 values in the paper. The data was processed using Python libraries: Statsmodels and Pandas. The Dickey – Fuller test was used to de-termine the stationarity of the series. The stationarity of the time series allows for better forecasting. The Akaike information criterion was used to select the best model. Recommendations for a rea-sonable selection of parameters for adjusting ARIMA models are obtained. The dependence of the settings on the category of annual data set is shown. Conclusion. After processing the data, four categories (patterns) of year data sets were identified. Depending on the category ranges of parame-ters were selected for tuning ARIMA models. The suggested ranges will allow to determine the starting parameters for exploring similar datasets. Recommendations for improving the quality of post-forecast and forecast using the ARIMA model by adjusting the settings are given.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信