基于bert主题识别和情感分析的农产品期货价格预测新框架

IF 2.7 3区 经济学 Q1 ECONOMICS
Wensheng Wang, Yuxi Liu
{"title":"基于bert主题识别和情感分析的农产品期货价格预测新框架","authors":"Wensheng Wang,&nbsp;Yuxi Liu","doi":"10.1002/for.3278","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.</p>\n </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1969-1992"},"PeriodicalIF":2.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis\",\"authors\":\"Wensheng Wang,&nbsp;Yuxi Liu\",\"doi\":\"10.1002/for.3278\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.</p>\\n </div>\",\"PeriodicalId\":47835,\"journal\":{\"name\":\"Journal of Forecasting\",\"volume\":\"44 6\",\"pages\":\"1969-1992\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Forecasting\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/for.3278\",\"RegionNum\":3,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/for.3278","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

在中国的金融经济体系中,农产品期货市场在引导市场自我调节和为监管机构提供有效的信息传递方面发挥着重要作用。有效预测期货价格,有利于指导农业生产,监测价格大幅波动带来的经营风险,增强国家宏观调控政策的可预见性和针对性。本研究考察了粮食期货的主要品种——大豆期货,考虑了复杂的市场和非市场影响因素。以大豆期货历史市场数据和相关新闻标题为源数据,结合主题识别和情感分析技术,构建了一个融合主题情感的农产品期货价格预测框架。该模型利用BERTopic从农业新闻文本中提取主题信息,然后结合FinBERT构建基于主题的情绪特征,并将其与结构化市场特征融合,构建多特征输入的LSTM价格预测模型。为了更好地建模时间序列的短期特征和状态转移模式,进一步使用隐马尔可夫模型(HMM)提取隐藏状态,并将其与LSTM模型深度融合。实证结果表明,融合主题和情感特征的模型在所有滞后时间内都显著提高了预测精度,LSTM在短期预测中效果最好,HMM和LSTM结合在中长期预测中表现出显著的性能优势。与仅依赖市场特征的基线模型相比,主题情绪特征为价格预测提供了重要的增量信息,基于PI指标计算的每个主题情绪特征的贡献接近50%。此外,基于深度学习的预测模型在应对气候灾害、COVID-19大流行、俄乌冲突等极端外部冲击方面的表现优于基线机器学习模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis

In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.40
自引率
5.90%
发文量
91
期刊介绍: The Journal of Forecasting is an international journal that publishes refereed papers on forecasting. It is multidisciplinary, welcoming papers dealing with any aspect of forecasting: theoretical, practical, computational and methodological. A broad interpretation of the topic is taken with approaches from various subject areas, such as statistics, economics, psychology, systems engineering and social sciences, all encouraged. Furthermore, the Journal welcomes a wide diversity of applications in such fields as business, government, technology and the environment. Of particular interest are papers dealing with modelling issues and the relationship of forecasting systems to decision-making processes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信