{"title":"基于bert主题识别和情感分析的农产品期货价格预测新框架","authors":"Wensheng Wang, Yuxi Liu","doi":"10.1002/for.3278","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.</p>\n </div>","PeriodicalId":47835,"journal":{"name":"Journal of Forecasting","volume":"44 6","pages":"1969-1992"},"PeriodicalIF":2.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis\",\"authors\":\"Wensheng Wang, Yuxi Liu\",\"doi\":\"10.1002/for.3278\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.</p>\\n </div>\",\"PeriodicalId\":47835,\"journal\":{\"name\":\"Journal of Forecasting\",\"volume\":\"44 6\",\"pages\":\"1969-1992\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Forecasting\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/for.3278\",\"RegionNum\":3,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/for.3278","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
A Novel Framework for Agricultural Futures Price Prediction With BERT-Based Topic Identification and Sentiment Analysis
In China's financial and economic system, the agricultural futures market plays an important role in guiding the market to self regulate and providing efficient information transmission for regulators. The effective prediction of futures prices can assist in guiding agricultural production, monitoring operational risks arising from significant price fluctuations, and enhancing the predictability and pertinence of the country's macroeconomic regulation policies. This study investigates the main variety of grain futures—soybean futures, taking into account complex market and non-market influencing factors. Using historical market data and related news headlines of soybean futures as source data and integrating topic identification and sentiment analysis techniques, a novel framework for predicting agricultural futures prices that integrates topic sentiment is constructed. This model uses BERTopic to extract topic information from agricultural news texts, then integrates FinBERT to construct topic-based sentiment features, fuses them with structured market features, and constructs LSTM price prediction model with multi-feature inputs. In order to better model the short-term features and state transfer patterns of the time series, hidden Markov model (HMM) is further used to extract the hidden states, which are deeply fused with the LSTM model. The empirical results show that the model fusing topic and sentiment features significantly improves the forecasting accuracy in all lags, LSTM works best in short-term forecasting, and the combination of HMM and LSTM exhibits significant performance advantages in medium- and long-term forecasting. Compared with the baseline model that relies only on market features, topic sentiment features provide important incremental information for price forecasting, and the contribution of each topic sentiment feature calculated based on the PI metric is close to 50%. In addition, deep learning–based prediction model performs better than baseline machine learning models in dealing with extreme external shocks such as climate disasters, the COVID-19 pandemic, and the Russia–Ukraine conflict.
期刊介绍:
The Journal of Forecasting is an international journal that publishes refereed papers on forecasting. It is multidisciplinary, welcoming papers dealing with any aspect of forecasting: theoretical, practical, computational and methodological. A broad interpretation of the topic is taken with approaches from various subject areas, such as statistics, economics, psychology, systems engineering and social sciences, all encouraged. Furthermore, the Journal welcomes a wide diversity of applications in such fields as business, government, technology and the environment. Of particular interest are papers dealing with modelling issues and the relationship of forecasting systems to decision-making processes.