{"title":"A data-filtered sentiment analysis model for economic forecasting","authors":"Wanwan Zheng , Kunihiko Hara","doi":"10.1016/j.dajour.2025.100629","DOIUrl":null,"url":null,"abstract":"<div><div>In behavioral economics, sentiments influence decision-making processes, with positive sentiments tending to underestimate risks and negative sentiments overestimate them. At the individual level, these sentiments shape economic behavior in ways that collectively influence broader economic dynamics. With the proliferation of textual data and advancements in natural language processing techniques, the analysis of economic sentiments has garnered growing attention, with large language models showing superior analytical capabilities. However, unlike many machine learning tasks where true labels are available through human annotation, sentiment analysis encounters challenges in obtaining true labels due to the psychological biases and inconsistencies inherent in human assessments. To address this issue, this study introduced a data filtering methodology to enhance data reliability and developed a robust sentiment analysis model tailored to the Japanese economy. The findings revealed that our model not only outperforms existing models in terms of generalization capability across diverse datasets — achieving RMSE values of 0.09–0.11 and classification accuracies of 0.83–0.88 — but also effectively captures fluctuations in other quantitative economic indicators, as evidenced by Euclidean distances of up to 1.56, which is smaller than the records 4.33 and 4.24 of the existing models. Moreover, a statistically significant correlation between qualitative and quantitative economic indicators was identified, highlighting the potential of qualitative indicators in predicting economic conditions.</div></div>","PeriodicalId":100357,"journal":{"name":"Decision Analytics Journal","volume":"16 ","pages":"Article 100629"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Analytics Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772662225000852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In behavioral economics, sentiments influence decision-making processes, with positive sentiments tending to underestimate risks and negative sentiments overestimate them. At the individual level, these sentiments shape economic behavior in ways that collectively influence broader economic dynamics. With the proliferation of textual data and advancements in natural language processing techniques, the analysis of economic sentiments has garnered growing attention, with large language models showing superior analytical capabilities. However, unlike many machine learning tasks where true labels are available through human annotation, sentiment analysis encounters challenges in obtaining true labels due to the psychological biases and inconsistencies inherent in human assessments. To address this issue, this study introduced a data filtering methodology to enhance data reliability and developed a robust sentiment analysis model tailored to the Japanese economy. The findings revealed that our model not only outperforms existing models in terms of generalization capability across diverse datasets — achieving RMSE values of 0.09–0.11 and classification accuracies of 0.83–0.88 — but also effectively captures fluctuations in other quantitative economic indicators, as evidenced by Euclidean distances of up to 1.56, which is smaller than the records 4.33 and 4.24 of the existing models. Moreover, a statistically significant correlation between qualitative and quantitative economic indicators was identified, highlighting the potential of qualitative indicators in predicting economic conditions.