{"title":"Econometric forecasting using ubiquitous news text: Text-enhanced factor model","authors":"Beomseok Seo","doi":"10.1016/j.ijforecast.2024.11.001","DOIUrl":null,"url":null,"abstract":"<div><div>News text is gaining increasing attention as a novel source for econometric forecasting. This paper revisits how narrative information is incorporated into econometric forecasting by effectively quantifying sector-specific textual information without requiring training data. We propose <em>Theme Frequency Indices</em> (TFIs), which utilize domain-specific subject-predicate patterns to measure public perception about the economy. TFIs for 15 sectors, including production, inflation, employment, capital investment, stock and house prices, and others, were examined and integrated into the <em>Text-enhanced Factor Model</em> (TFM), using latent factor structures. Empirical analysis based on over 18 million news articles from Korea reveals that TFM improves the accuracy of near-term GDP forecasts, demonstrating that simple text-mining techniques combined with domain knowledge can effectively leverage qualitative information in the news without costly training. The proposed method is applicable to a wide range of subjects for utilizing narrative information on the economy, offering a rapid and cost-effective approach.</div></div>","PeriodicalId":14061,"journal":{"name":"International Journal of Forecasting","volume":"41 3","pages":"Pages 1055-1072"},"PeriodicalIF":6.9000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Forecasting","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169207024001055","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
News text is gaining increasing attention as a novel source for econometric forecasting. This paper revisits how narrative information is incorporated into econometric forecasting by effectively quantifying sector-specific textual information without requiring training data. We propose Theme Frequency Indices (TFIs), which utilize domain-specific subject-predicate patterns to measure public perception about the economy. TFIs for 15 sectors, including production, inflation, employment, capital investment, stock and house prices, and others, were examined and integrated into the Text-enhanced Factor Model (TFM), using latent factor structures. Empirical analysis based on over 18 million news articles from Korea reveals that TFM improves the accuracy of near-term GDP forecasts, demonstrating that simple text-mining techniques combined with domain knowledge can effectively leverage qualitative information in the news without costly training. The proposed method is applicable to a wide range of subjects for utilizing narrative information on the economy, offering a rapid and cost-effective approach.
期刊介绍:
The International Journal of Forecasting is a leading journal in its field that publishes high quality refereed papers. It aims to bridge the gap between theory and practice, making forecasting useful and relevant for decision and policy makers. The journal places strong emphasis on empirical studies, evaluation activities, implementation research, and improving the practice of forecasting. It welcomes various points of view and encourages debate to find solutions to field-related problems. The journal is the official publication of the International Institute of Forecasters (IIF) and is indexed in Sociological Abstracts, Journal of Economic Literature, Statistical Theory and Method Abstracts, INSPEC, Current Contents, UMI Data Courier, RePEc, Academic Journal Guide, CIS, IAOR, and Social Sciences Citation Index.