{"title":"Should asset managers pay for economic research? A machine learning evaluation","authors":"Krzysztof Rybinski","doi":"10.1016/j.jfds.2020.08.001","DOIUrl":null,"url":null,"abstract":"<div><p>This paper presents the first-ever comparison of the forecasting power of two types of narratives: articles in a major daily newspaper and regular research reports released by professional forecasters. The applied testing methodology developed in <sup>22</sup> and extended in this paper includes two natural language processing (NLP) techniques – the sentiment analysis and the wordscores model – that are used to convert the text corpora into the NLP indices. These indices are explanatory variables in linear regression, Granger causality test, vector autoregressive model and random forest model. The paper extends this methodology by applying Latent Dirichlet Allocation (LDA) to the newspaper corpus to filter out articles that discuss topics not relevant for economic and financial analysis. The forecasting test is conducted for two major banks in Poland – BZ WBK and mbank and for major daily newspaper Rzeczpospolita, in Polish. It is shown that mbank narratives have the best forecasting power, while BZ WBK and Rzeczpospolita trade second and third place depending on the model applied. In the vast majority of analyzed cases adding an NLP index to the model improves the forecast accuracy. The answer to the title question is – it depends. Before paying for economic research asset managers are advised to apply methods such as presented in this paper to evaluate whether sell-side research offers any forecasting value in comparison with a newspaper.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":"6 ","pages":"Pages 31-48"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.jfds.2020.08.001","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Finance and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405918820300131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents the first-ever comparison of the forecasting power of two types of narratives: articles in a major daily newspaper and regular research reports released by professional forecasters. The applied testing methodology developed in 22 and extended in this paper includes two natural language processing (NLP) techniques – the sentiment analysis and the wordscores model – that are used to convert the text corpora into the NLP indices. These indices are explanatory variables in linear regression, Granger causality test, vector autoregressive model and random forest model. The paper extends this methodology by applying Latent Dirichlet Allocation (LDA) to the newspaper corpus to filter out articles that discuss topics not relevant for economic and financial analysis. The forecasting test is conducted for two major banks in Poland – BZ WBK and mbank and for major daily newspaper Rzeczpospolita, in Polish. It is shown that mbank narratives have the best forecasting power, while BZ WBK and Rzeczpospolita trade second and third place depending on the model applied. In the vast majority of analyzed cases adding an NLP index to the model improves the forecast accuracy. The answer to the title question is – it depends. Before paying for economic research asset managers are advised to apply methods such as presented in this paper to evaluate whether sell-side research offers any forecasting value in comparison with a newspaper.