{"title":"高频社交媒体数据是否能改善低频消费者信心指标的预测?","authors":"Steven F. Lehrer, Tian Xie, T. Zeng","doi":"10.1093/jjfinec/nbz037","DOIUrl":null,"url":null,"abstract":"\n Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.","PeriodicalId":378066,"journal":{"name":"PSN: Communications (Topic)","volume":"81 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Does High Frequency Social Media Data Improve Forecasts of Low Frequency Consumer Confidence Measures?\",\"authors\":\"Steven F. Lehrer, Tian Xie, T. Zeng\",\"doi\":\"10.1093/jjfinec/nbz037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.\",\"PeriodicalId\":378066,\"journal\":{\"name\":\"PSN: Communications (Topic)\",\"volume\":\"81 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PSN: Communications (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/jjfinec/nbz037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PSN: Communications (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jjfinec/nbz037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Does High Frequency Social Media Data Improve Forecasts of Low Frequency Consumer Confidence Measures?
Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.