高频社交媒体数据是否能改善低频消费者信心指标的预测?

PSN: Communications (Topic) Pub Date : 2019-11-01 DOI:10.1093/jjfinec/nbz037

Steven F. Lehrer, Tian Xie, T. Zeng

{"title":"高频社交媒体数据是否能改善低频消费者信心指标的预测?","authors":"Steven F. Lehrer, Tian Xie, T. Zeng","doi":"10.1093/jjfinec/nbz037","DOIUrl":null,"url":null,"abstract":"\n Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.","PeriodicalId":378066,"journal":{"name":"PSN: Communications (Topic)","volume":"81 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Does High Frequency Social Media Data Improve Forecasts of Low Frequency Consumer Confidence Measures?\",\"authors\":\"Steven F. Lehrer, Tian Xie, T. Zeng\",\"doi\":\"10.1093/jjfinec/nbz037\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.\",\"PeriodicalId\":378066,\"journal\":{\"name\":\"PSN: Communications (Topic)\",\"volume\":\"81 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PSN: Communications (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/jjfinec/nbz037\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PSN: Communications (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jjfinec/nbz037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

社交媒体数据给预测者带来了挑战，因为人们必须将文本转换为数据，并处理与这些以不同频率和数量收集的指标相关的问题，而不是传统的金融数据。在本文中，我们使用深度学习算法以小时为基础测量Twitter消息中的情绪，并引入一种进行混合数据采样(MIDAS)的新方法，该方法允许对历史数据进行较弱的折扣，这非常适合这个新数据源。为了评估方法相对于替代MIDAS策略的性能，我们使用传统计量经济学策略和机器学习算法对消费者信心指数进行了样本外预测练习。无论用于进行预测的估计器是什么，我们的结果表明:(i)包括来自Twitter的消费者情绪测量大大提高了预测准确性;(ii)相对于常见的替代方案，我们提出的MIDAS程序有实质性的收益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Does High Frequency Social Media Data Improve Forecasts of Low Frequency Consumer Confidence Measures?

Social media data present challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this article, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake mixed data sampling (MIDAS) that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PSN: Communications (Topic)

自引率

0.00%

发文量