Crowd Prediction Systems: Markets, Polls, and Elite Forecasters

Proceedings of the 23rd ACM Conference on Economics and Computation Pub Date : 2022-07-12 DOI:10.1145/3490486.3538265

P. Atanasov, Jens Witkowski, B. Mellers, P. Tetlock

{"title":"Crowd Prediction Systems: Markets, Polls, and Elite Forecasters","authors":"P. Atanasov, Jens Witkowski, B. Mellers, P. Tetlock","doi":"10.1145/3490486.3538265","DOIUrl":null,"url":null,"abstract":"Crowd prediction systems, such as prediction markets, provide the infrastructure to elicit and combine the predictions from a group (“crowd”) of forecasters. In contrast to data-driven approaches, crowd predictions are especially useful in settings with little historical data, such as in new product development, vaccine trials, pandemics, or geopolitical events. Our contributions in this area are threefold. First, we provide an experimental evaluation of two popular types of prediction market architectures: continuous double auction (CDA) markets and logarithmic market scoring rules (LMSR) markets. To the best of our knowledge, we are the first to study these methods in a large, randomized experiment. Prior research reporting on CDA and LMSR market performance did not compare the two designs directly but had separate sets of questions for each [2]. Using data from over 1300 forecasters and 147 forecasting questions, we find that the LMSR market achieves higher accuracy than the CDA market. The LMSR market achieves 14% lower Brier scores (MCDA = 0.245, SDCDA = 0.327 versus MLMSR = 0.211, SDLMSR = 0.280; t(146) = 2.28, p = 0.024). In exploratory analyses, we find that the better performance of the LMSR market appears particularly pronounced for questions that attracted few traders as well as early in a question when only few traders had placed orders on the question. Relative to LMSR, the CDA market underperformed in thin-market settings, consistent with Robin Hanson’s motivation for the LMSR market mechanism. Second, we quantify the impact of prediction system architecture and individual forecaster track record on aggregate performance. Previous research studied how the performance of CDA prediction markets and prediction polls compares when populated by sub-elite forecasters [1] while most previous work on elite forecasters has only examined their individual performance [3]. We are the first to compare the aggregate performance of small, elite forecaster crowds across two prediction systems: LMSR prediction markets and team prediction polls. Moreover, we compare the aggregate accuracy of elite forecaster crowds to larger, sub-elite crowds using the same prediction","PeriodicalId":209859,"journal":{"name":"Proceedings of the 23rd ACM Conference on Economics and Computation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 23rd ACM Conference on Economics and Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3490486.3538265","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Crowd prediction systems, such as prediction markets, provide the infrastructure to elicit and combine the predictions from a group (“crowd”) of forecasters. In contrast to data-driven approaches, crowd predictions are especially useful in settings with little historical data, such as in new product development, vaccine trials, pandemics, or geopolitical events. Our contributions in this area are threefold. First, we provide an experimental evaluation of two popular types of prediction market architectures: continuous double auction (CDA) markets and logarithmic market scoring rules (LMSR) markets. To the best of our knowledge, we are the first to study these methods in a large, randomized experiment. Prior research reporting on CDA and LMSR market performance did not compare the two designs directly but had separate sets of questions for each [2]. Using data from over 1300 forecasters and 147 forecasting questions, we find that the LMSR market achieves higher accuracy than the CDA market. The LMSR market achieves 14% lower Brier scores (MCDA = 0.245, SDCDA = 0.327 versus MLMSR = 0.211, SDLMSR = 0.280; t(146) = 2.28, p = 0.024). In exploratory analyses, we find that the better performance of the LMSR market appears particularly pronounced for questions that attracted few traders as well as early in a question when only few traders had placed orders on the question. Relative to LMSR, the CDA market underperformed in thin-market settings, consistent with Robin Hanson’s motivation for the LMSR market mechanism. Second, we quantify the impact of prediction system architecture and individual forecaster track record on aggregate performance. Previous research studied how the performance of CDA prediction markets and prediction polls compares when populated by sub-elite forecasters [1] while most previous work on elite forecasters has only examined their individual performance [3]. We are the first to compare the aggregate performance of small, elite forecaster crowds across two prediction systems: LMSR prediction markets and team prediction polls. Moreover, we compare the aggregate accuracy of elite forecaster crowds to larger, sub-elite crowds using the same prediction

查看原文本刊更多论文

人群预测系统:市场、民意调查和精英预测者

人群预测系统，如预测市场，提供了从一群预测者(“人群”)中引出和组合预测的基础设施。与数据驱动的方法相比，人群预测在历史数据很少的情况下特别有用，例如在新产品开发、疫苗试验、流行病或地缘政治事件中。我们在这方面的贡献有三方面。首先，我们对两种流行的预测市场架构进行了实验评估:连续双拍卖(CDA)市场和对数市场评分规则(LMSR)市场。据我们所知，我们是第一个在大型随机实验中研究这些方法的人。先前关于CDA和LMSR市场表现的研究报告并没有直接比较两种设计，而是为每个bbb单独设置了一组问题。使用来自1300多名预测者和147个预测问题的数据，我们发现LMSR市场比CDA市场具有更高的准确性。LMSR市场Brier评分低14% (MCDA = 0.245, SDCDA = 0.327, MLMSR = 0.211, SDLMSR = 0.280;T (146) = 2.28, p = 0.024)。在探索性分析中，我们发现LMSR市场的良好表现对于那些吸引了很少交易者的问题以及在只有很少交易者对该问题下订单的问题的早期表现尤为明显。相对于LMSR, CDA市场在薄市场环境下表现不佳，这与Robin Hanson关于LMSR市场机制的动机一致。其次，我们量化了预测系统架构和个体预测者跟踪记录对总体性能的影响。先前的研究研究了在由次精英预测者组成的情况下，CDA预测市场和预测民意调查的表现如何比较，而之前大多数关于精英预测者的工作只研究了他们的个人表现[3]。我们首先比较了两种预测系统(LMSR预测市场和团队预测民意调查)中小型精英预测人群的总体表现。此外，我们比较了精英预测者群体与使用相同预测的更大的、次精英群体的总准确性

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 23rd ACM Conference on Economics and Computation

自引率

0.00%

发文量