{"title":"Postelection analysis of presidential election/poll data","authors":"Jiming Jiang, Yuanyuan Li, Peter X. K. Song","doi":"10.1214/22-aoas1707","DOIUrl":null,"url":null,"abstract":"This paper concerns analyses of the 2016 and 2020 U. S. presidential election data, including the data of pre-election polls and the actual elections. Our analyses unveil statistical evidence of discrepancy between the polls and real elections that is consistent across these two elections. Specifi-cally, the polls had consistently over-estimated advantages of the Democratic candidates, or, equivalently, under-estimated the true population support of the Republican candidate, Donald Trump, in both elections. The analyses are stratified by state, reflecting the U. S. electoral college system, by the means of small area estimation. We have found recurrent patterns suggesting that the polls have been underestimating the Republican candidate, especially in swing states of critical importance. Our findings also suggest an improvement of the 2020 polling methods to mitigate the size of underestimation. We show that a small-area model built upon the actual election data from one election can provide a better prediction than the poll-based projection to another election involving the same Republican candidate. Ranking of pollsters based on prediction bias using mixed model prediction is also considered.","PeriodicalId":188068,"journal":{"name":"The Annals of Applied Statistics","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aoas1707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper concerns analyses of the 2016 and 2020 U. S. presidential election data, including the data of pre-election polls and the actual elections. Our analyses unveil statistical evidence of discrepancy between the polls and real elections that is consistent across these two elections. Specifi-cally, the polls had consistently over-estimated advantages of the Democratic candidates, or, equivalently, under-estimated the true population support of the Republican candidate, Donald Trump, in both elections. The analyses are stratified by state, reflecting the U. S. electoral college system, by the means of small area estimation. We have found recurrent patterns suggesting that the polls have been underestimating the Republican candidate, especially in swing states of critical importance. Our findings also suggest an improvement of the 2020 polling methods to mitigate the size of underestimation. We show that a small-area model built upon the actual election data from one election can provide a better prediction than the poll-based projection to another election involving the same Republican candidate. Ranking of pollsters based on prediction bias using mixed model prediction is also considered.