An Unsupervised Density Based Clustering Algorithm to Detect Election Anomalies : Evidence from Georgia’s Largest County

Khurram Yamin, Matthew Oswald, Nima Jadali, Yao Xie, E. Zegura, D. Nazzal
{"title":"An Unsupervised Density Based Clustering Algorithm to Detect Election Anomalies : Evidence from Georgia’s Largest County","authors":"Khurram Yamin, Matthew Oswald, Nima Jadali, Yao Xie, E. Zegura, D. Nazzal","doi":"10.1145/3530190.3534799","DOIUrl":null,"url":null,"abstract":"The 2020 election was fraught with allegations of fraud. To respond to a lack of a robust method to investigate these allegations, we propose a multi-step clustering based approach. We first solve a regression problem to find a group of influential variables, then cluster on these variables to get a set of precincts that should have similar election results. Re-clustering each cluster shows us the outliers. We then apply the approach to Fulton County, Georgia’s largest county and an epicenter of allegations of corruption and fraud. We show that the level of fraud detected is not significant and would not be enough to change the election results in Georgia. In fact, the majority of the precincts that showed to be anomalous were ones where Trump received more votes than was expected. We also validate our analysis through application to the 2015 Argentina National Election.","PeriodicalId":257424,"journal":{"name":"ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3530190.3534799","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The 2020 election was fraught with allegations of fraud. To respond to a lack of a robust method to investigate these allegations, we propose a multi-step clustering based approach. We first solve a regression problem to find a group of influential variables, then cluster on these variables to get a set of precincts that should have similar election results. Re-clustering each cluster shows us the outliers. We then apply the approach to Fulton County, Georgia’s largest county and an epicenter of allegations of corruption and fraud. We show that the level of fraud detected is not significant and would not be enough to change the election results in Georgia. In fact, the majority of the precincts that showed to be anomalous were ones where Trump received more votes than was expected. We also validate our analysis through application to the 2015 Argentina National Election.
一种基于无监督密度的聚类算法检测选举异常:来自乔治亚州最大县的证据
2020年的选举充满了欺诈指控。为了应对缺乏一个强大的方法来调查这些指控,我们提出了一个基于多步骤聚类的方法。我们首先解决一个回归问题,找到一组有影响的变量,然后对这些变量进行聚类,得到一组应该具有相似选举结果的选区。重新聚类每个聚类显示我们的异常值。然后,我们将这种方法应用于富尔顿县,这是佐治亚州最大的县,也是腐败和欺诈指控的中心。我们的研究表明,发现的舞弊程度并不严重,不足以改变格鲁吉亚的选举结果。事实上,大多数显示出异常的选区都是特朗普获得比预期更多选票的选区。我们还通过应用于2015年阿根廷全国大选来验证我们的分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信