{"title":"Weighted Regression with Sybil Networks","authors":"Nihar Shah","doi":"arxiv-2408.17426","DOIUrl":null,"url":null,"abstract":"In many online domains, Sybil networks -- or cases where a single user\nassumes multiple identities -- is a pervasive feature. This complicates\nexperiments, as off-the-shelf regression estimators at least assume known\nnetwork topologies (if not fully independent observations) when Sybil network\ntopologies in practice are often unknown. The literature has exclusively\nfocused on techniques to detect Sybil networks, leading many experimenters to\nsubsequently exclude suspected networks entirely before estimating treatment\neffects. I present a more efficient solution in the presence of these suspected\nSybil networks: a weighted regression framework that applies weights based on\nthe probabilities that sets of observations are controlled by single actors. I\nshow in the paper that the MSE-minimizing solution is to set the weight matrix\nequal to the inverse of the expected network topology. I demonstrate the\nmethodology on simulated data, and then I apply the technique to a competition\nwith suspected Sybil networks run on the Sui blockchain and show reductions in\nthe standard error of the estimate by 6 - 24%.","PeriodicalId":501293,"journal":{"name":"arXiv - ECON - Econometrics","volume":"24 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.17426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In many online domains, Sybil networks -- or cases where a single user
assumes multiple identities -- is a pervasive feature. This complicates
experiments, as off-the-shelf regression estimators at least assume known
network topologies (if not fully independent observations) when Sybil network
topologies in practice are often unknown. The literature has exclusively
focused on techniques to detect Sybil networks, leading many experimenters to
subsequently exclude suspected networks entirely before estimating treatment
effects. I present a more efficient solution in the presence of these suspected
Sybil networks: a weighted regression framework that applies weights based on
the probabilities that sets of observations are controlled by single actors. I
show in the paper that the MSE-minimizing solution is to set the weight matrix
equal to the inverse of the expected network topology. I demonstrate the
methodology on simulated data, and then I apply the technique to a competition
with suspected Sybil networks run on the Sui blockchain and show reductions in
the standard error of the estimate by 6 - 24%.