C. Buntain, Richard Bonneau, Jonathan Nagler, Joshua A. Tucker
{"title":"Measuring the Ideology of Audiences for Web Links and Domains Using Differentially Private Engagement Data","authors":"C. Buntain, Richard Bonneau, Jonathan Nagler, Joshua A. Tucker","doi":"10.1609/icwsm.v17i1.22127","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the use of differentially private hyperlink-level engagement data for measuring ideologies of audiences for web domains, individual links, or aggregations thereof.\nWe examine a simple metric for measuring this ideological position and assess the conditions under which the metric is robust to injected, privacy-preserving noise.\nThis assessment provides insights into and constraints on the level of activity one should observe when applying this metric to privacy-protected data.\nGrounding this work is a massive dataset of social media engagement activity where privacy-preserving noise has been injected into the activity data, provided by Facebook and the Social Science One (SS1) consortium.\nUsing this dataset, we validate our ideology measures by comparing to similar, published work on sharing-based, homophily- and content-oriented measures, where we show consistently high correlation (>0.87).\nWe then apply this metric to individual links from several popular news domains and demonstrate how one can assess link-level distributions of ideological audiences.\nWe further show this estimator is robust to selection of engagement types besides sharing, where domain-level audience-ideology assessments based on views and likes show no significant difference compared to sharing-based estimates.\nEstimates of partisanship, however, suggest the viewing audience is more moderate than the audiences who share and like these domains.\nBeyond providing thresholds on sufficient activity for measuring audience ideology and comparing three types of engagement, this analysis provides a blueprint for ensuring robustness of future work to differential privacy protections.","PeriodicalId":175641,"journal":{"name":"International Conference on Web and Social Media","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Web and Social Media","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/icwsm.v17i1.22127","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper demonstrates the use of differentially private hyperlink-level engagement data for measuring ideologies of audiences for web domains, individual links, or aggregations thereof.
We examine a simple metric for measuring this ideological position and assess the conditions under which the metric is robust to injected, privacy-preserving noise.
This assessment provides insights into and constraints on the level of activity one should observe when applying this metric to privacy-protected data.
Grounding this work is a massive dataset of social media engagement activity where privacy-preserving noise has been injected into the activity data, provided by Facebook and the Social Science One (SS1) consortium.
Using this dataset, we validate our ideology measures by comparing to similar, published work on sharing-based, homophily- and content-oriented measures, where we show consistently high correlation (>0.87).
We then apply this metric to individual links from several popular news domains and demonstrate how one can assess link-level distributions of ideological audiences.
We further show this estimator is robust to selection of engagement types besides sharing, where domain-level audience-ideology assessments based on views and likes show no significant difference compared to sharing-based estimates.
Estimates of partisanship, however, suggest the viewing audience is more moderate than the audiences who share and like these domains.
Beyond providing thresholds on sufficient activity for measuring audience ideology and comparing three types of engagement, this analysis provides a blueprint for ensuring robustness of future work to differential privacy protections.