Vasilis Kostakis, Alex Pazaitis, Minas V. Liarokapis
{"title":"Beyond high-tech versus low-tech: A tentative framework for sustainable urban data governance","authors":"Vasilis Kostakis, Alex Pazaitis, Minas V. Liarokapis","doi":"10.1177/20539517231180583","DOIUrl":"https://doi.org/10.1177/20539517231180583","url":null,"abstract":"Technological imaginaries have been increasingly shaping the future perceptions of cities. From artificial intelligence and distributed ledger technology to three-dimensional printing, high-tech artifacts are very often the premises of such imaginaries. However, technology does not only refer to artifacts. Technology also encompasses the processes around the artifacts: how the artifacts are designed, manufactured, used, maintained, and disposed. From this perspective, high-tech visions often disregard problems that pertain to resource extraction, labor exploitation, energy use, and material flows. On the contrary, low-tech and localized alternatives incite lower impact and higher resilience visions. However, they fail to offer solutions of the desired scale and intensity. To address this tension, we provide an alternative vision for mid-tech: a balance between the opposite extreme qualities of low-tech and high-tech. Through a case of open-source prosthetics, we illustrate how to synergistically combine the efficiency and versatility of high-tech solutions with the potential for autonomy and resilience that low-tech offers. Then we discuss a mid-tech approach for distributed ledger technology from a city as a license lens. We provide connections with existing or conceptual applications to show how distributed ledger technology could support more socially and ecologically responsible data practices for city governance.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47547610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"All WARC and no playback: The materialities of data-centered web archives research","authors":"Emily Maemura","doi":"10.1177/20539517231163172","DOIUrl":"https://doi.org/10.1177/20539517231163172","url":null,"abstract":"This paper examines the Web ARChive (WARC) file format, revealing how the format has come to play a central role in the development and standardization of interoperable tools and methods for the international web archiving community. In the context of emerging big data approaches, I consider the sociotechnical relationships between material construction of data and information infrastructures for collecting and research. Analysis is inspired by Star and Griesemer's historical case of the Museum of Vertebrate Zoology which reveals how boundary objects and methods standardization are used to enroll actors in the work of collecting for natural history. I extend these concepts by pairing them with frameworks for studying digital materiality and the representational qualities of data artifacts. Through examples drawn from fieldwork observations studying two data-centered research projects, I consider how the materiality of the WARC format influences research methods and approaches to data extraction, selection, and transformation. Findings identify three modalities researchers use to configure WARC data for researcher needs: using indexes to support search queries, constructing derivative formats designed for certain types of analysis, and generating custom-designed datasets tailored for specific research purposes. Findings additionally reveal similarities in how these distinct methods approach automated data extraction by relying upon the WARC's standardized metadata elements. By interrogating whose information needs are being met and taken into account in the design of the WARC's underlying information representation, I reveal effects on the emerging field of web history, and consider alternative approaches to knowledge production with archived web data.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47568379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Google, data voids, and the dynamics of the politics of exclusion","authors":"Ov Cristian Norocel, D. Lewandowski","doi":"10.1177/20539517221149099","DOIUrl":"https://doi.org/10.1177/20539517221149099","url":null,"abstract":"This study deploys a critical approach to big data analytics to gauge the tentative contours of data voids in Google searches that reflect extreme-right dynamics of exclusion in the aftermath of the 2015 humanitarian crisis in Europe. The study adds complexity to the analysis of data voids, expanding the framework of investigation outside the USA context by concentrating on Germany and Sweden. Building on previous big data analytics addressing the politics of exclusion, the study proposes a catalogue of queries concerning the issue of migration in both Germany and Sweden on a continuum from mainstream to extreme-right vocabularies. This catalogue of queries enables specific and localized queries to identify data voids. The results show that a search engine's reliance on source popularity may lead to extreme-right sources appearing in top positions. Furthermore, using platforms for user-generated content provides a way for localized queries to gain top positions.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49172525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Judgments as bulk data","authors":"V. Janeček","doi":"10.1177/20539517231160527","DOIUrl":"https://doi.org/10.1177/20539517231160527","url":null,"abstract":"Should court judgments be publicly available for text and data mining purposes? This article shows that the arguments for and against access to judgments conflate different understandings of what judgments are. On one view, judgments are seen as a ‘jurisprudential’ category, whereas the other view regards them as something ‘factual’. Once it is understood that these views and the claims based on them do not fight over the same territory, it should be easier to make judgments more widely available, including for the purposes of computational analysis of judgments as bulk data. The purpose of this article is to help to clear the ground for the debate around access to judgments as bulk data and highlight some relevant considerations for the preferred licencing regime concerning judgments.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45671842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The role of evidence-based misogyny in antifeminist online communities of the ‘manosphere’","authors":"A. Rothermel","doi":"10.1177/20539517221145671","DOIUrl":"https://doi.org/10.1177/20539517221145671","url":null,"abstract":"In recent years, there have been a growing number of online and offline attacks linked to a loosely connected network of misogynist and antifeminist online communities called ‘the manosphere’. Since 2016, the ideas spread among and by groups of the manosphere have also become more closely aligned with those of other Far-Right online networks. In this commentary, I explore the role of what I term ‘evidence-based misogyny’ for mobilization and radicalization into the antifeminist and misogynist subcultures of the manosphere. Evidence-based misogyny is a discursive strategy, whereby members of the manosphere refer to (and misinterpret) knowledge in the form of statistics, studies, news items and pop-culture and mimic accepted methods of knowledge presentation to support their essentializing, polarizing views about gender relations in society. Evidence-based misogyny is a core aspect for manosphere-related mobilization as it provides a false sense of authority and forges a collective identity, which is framed as a supposed ‘alternative’ to mainstream gender knowledge. Due to its core function to justify and confirm the misogynist sentiments of users, evidence-based misogyny serves as connector between the manosphere and both mainstream conservative as well as other Far-Right and conspiratorial discourses.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41929117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Imaginaries of better administration: Renegotiating the relationship between citizens and digital public power","authors":"Terhi Esko, Riikka Koulu","doi":"10.1177/20539517231164113","DOIUrl":"https://doi.org/10.1177/20539517231164113","url":null,"abstract":"This article investigates future visions of digital public administration as they appear within a particular regulatory process that aims to enable automated decision-making (ADM) in public administration in Finland. By drawing on science and technology studies, public administration studies, and socio-legal studies we analyze law in the making and identify four imaginaries of public digital administration: understandable administration, self-monitoring administration, adaptive administration, and responsible administration. We argue that digital administration is seen from the perspective of public authorities serving their current needs of legitimizing existing automation practices. While technology is pictured as unproblematic, the citizen perspective is missing. We conclude that the absence of an in-depth understanding of the diverse needs of citizens raises the question whether the relationship between public power and citizens is becoming a one-way street despite of the public administration ideals that express values of citizen engagement.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49464048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Smart corruption: Satirical strategies for gaming accountability","authors":"Ritwick Ghosh, H. Faxon","doi":"10.1177/20539517231164119","DOIUrl":"https://doi.org/10.1177/20539517231164119","url":null,"abstract":"Although new forms of data can be used to hold power to account, they also grant the powerful new resources to game accountability. We dub the latter behavior “smart corruption.” The concept highlights the possibility of appropriating algorithms, infrastructures, and data publics to accumulate benefits and obscure responsibility while leaning into the positive associations of transparency. Unlike conventional forms of corruption, smart corruption is disguised as progressive, and is thus difficult to spot or analyze through existing legal or ethical frameworks. To illustrate, we outline a satirical strategy for gaming accountability. Identifying the particular mechanisms and outcomes of transgressive activities carried out under the veneer of data-driven transparency, as well as the key actors and organizations most active in gaming accountability, is an important research and political project.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47661542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"‘I’ve left enough data’: Relations between people and data and the production of surveillance","authors":"Hwankyung Janet Lee","doi":"10.1177/20539517231173904","DOIUrl":"https://doi.org/10.1177/20539517231173904","url":null,"abstract":"Exploring emergent relations between data-producing individuals and their data products, this study aims to contribute to the ongoing scholarly discussion on agencies in data practices. It focuses on shifts in surveillance structure in the era of Big Data, in which the individual becomes both a subject and an object in the production of data surveillance. Drawing on the concept of the ‘dividual’, the study analyses data practices for a tracing system invented by the South Korean government during the COVID-19 pandemic, with findings from field research conducted with 11 research participants in various urban sites in Seoul. Highlighting how the tracing system positioned surveillance ‘in the hands of citizens’, the study exposes the complexities of the relations that the participants formed with the data they produced, and how they reflexively reappropriated their practices through alterations and deflections on the basis of their tacit knowledge and imaginaries concerning digital data and their constituent positions in the knowledge production system. The resultant expression of surveillance was directly shaped by the evolving relationship between the producers (participants) and products (digital data). The study proposes that an intersectional focus on surveillance and critical data studies, with close attention to ordinary people's relations with data, has the capacity to inquire into the politics of data more fully.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45576144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"When research is the context: Cross-platform user expectations for social media data reuse","authors":"Sarah A. Gilbert, Katie Shilton, Jessica Vitak","doi":"10.1177/20539517231164108","DOIUrl":"https://doi.org/10.1177/20539517231164108","url":null,"abstract":"Social media provides unique opportunities for researchers to learn about a variety of phenomena—it is often publicly available, highly accessible, and affords more naturalistic observation. However, as research using social media data has increased, so too has public scrutiny, highlighting the need to develop ethical approaches to social media data use. Prior work in this area has explored users’ perceptions of researchers’ use of social media data in the context of a single platform. In this paper, we expand on that work, exploring how platforms and their affordances impact how users feel about social media data reuse. We present results from three factorial vignette surveys, each focusing on a different platform—dating apps, Instagram, and Reddit—to assess users’ comfort with research data use scenarios across a variety of contexts. Although our results highlight different expectations between platforms depending on the research domain, purpose of research, and content collected, we find that the factor with the greatest impact across all platforms is consent—a finding which presents challenges for big data researchers. We conclude by offering a sociotechnical approach to ethical decision-making. This approach provides recommendations on how researchers can interpret and respond to platform norms and affordances to predict potential data use sensitivities. The approach also recommends that researchers respond to the predominant expectation of notification and consent for research participation by bolstering awareness of data collection on digital platforms.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47167028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Manipulative tactics are the norm in political emails: Evidence from 300K emails from the 2020 US election cycle","authors":"Arunesh Mathur, Angelina Wang, Carsten Schwemmer, Maia Hamin, Brandon M Stewart, Arvind Narayanan","doi":"10.1177/20539517221145371","DOIUrl":"https://doi.org/10.1177/20539517221145371","url":null,"abstract":"We collect and analyze a corpus of more than 300,000 political emails sent during the 2020 US election cycle. These emails were sent by over 3000 political campaigns and organizations including federal and state level candidates as well as Political Action Committees. We find that in this corpus, manipulative tactics—techniques using some level of deception or clickbait—are the norm, not the exception. We measure six specific tactics senders use to nudge recipients to open emails. Three of these tactics—“dark patterns”—actively deceive recipients through the email user interface, for example, by formatting “from:” fields so that they create the false impression the message is a continuation of an ongoing conversation. The median active sender uses such tactics 5% of the time. The other three tactics, like sensationalistic clickbait—used by the median active sender 37% of the time—are not directly deceptive, but instead, exploit recipients’ curiosity gap and impose pressure to open emails. This can further expose recipients to deception in the email body, such as misleading claims of matching donations. Furthermore, by collecting emails from different locations in the US, we show that senders refine these tactics through A/B testing. Finally, we document disclosures of email addresses between senders in violation of privacy policies and recipients’ expectations. Cumulatively, these tactics undermine voters’ autonomy and welfare, exacting a particularly acute cost for those with low digital literacy. We offer the complete corpus of emails at https://electionemails2020.org for journalists and academics, which we hope will support future work.","PeriodicalId":47834,"journal":{"name":"Big Data & Society","volume":" ","pages":""},"PeriodicalIF":8.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45283674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}