The Method to the Madness: The 2012 United States Presidential Election Twitter Corpus

Proceedings of the 7th 2016 International Conference on Social Media & Society Pub Date : 2016-07-11 DOI:10.1145/2930971.2930987

C. Mascaro, Denise E. Agosto, S. Goggins

{"title":"The Method to the Madness: The 2012 United States Presidential Election Twitter Corpus","authors":"C. Mascaro, Denise E. Agosto, S. Goggins","doi":"10.1145/2930971.2930987","DOIUrl":null,"url":null,"abstract":"Social media provides a rich environment for understanding social connections, interactions and information sharing across many aspects of society. The relative ease of access to social media data through provision of application program interface's (API) by social media companies has led to a significant number of studies that attempt to understand how social media fits into society and how the public uses it for discourse and information sharing. One of the existing gaps in these studies is the lack of extensive description of the data collection and processing methods. These gaps exist as a result of word limits in existing publication venues and a lack of appropriate publication venues to share this type of fundamental research. The following paper provides extensive detail as to how a 52 million corpus of Twitter data on the 2012 Presidential Election in the United States was collected, parsed and analyzed. This level of detail is imperative in studies of social media as small choices in what data to collect can have material effect on the findings. In addition to the description of the methods, the following paper provides a contribution to knowledge in providing basic characteristics of one of the largest research datasets of social media activity compiled to study political discourse.","PeriodicalId":227482,"journal":{"name":"Proceedings of the 7th 2016 International Conference on Social Media & Society","volume":"340 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th 2016 International Conference on Social Media & Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2930971.2930987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Social media provides a rich environment for understanding social connections, interactions and information sharing across many aspects of society. The relative ease of access to social media data through provision of application program interface's (API) by social media companies has led to a significant number of studies that attempt to understand how social media fits into society and how the public uses it for discourse and information sharing. One of the existing gaps in these studies is the lack of extensive description of the data collection and processing methods. These gaps exist as a result of word limits in existing publication venues and a lack of appropriate publication venues to share this type of fundamental research. The following paper provides extensive detail as to how a 52 million corpus of Twitter data on the 2012 Presidential Election in the United States was collected, parsed and analyzed. This level of detail is imperative in studies of social media as small choices in what data to collect can have material effect on the findings. In addition to the description of the methods, the following paper provides a contribution to knowledge in providing basic characteristics of one of the largest research datasets of social media activity compiled to study political discourse.

查看原文本刊更多论文

疯狂的方法:2012年美国总统大选推特语料库

社交媒体提供了一个丰富的环境来理解社会的各个方面的社会联系、互动和信息共享。社交媒体公司通过提供应用程序接口(API)相对容易地访问社交媒体数据，这导致了大量研究，试图了解社交媒体如何融入社会，以及公众如何使用它进行话语和信息共享。这些研究中存在的差距之一是缺乏对数据收集和处理方法的广泛描述。这些差距的存在是由于现有出版场所的字数限制和缺乏适当的出版场所来分享这类基础研究。下面的论文提供了关于如何收集、解析和分析2012年美国总统选举的5200万推特数据语料库的广泛细节。在社交媒体研究中，这种程度的细节是必要的，因为收集数据的小选择可能会对研究结果产生重大影响。除了对方法的描述之外，以下论文还提供了对知识的贡献，提供了用于研究政治话语的社交媒体活动的最大研究数据集之一的基本特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 7th 2016 International Conference on Social Media & Society

自引率

0.00%

发文量