The Method to the Madness: The 2012 United States Presidential Election Twitter Corpus

C. Mascaro, Denise E. Agosto, S. Goggins
{"title":"The Method to the Madness: The 2012 United States Presidential Election Twitter Corpus","authors":"C. Mascaro, Denise E. Agosto, S. Goggins","doi":"10.1145/2930971.2930987","DOIUrl":null,"url":null,"abstract":"Social media provides a rich environment for understanding social connections, interactions and information sharing across many aspects of society. The relative ease of access to social media data through provision of application program interface's (API) by social media companies has led to a significant number of studies that attempt to understand how social media fits into society and how the public uses it for discourse and information sharing. One of the existing gaps in these studies is the lack of extensive description of the data collection and processing methods. These gaps exist as a result of word limits in existing publication venues and a lack of appropriate publication venues to share this type of fundamental research. The following paper provides extensive detail as to how a 52 million corpus of Twitter data on the 2012 Presidential Election in the United States was collected, parsed and analyzed. This level of detail is imperative in studies of social media as small choices in what data to collect can have material effect on the findings. In addition to the description of the methods, the following paper provides a contribution to knowledge in providing basic characteristics of one of the largest research datasets of social media activity compiled to study political discourse.","PeriodicalId":227482,"journal":{"name":"Proceedings of the 7th 2016 International Conference on Social Media & Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th 2016 International Conference on Social Media & Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2930971.2930987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Social media provides a rich environment for understanding social connections, interactions and information sharing across many aspects of society. The relative ease of access to social media data through provision of application program interface's (API) by social media companies has led to a significant number of studies that attempt to understand how social media fits into society and how the public uses it for discourse and information sharing. One of the existing gaps in these studies is the lack of extensive description of the data collection and processing methods. These gaps exist as a result of word limits in existing publication venues and a lack of appropriate publication venues to share this type of fundamental research. The following paper provides extensive detail as to how a 52 million corpus of Twitter data on the 2012 Presidential Election in the United States was collected, parsed and analyzed. This level of detail is imperative in studies of social media as small choices in what data to collect can have material effect on the findings. In addition to the description of the methods, the following paper provides a contribution to knowledge in providing basic characteristics of one of the largest research datasets of social media activity compiled to study political discourse.
疯狂的方法:2012年美国总统大选推特语料库
社交媒体提供了一个丰富的环境来理解社会的各个方面的社会联系、互动和信息共享。社交媒体公司通过提供应用程序接口(API)相对容易地访问社交媒体数据,这导致了大量研究,试图了解社交媒体如何融入社会,以及公众如何使用它进行话语和信息共享。这些研究中存在的差距之一是缺乏对数据收集和处理方法的广泛描述。这些差距的存在是由于现有出版场所的字数限制和缺乏适当的出版场所来分享这类基础研究。下面的论文提供了关于如何收集、解析和分析2012年美国总统选举的5200万推特数据语料库的广泛细节。在社交媒体研究中,这种程度的细节是必要的,因为收集数据的小选择可能会对研究结果产生重大影响。除了对方法的描述之外,以下论文还提供了对知识的贡献,提供了用于研究政治话语的社交媒体活动的最大研究数据集之一的基本特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信