Social media data

A. Tear, H. Southall
{"title":"Social media data","authors":"A. Tear, H. Southall","doi":"10.2307/j.ctvmd84wn.12","DOIUrl":null,"url":null,"abstract":"The increasing availability of huge volumes of social media ‘Big Data’ from Facebook, Flickr, Instagram, Twitter and other social network platforms, combined with the development of software designed to operate at web scale, has fuelled the growth of computational social science. Often analysed by ‘data scientists’, social media data differ substantially from the datasets officially disseminated as by-products of government-sponsored activity, such as population censuses or administrative data, which have long been analysed by professional statisticians. This chapter outlines the characteristics of social media data and identifies key data sources and methods of data capture, introducing several of the technologies used to acquire, store, query, visualise and augment social media data. Unrepresentativeness of, and lack of (geo)demographic control in, social media data are problematic for population-based research. These limitations, alongside wider epistemological and ethical concerns surrounding data validity, inadvertent co-option into research and protection of user privacy, suggest that caution should be exercised when analysing social media datasets. While care must be taken to respect personal privacy and sample assiduously, this chapter concludes that statisticians, who may be unfamiliar with some of the programmatic steps involved in accessing social media data, must play a pivotal role in analysing it.","PeriodicalId":103233,"journal":{"name":"Data in Society","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2307/j.ctvmd84wn.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

The increasing availability of huge volumes of social media ‘Big Data’ from Facebook, Flickr, Instagram, Twitter and other social network platforms, combined with the development of software designed to operate at web scale, has fuelled the growth of computational social science. Often analysed by ‘data scientists’, social media data differ substantially from the datasets officially disseminated as by-products of government-sponsored activity, such as population censuses or administrative data, which have long been analysed by professional statisticians. This chapter outlines the characteristics of social media data and identifies key data sources and methods of data capture, introducing several of the technologies used to acquire, store, query, visualise and augment social media data. Unrepresentativeness of, and lack of (geo)demographic control in, social media data are problematic for population-based research. These limitations, alongside wider epistemological and ethical concerns surrounding data validity, inadvertent co-option into research and protection of user privacy, suggest that caution should be exercised when analysing social media datasets. While care must be taken to respect personal privacy and sample assiduously, this chapter concludes that statisticians, who may be unfamiliar with some of the programmatic steps involved in accessing social media data, must play a pivotal role in analysing it.
社交媒体数据
来自Facebook、Flickr、Instagram、Twitter和其他社交网络平台的海量社交媒体“大数据”越来越多,再加上旨在以网络规模运行的软件的发展,推动了计算社会科学的发展。通常由“数据科学家”分析,社交媒体数据与作为政府赞助活动的副产品正式传播的数据集有很大不同,例如人口普查或行政数据,这些数据长期以来一直由专业统计学家分析。本章概述了社交媒体数据的特征,并确定了关键数据源和数据捕获方法,介绍了用于获取、存储、查询、可视化和增强社交媒体数据的几种技术。对于基于人口的研究来说,社交媒体数据的不代表性和缺乏(地理)人口控制是一个问题。这些限制,以及围绕数据有效性的更广泛的认识论和伦理问题,无意中加入研究和保护用户隐私,表明在分析社交媒体数据集时应谨慎行事。虽然必须注意尊重个人隐私和勤奋取样,但本章的结论是,统计学家可能不熟悉访问社交媒体数据所涉及的一些程序化步骤,必须在分析数据时发挥关键作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信