定性数据探索的计算社会科学视角:使用主题模型对社交媒体数据进行描述性分析*

IF 1.5 Q2 SOCIAL WORK
Maria Rodriguez, Heather L. Storer
{"title":"定性数据探索的计算社会科学视角:使用主题模型对社交媒体数据进行描述性分析*","authors":"Maria Rodriguez, Heather L. Storer","doi":"10.1080/15228835.2019.1616350","DOIUrl":null,"url":null,"abstract":"Abstract Comparing and contrasting qualitative and quantitative methods for social media data exploration, this article describes and demonstrates the topic modeling approach for the descriptive analysis of large unstructured text data. Using a sample of tweets with the #WhyIStayed and #WhyILeft hashtags (n = 3,068), a Twitter conversation describing the reasons individuals left or stayed in abusive relationships, a traditional thematic analysis was used to qualitatively code the tweets. The same tweet sample was subject to a series of quantitative topic models. Results suggest topic modeling as a comparable approach to first-round qualitative analysis, with key differences: topic modeling and traditional thematic analysis are both inductive and phenomenon-oriented, but topic modeling results in a lexical semantic analysis, in contrast to the compositional semantic analysis offered by the qualitative approach. An evaluation of topics and codes using the Linguistic Inquiry and Word Count (LIWC) software further supports these findings. We argue topic modeling is a useful method for the descriptive analysis of unstructured social media data sets, and is best used as part of a mixed-method strategy, with topic model results guiding deeper qualitative analysis. Implications for human service intervention development and evaluation are discussed.","PeriodicalId":46115,"journal":{"name":"JOURNAL OF TECHNOLOGY IN HUMAN SERVICES","volume":"38 1","pages":"54 - 86"},"PeriodicalIF":1.5000,"publicationDate":"2020-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/15228835.2019.1616350","citationCount":"60","resultStr":"{\"title\":\"A computational social science perspective on qualitative data exploration: Using topic models for the descriptive analysis of social media data*\",\"authors\":\"Maria Rodriguez, Heather L. Storer\",\"doi\":\"10.1080/15228835.2019.1616350\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Comparing and contrasting qualitative and quantitative methods for social media data exploration, this article describes and demonstrates the topic modeling approach for the descriptive analysis of large unstructured text data. Using a sample of tweets with the #WhyIStayed and #WhyILeft hashtags (n = 3,068), a Twitter conversation describing the reasons individuals left or stayed in abusive relationships, a traditional thematic analysis was used to qualitatively code the tweets. The same tweet sample was subject to a series of quantitative topic models. Results suggest topic modeling as a comparable approach to first-round qualitative analysis, with key differences: topic modeling and traditional thematic analysis are both inductive and phenomenon-oriented, but topic modeling results in a lexical semantic analysis, in contrast to the compositional semantic analysis offered by the qualitative approach. An evaluation of topics and codes using the Linguistic Inquiry and Word Count (LIWC) software further supports these findings. We argue topic modeling is a useful method for the descriptive analysis of unstructured social media data sets, and is best used as part of a mixed-method strategy, with topic model results guiding deeper qualitative analysis. Implications for human service intervention development and evaluation are discussed.\",\"PeriodicalId\":46115,\"journal\":{\"name\":\"JOURNAL OF TECHNOLOGY IN HUMAN SERVICES\",\"volume\":\"38 1\",\"pages\":\"54 - 86\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2020-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/15228835.2019.1616350\",\"citationCount\":\"60\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JOURNAL OF TECHNOLOGY IN HUMAN SERVICES\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/15228835.2019.1616350\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SOCIAL WORK\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JOURNAL OF TECHNOLOGY IN HUMAN SERVICES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/15228835.2019.1616350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL WORK","Score":null,"Total":0}
引用次数: 60

摘要

本文通过对社交媒体数据挖掘的定性和定量方法的比较和对比,描述并演示了用于大型非结构化文本数据描述性分析的主题建模方法。使用带有#WhyIStayed和#WhyILeft标签的推文样本(n = 3,068),描述个人离开或留在虐待关系中的原因的推文对话,使用传统的主题分析对推文进行定性编码。同一条推文样本受到一系列定量主题模型的影响。结果表明,主题建模可以与第一轮定性分析相比较,主要区别在于:主题建模和传统的主题分析都是归纳和面向现象的,但主题建模的结果是词法语义分析,而不是定性方法提供的组成语义分析。使用语言调查和单词计数(LIWC)软件对主题和代码进行的评估进一步支持了这些发现。我们认为,主题建模对于非结构化社交媒体数据集的描述性分析是一种有用的方法,并且最好作为混合方法策略的一部分使用,主题模型结果指导更深层次的定性分析。对人类服务干预发展和评价的意义进行了讨论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A computational social science perspective on qualitative data exploration: Using topic models for the descriptive analysis of social media data*
Abstract Comparing and contrasting qualitative and quantitative methods for social media data exploration, this article describes and demonstrates the topic modeling approach for the descriptive analysis of large unstructured text data. Using a sample of tweets with the #WhyIStayed and #WhyILeft hashtags (n = 3,068), a Twitter conversation describing the reasons individuals left or stayed in abusive relationships, a traditional thematic analysis was used to qualitatively code the tweets. The same tweet sample was subject to a series of quantitative topic models. Results suggest topic modeling as a comparable approach to first-round qualitative analysis, with key differences: topic modeling and traditional thematic analysis are both inductive and phenomenon-oriented, but topic modeling results in a lexical semantic analysis, in contrast to the compositional semantic analysis offered by the qualitative approach. An evaluation of topics and codes using the Linguistic Inquiry and Word Count (LIWC) software further supports these findings. We argue topic modeling is a useful method for the descriptive analysis of unstructured social media data sets, and is best used as part of a mixed-method strategy, with topic model results guiding deeper qualitative analysis. Implications for human service intervention development and evaluation are discussed.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.20
自引率
6.70%
发文量
6
期刊介绍: This peer-reviewed, refereed journal explores the potentials of computer and telecommunications technologies in mental health, developmental disability, welfare, addictions, education, and other human services. The Journal of Technology in Human Services covers the full range of technological applications, including direct service techniques. It not only provides the necessary historical perspectives on the use of computers in the human service field, but it also presents articles that will improve your technology literacy and keep you abreast of state-of-the-art developments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信