The importance of interdisciplinary frameworks in social media mining: An exploratory approach between Computational Informatics and Social Network Analysis (SNA)

Danny Valdez, Meg Patterson, Tyler Prochnow MEd
{"title":"The importance of interdisciplinary frameworks in social media mining: An exploratory approach between Computational Informatics and Social Network Analysis (SNA)","authors":"Danny Valdez, Meg Patterson, Tyler Prochnow MEd","doi":"10.4148/2572-1836.1098","DOIUrl":null,"url":null,"abstract":"Social media content is one of the most visible sources of big data and is often used in health studies to draw inferences about various behaviors. Though much can be gleaned from social media data and mining, the approaches used to collect and analyze data are generally strengthened when examined through established theoretical frameworks. Health behavior, a theory driven field, encourages interdisciplinary collaboration across fields and theories to help us draw robust conclusions about phenomena. This pilot study uses a combined computer informatics and SNA approach to analyze information spread about mask-wearing as a personal mitigation effort during the COVID-19 pandemic. We analyzed one week’s worth of Twitter data (n = 10,107 tweets across 4,289 users) by using at least one of four popular mask-support hashtags (e.g., #maskup). We calculated network-measures to assess structures and patterns present within the Twitter network, and used exponential random graph modeling (ERGM) to test factors related to the presence of retweets between users. The pro-mask Twitter network was largely fragmented, with a select few nodes occupying the most influential positions in the network. Verified accounts, accounts with more followers, and those who generated more tweets were more likely to be retweeted. Contrarily, verified accounts and those with more followers were less likely to retweet others. SNA revealed patterns and structures theoretically important to how information spreads across Twitter. We demonstrated the utility of an interdisciplinary collaboration between computer informatics and SNA to draw conclusions from social media data.","PeriodicalId":73205,"journal":{"name":"Health behavior research","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health behavior research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4148/2572-1836.1098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Social media content is one of the most visible sources of big data and is often used in health studies to draw inferences about various behaviors. Though much can be gleaned from social media data and mining, the approaches used to collect and analyze data are generally strengthened when examined through established theoretical frameworks. Health behavior, a theory driven field, encourages interdisciplinary collaboration across fields and theories to help us draw robust conclusions about phenomena. This pilot study uses a combined computer informatics and SNA approach to analyze information spread about mask-wearing as a personal mitigation effort during the COVID-19 pandemic. We analyzed one week’s worth of Twitter data (n = 10,107 tweets across 4,289 users) by using at least one of four popular mask-support hashtags (e.g., #maskup). We calculated network-measures to assess structures and patterns present within the Twitter network, and used exponential random graph modeling (ERGM) to test factors related to the presence of retweets between users. The pro-mask Twitter network was largely fragmented, with a select few nodes occupying the most influential positions in the network. Verified accounts, accounts with more followers, and those who generated more tweets were more likely to be retweeted. Contrarily, verified accounts and those with more followers were less likely to retweet others. SNA revealed patterns and structures theoretically important to how information spreads across Twitter. We demonstrated the utility of an interdisciplinary collaboration between computer informatics and SNA to draw conclusions from social media data.
跨学科框架在社交媒体挖掘中的重要性:计算信息学与社会网络分析(SNA)之间的探索方法
社交媒体内容是大数据最明显的来源之一,经常被用于健康研究,以推断各种行为。尽管可以从社交媒体数据和挖掘中收集到很多信息,但当通过既定的理论框架进行研究时,用于收集和分析数据的方法通常会得到加强。健康行为是一个理论驱动的领域,鼓励跨领域和理论的跨学科合作,帮助我们对现象得出有力的结论。这项试点研究使用计算机信息学和国民账户体系相结合的方法来分析新冠肺炎大流行期间戴口罩作为个人缓解措施的信息传播。我们通过使用四个流行的口罩支持标签中的至少一个(例如#maskup),分析了一周的推特数据(4289名用户中的10107条推文)。我们计算了网络测量来评估推特网络中存在的结构和模式,并使用指数随机图建模(ERGM)来测试与用户之间转发存在相关的因素。支持口罩的推特网络在很大程度上是分散的,少数几个节点占据了网络中最具影响力的位置。经过验证的帐户、拥有更多追随者的帐户以及那些生成更多推文的帐户更有可能被转发。相反,经过验证的账户和拥有更多粉丝的账户转发他人的可能性较小。SNA揭示了对信息如何在推特上传播具有重要理论意义的模式和结构。我们展示了计算机信息学和SNA之间跨学科合作的实用性,以从社交媒体数据中得出结论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信