Understanding the characteristics of COVID-19 misinformation communities through graphlet analysis

Q1 Social Sciences
James R. Ashford , Liam D. Turner , Roger M. Whitaker , Alun Preece , Diane Felmlee
{"title":"Understanding the characteristics of COVID-19 misinformation communities through graphlet analysis","authors":"James R. Ashford ,&nbsp;Liam D. Turner ,&nbsp;Roger M. Whitaker ,&nbsp;Alun Preece ,&nbsp;Diane Felmlee","doi":"10.1016/j.osnem.2021.100178","DOIUrl":null,"url":null,"abstract":"<div><p>Online social networks serve as a convenient way to connect, share, and promote content with others. As a result, these networks can be used with malicious intent, causing disruption and harm to public debate through the sharing of misinformation. However, automatically identifying such content through its use of natural language is a significant challenge compared to our solution which uses less computational resources, language-agnostic and without the need for complex semantic analysis. Consequently alternative and complementary approaches are highly valuable. In this paper, we assess content that has the potential for misinformation and focus on patterns of user association with online social media communities (subreddits) in the popular Reddit social media platform, and generate networks of behaviour capturing user interaction with different subreddits. We examine these networks using both global and local metrics, in particular noting the presence of induced substructures (graphlets) assessing <span><math><mrow><mn>7</mn><mo>,</mo><mn>876</mn><mo>,</mo><mn>064</mn></mrow></math></span> posts from 96,634 users. From subreddits identified as having potential for misinformation, we note that the associated networks have strongly defined local features relating to node degree — these are evident both from analysis of dominant graphlets and degree-related global metrics. We find that these local features support high accuracy classification of subreddits that are categorised as having the potential for misinformation. Consequently we observe that induced local substructures of high degree are fundamental metrics for subreddit classification, and support automatic detection capabilities for online misinformation independent from any particular language.</p></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468696421000586/pdfft?md5=7bf5933a81760cdedf22974545a1b7e2&pid=1-s2.0-S2468696421000586-main.pdf","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696421000586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 6

Abstract

Online social networks serve as a convenient way to connect, share, and promote content with others. As a result, these networks can be used with malicious intent, causing disruption and harm to public debate through the sharing of misinformation. However, automatically identifying such content through its use of natural language is a significant challenge compared to our solution which uses less computational resources, language-agnostic and without the need for complex semantic analysis. Consequently alternative and complementary approaches are highly valuable. In this paper, we assess content that has the potential for misinformation and focus on patterns of user association with online social media communities (subreddits) in the popular Reddit social media platform, and generate networks of behaviour capturing user interaction with different subreddits. We examine these networks using both global and local metrics, in particular noting the presence of induced substructures (graphlets) assessing 7,876,064 posts from 96,634 users. From subreddits identified as having potential for misinformation, we note that the associated networks have strongly defined local features relating to node degree — these are evident both from analysis of dominant graphlets and degree-related global metrics. We find that these local features support high accuracy classification of subreddits that are categorised as having the potential for misinformation. Consequently we observe that induced local substructures of high degree are fundamental metrics for subreddit classification, and support automatic detection capabilities for online misinformation independent from any particular language.

通过graphlet分析了解新冠病毒误传社群特征
在线社交网络是与他人联系、分享和推广内容的便捷方式。因此,这些网络可以被恶意利用,通过分享错误信息对公众辩论造成破坏和伤害。然而,与我们的解决方案相比,通过使用自然语言来自动识别这些内容是一个重大挑战,我们的解决方案使用较少的计算资源,与语言无关,不需要复杂的语义分析。因此,替代和补充方法是非常有价值的。在本文中,我们评估了可能存在错误信息的内容,并关注了流行的Reddit社交媒体平台中用户与在线社交媒体社区(子Reddit)的关联模式,并生成了捕获用户与不同子Reddit互动的行为网络。我们使用全局和局部指标来检查这些网络,特别注意到诱导子结构(石墨)的存在,评估了来自96,634名用户的7,876,064个帖子。从被识别为具有潜在错误信息的子reddit中,我们注意到相关网络具有与节点度相关的强烈定义的局部特征——这些特征从主导石墨烯和与度相关的全局指标的分析中都很明显。我们发现这些局部特征支持对被分类为具有错误信息潜力的子reddit进行高精度分类。因此,我们观察到高程度的诱导局部子结构是subreddit分类的基本指标,并且支持独立于任何特定语言的在线错误信息自动检测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Online Social Networks and Media
Online Social Networks and Media Social Sciences-Communication
CiteScore
10.60
自引率
0.00%
发文量
32
审稿时长
44 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信