Understanding the characteristics of COVID-19 misinformation communities through graphlet analysis

Q1 Social Sciences

Online Social Networks and Media Pub Date : 2022-01-01 DOI:10.1016/j.osnem.2021.100178

James R. Ashford , Liam D. Turner , Roger M. Whitaker , Alun Preece , Diane Felmlee

{"title":"Understanding the characteristics of COVID-19 misinformation communities through graphlet analysis","authors":"James R. Ashford , Liam D. Turner , Roger M. Whitaker , Alun Preece , Diane Felmlee","doi":"10.1016/j.osnem.2021.100178","DOIUrl":null,"url":null,"abstract":"<div><p>Online social networks serve as a convenient way to connect, share, and promote content with others. As a result, these networks can be used with malicious intent, causing disruption and harm to public debate through the sharing of misinformation. However, automatically identifying such content through its use of natural language is a significant challenge compared to our solution which uses less computational resources, language-agnostic and without the need for complex semantic analysis. Consequently alternative and complementary approaches are highly valuable. In this paper, we assess content that has the potential for misinformation and focus on patterns of user association with online social media communities (subreddits) in the popular Reddit social media platform, and generate networks of behaviour capturing user interaction with different subreddits. We examine these networks using both global and local metrics, in particular noting the presence of induced substructures (graphlets) assessing <span><math><mrow><mn>7</mn><mo>,</mo><mn>876</mn><mo>,</mo><mn>064</mn></mrow></math></span> posts from 96,634 users. From subreddits identified as having potential for misinformation, we note that the associated networks have strongly defined local features relating to node degree — these are evident both from analysis of dominant graphlets and degree-related global metrics. We find that these local features support high accuracy classification of subreddits that are categorised as having the potential for misinformation. Consequently we observe that induced local substructures of high degree are fundamental metrics for subreddit classification, and support automatic detection capabilities for online misinformation independent from any particular language.</p></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468696421000586/pdfft?md5=7bf5933a81760cdedf22974545a1b7e2&pid=1-s2.0-S2468696421000586-main.pdf","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696421000586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 6

Abstract

Online social networks serve as a convenient way to connect, share, and promote content with others. As a result, these networks can be used with malicious intent, causing disruption and harm to public debate through the sharing of misinformation. However, automatically identifying such content through its use of natural language is a significant challenge compared to our solution which uses less computational resources, language-agnostic and without the need for complex semantic analysis. Consequently alternative and complementary approaches are highly valuable. In this paper, we assess content that has the potential for misinformation and focus on patterns of user association with online social media communities (subreddits) in the popular Reddit social media platform, and generate networks of behaviour capturing user interaction with different subreddits. We examine these networks using both global and local metrics, in particular noting the presence of induced substructures (graphlets) assessing $7, 876, 064$ posts from 96,634 users. From subreddits identified as having potential for misinformation, we note that the associated networks have strongly defined local features relating to node degree — these are evident both from analysis of dominant graphlets and degree-related global metrics. We find that these local features support high accuracy classification of subreddits that are categorised as having the potential for misinformation. Consequently we observe that induced local substructures of high degree are fundamental metrics for subreddit classification, and support automatic detection capabilities for online misinformation independent from any particular language.

查看原文本刊更多论文

通过graphlet分析了解新冠病毒误传社群特征

在线社交网络是与他人联系、分享和推广内容的便捷方式。因此，这些网络可以被恶意利用，通过分享错误信息对公众辩论造成破坏和伤害。然而，与我们的解决方案相比，通过使用自然语言来自动识别这些内容是一个重大挑战，我们的解决方案使用较少的计算资源，与语言无关，不需要复杂的语义分析。因此，替代和补充方法是非常有价值的。在本文中，我们评估了可能存在错误信息的内容，并关注了流行的Reddit社交媒体平台中用户与在线社交媒体社区(子Reddit)的关联模式，并生成了捕获用户与不同子Reddit互动的行为网络。我们使用全局和局部指标来检查这些网络，特别注意到诱导子结构(石墨)的存在，评估了来自96,634名用户的7,876,064个帖子。从被识别为具有潜在错误信息的子reddit中，我们注意到相关网络具有与节点度相关的强烈定义的局部特征——这些特征从主导石墨烯和与度相关的全局指标的分析中都很明显。我们发现这些局部特征支持对被分类为具有错误信息潜力的子reddit进行高精度分类。因此，我们观察到高程度的诱导局部子结构是subreddit分类的基本指标，并且支持独立于任何特定语言的在线错误信息自动检测能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Online Social Networks and Media Social Sciences-Communication

CiteScore

10.60

自引率

0.00%

发文量

审稿时长

44 days