Unsupervised Social Bot Detection via Structural Information Theory

IF 8.3 2区材料科学 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY

ACS Applied Materials & Interfaces Pub Date : 2024-04-21 DOI:10.1145/3660522

Hao Peng, Jingyun Zhang, Xiang Huang, Zhifeng Hao, Angsheng Li, Zhengtao Yu, Philip S. Yu

{"title":"Unsupervised Social Bot Detection via Structural Information Theory","authors":"Hao Peng, Jingyun Zhang, Xiang Huang, Zhifeng Hao, Angsheng Li, Zhengtao Yu, Philip S. Yu","doi":"10.1145/3660522","DOIUrl":null,"url":null,"abstract":"\n Research on social bot detection plays a crucial role in maintaining the order and reliability of information dissemination while increasing trust in social interactions. The current mainstream social bot detection models rely on black-box neural network technology, e.g., Graph Neural Network, Transformer, etc., which lacks interpretability. In this work, we present UnDBot, a novel unsupervised, interpretable, yet effective and practical framework for detecting social bots. This framework is built upon structural information theory. We begin by designing three social relationship metrics that capture various aspects of social bot behaviors:\n Posting Type Distribution\n ,\n Posting Influence\n , and\n Follow-to-follower Ratio\n . Three new relationships are utilized to construct a new, unified, and weighted social multi-relational graph, aiming to model the relevance of social user behaviors and discover long-distance correlations between users. Second, we introduce a novel method for optimizing heterogeneous structural entropy. This method involves the personalized aggregation of edge information from the social multi-relational graph to generate a two-dimensional encoding tree. The heterogeneous structural entropy facilitates decoding of the substantial structure of the social bots network and enables hierarchical clustering of social bots. Thirdly, a new community labeling method is presented to distinguish social bot communities by computing the user’s stationary distribution, measuring user contributions to network structure, and counting the intensity of user aggregation within the community. Compared with ten representative social bot detection approaches, comprehensive experiments demonstrate the advantages of effectiveness and interpretability of UnDBot on four real social network datasets.\n","PeriodicalId":5,"journal":{"name":"ACS Applied Materials & Interfaces","volume":"103 51","pages":""},"PeriodicalIF":8.3000,"publicationDate":"2024-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Materials & Interfaces","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3660522","RegionNum":2,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 1

Abstract

Research on social bot detection plays a crucial role in maintaining the order and reliability of information dissemination while increasing trust in social interactions. The current mainstream social bot detection models rely on black-box neural network technology, e.g., Graph Neural Network, Transformer, etc., which lacks interpretability. In this work, we present UnDBot, a novel unsupervised, interpretable, yet effective and practical framework for detecting social bots. This framework is built upon structural information theory. We begin by designing three social relationship metrics that capture various aspects of social bot behaviors: Posting Type Distribution , Posting Influence , and Follow-to-follower Ratio . Three new relationships are utilized to construct a new, unified, and weighted social multi-relational graph, aiming to model the relevance of social user behaviors and discover long-distance correlations between users. Second, we introduce a novel method for optimizing heterogeneous structural entropy. This method involves the personalized aggregation of edge information from the social multi-relational graph to generate a two-dimensional encoding tree. The heterogeneous structural entropy facilitates decoding of the substantial structure of the social bots network and enables hierarchical clustering of social bots. Thirdly, a new community labeling method is presented to distinguish social bot communities by computing the user’s stationary distribution, measuring user contributions to network structure, and counting the intensity of user aggregation within the community. Compared with ten representative social bot detection approaches, comprehensive experiments demonstrate the advantages of effectiveness and interpretability of UnDBot on four real social network datasets.

查看原文本刊更多论文

通过结构信息论进行无监督社交机器人检测

社交僵尸检测研究在维护信息传播秩序和可靠性、提高社交互动信任度方面发挥着至关重要的作用。目前主流的社交僵尸检测模型依赖于黑盒神经网络技术，如图神经网络、变形器等，缺乏可解释性。在这项工作中，我们提出了 UnDBot，这是一种新型的无监督、可解释、有效且实用的社交机器人检测框架。该框架建立在结构信息论的基础上。我们首先设计了三种社会关系度量标准，以捕捉社交机器人行为的各个方面：发帖类型分布、发帖影响力和关注者与关注者比率。我们利用这三种新关系构建了一个新的、统一的、加权的社交多关系图，旨在为社交用户行为的相关性建模，并发现用户之间的远距离相关性。其次，我们介绍了一种优化异构结构熵的新方法。这种方法涉及对社交多关系图中的边缘信息进行个性化聚合，生成二维编码树。异构结构熵有助于解码社交机器人网络的实质性结构，并实现社交机器人的分层聚类。第三，提出了一种新的社区标签方法，通过计算用户的固定分布、衡量用户对网络结构的贡献以及统计社区内用户聚集的强度来区分社交机器人社区。与十种具有代表性的社交僵尸检测方法相比，UnDBot 在四个真实社交网络数据集上的综合实验证明了其有效性和可解释性的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACS Applied Materials & Interfaces 工程技术-材料科学：综合

CiteScore

16.00

自引率

6.30%

发文量

4978

审稿时长

1.8 months

期刊介绍： ACS Applied Materials & Interfaces is a leading interdisciplinary journal that brings together chemists, engineers, physicists, and biologists to explore the development and utilization of newly-discovered materials and interfacial processes for specific applications. Our journal has experienced remarkable growth since its establishment in 2009, both in terms of the number of articles published and the impact of the research showcased. We are proud to foster a truly global community, with the majority of published articles originating from outside the United States, reflecting the rapid growth of applied research worldwide.