Jibing Gong, Jiquan Peng, Jin Qu, ShuYing Du, Kaiyu Wang
{"title":"Enhancing Twitter Bot Detection via Multimodal Invariant Representations","authors":"Jibing Gong, Jiquan Peng, Jin Qu, ShuYing Du, Kaiyu Wang","doi":"arxiv-2408.03096","DOIUrl":null,"url":null,"abstract":"Detecting Twitter Bots is crucial for maintaining the integrity of online\ndiscourse, safeguarding democratic processes, and preventing the spread of\nmalicious propaganda. However, advanced Twitter Bots today often employ\nsophisticated feature manipulation and account farming techniques to blend\nseamlessly with genuine user interactions, posing significant challenges to\nexisting detection models. In response to these challenges, this paper proposes\na novel Twitter Bot Detection framework called BotSAI. This framework enhances\nthe consistency of multimodal user features, accurately characterizing various\nmodalities to distinguish between real users and bots. Specifically, the\narchitecture integrates information from users, textual content, and\nheterogeneous network topologies, leveraging customized encoders to obtain\ncomprehensive user feature representations. The heterogeneous network encoder\nefficiently aggregates information from neighboring nodes through oversampling\ntechniques and local relationship transformers. Subsequently, a multi-channel\nrepresentation mechanism maps user representations into invariant and specific\nsubspaces, enhancing the feature vectors. Finally, a self-attention mechanism\nis introduced to integrate and refine the enhanced user representations,\nenabling efficient information interaction. Extensive experiments demonstrate\nthat BotSAI outperforms existing state-of-the-art methods on two major Twitter\nBot Detection benchmarks, exhibiting superior performance. Additionally,\nsystematic experiments reveal the impact of different social relationships on\ndetection accuracy, providing novel insights for the identification of social\nbots.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.03096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting Twitter Bots is crucial for maintaining the integrity of online
discourse, safeguarding democratic processes, and preventing the spread of
malicious propaganda. However, advanced Twitter Bots today often employ
sophisticated feature manipulation and account farming techniques to blend
seamlessly with genuine user interactions, posing significant challenges to
existing detection models. In response to these challenges, this paper proposes
a novel Twitter Bot Detection framework called BotSAI. This framework enhances
the consistency of multimodal user features, accurately characterizing various
modalities to distinguish between real users and bots. Specifically, the
architecture integrates information from users, textual content, and
heterogeneous network topologies, leveraging customized encoders to obtain
comprehensive user feature representations. The heterogeneous network encoder
efficiently aggregates information from neighboring nodes through oversampling
techniques and local relationship transformers. Subsequently, a multi-channel
representation mechanism maps user representations into invariant and specific
subspaces, enhancing the feature vectors. Finally, a self-attention mechanism
is introduced to integrate and refine the enhanced user representations,
enabling efficient information interaction. Extensive experiments demonstrate
that BotSAI outperforms existing state-of-the-art methods on two major Twitter
Bot Detection benchmarks, exhibiting superior performance. Additionally,
systematic experiments reveal the impact of different social relationships on
detection accuracy, providing novel insights for the identification of social
bots.