如何找到社交机器人?

Proceedings of the 2023 6th International Conference on Software Engineering and Information Management Pub Date : 2023-01-31 DOI:10.1145/3584871.3584873

Jianwei Ding, Zhouguo Chen

{"title":"如何找到社交机器人?","authors":"Jianwei Ding, Zhouguo Chen","doi":"10.1145/3584871.3584873","DOIUrl":null,"url":null,"abstract":"With the rapid development of artificial intelligence and natural language processing, there are more and more social robots applied in the social networks such as Twitter, intended to lead public opinion or crawling private information illegally. The problem of detection social robots, which is automated social accounts governed by artificial intelligence software, pretend to be a human user. There are some technologies proposed to detect the social robots automatically applied to the real social network for verification. Hence, conventional social robot detecting technologies proposed before are applied to detect by the account's metadata or account posted tweet content respectively. With the help of pre-trained language model such as BERT, this paper propose a deep neural network model based on contextual long short-term memory (LSTM) architecture named DeepBot, which exploits tweet content and account's metadata features. The architecture of DeepBot contains three phases: (1) it uses the pretrained model such as BERT to extract the embedding vector from the tweet content of the specific account, and (2) it choose more discriminative account metadata to extract a metadata vector, and then (3) it combines the auxiliary embedding vector and metadata vector into decoder layer to train a detecting model. What's more, in this paper, we review the labelling social robots datasets proposed in public, and get a mixture datasets of labelling social datasets to verify and compare the experimental results of our proposed DeepBot and other conventional methods. We also present empirical results of DeepBot and our ongoing experimentation with it, as we have gained experience applying it to the mixture labeling social robot dataset, including over 10000 accounts. The experimental results show that DeepBot outperforms previous state-of-the-art methods, with leveraging a small and interpretable set of features.","PeriodicalId":173315,"journal":{"name":"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"How to Find Social Robots exactly?\",\"authors\":\"Jianwei Ding, Zhouguo Chen\",\"doi\":\"10.1145/3584871.3584873\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of artificial intelligence and natural language processing, there are more and more social robots applied in the social networks such as Twitter, intended to lead public opinion or crawling private information illegally. The problem of detection social robots, which is automated social accounts governed by artificial intelligence software, pretend to be a human user. There are some technologies proposed to detect the social robots automatically applied to the real social network for verification. Hence, conventional social robot detecting technologies proposed before are applied to detect by the account's metadata or account posted tweet content respectively. With the help of pre-trained language model such as BERT, this paper propose a deep neural network model based on contextual long short-term memory (LSTM) architecture named DeepBot, which exploits tweet content and account's metadata features. The architecture of DeepBot contains three phases: (1) it uses the pretrained model such as BERT to extract the embedding vector from the tweet content of the specific account, and (2) it choose more discriminative account metadata to extract a metadata vector, and then (3) it combines the auxiliary embedding vector and metadata vector into decoder layer to train a detecting model. What's more, in this paper, we review the labelling social robots datasets proposed in public, and get a mixture datasets of labelling social datasets to verify and compare the experimental results of our proposed DeepBot and other conventional methods. We also present empirical results of DeepBot and our ongoing experimentation with it, as we have gained experience applying it to the mixture labeling social robot dataset, including over 10000 accounts. The experimental results show that DeepBot outperforms previous state-of-the-art methods, with leveraging a small and interpretable set of features.\",\"PeriodicalId\":173315,\"journal\":{\"name\":\"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3584871.3584873\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 6th International Conference on Software Engineering and Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3584871.3584873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着人工智能和自然语言处理的快速发展，越来越多的社交机器人应用于Twitter等社交网络，意图引导舆论或非法抓取私人信息。检测社交机器人的问题是，由人工智能软件控制的自动社交账户，假装是人类用户。提出了一些检测社交机器人的技术，将其自动应用到真实的社交网络中进行验证。因此，我们采用之前提出的传统社交机器人检测技术，分别通过账户元数据或账户发布的推文内容进行检测。在BERT等预训练语言模型的帮助下，本文提出了一种基于上下文长短期记忆(LSTM)架构的深度神经网络模型DeepBot，该模型利用推文内容和账户的元数据特征。DeepBot的架构包含三个阶段:(1)使用BERT等预训练模型从特定账号的推文内容中提取嵌入向量，(2)选择更具判别性的账号元数据提取元数据向量，然后(3)将辅助嵌入向量和元数据向量组合到解码器层中训练检测模型。此外，在本文中，我们回顾了公开提出的标记社交机器人数据集，并获得了标记社交数据集的混合数据集，以验证和比较我们提出的DeepBot和其他传统方法的实验结果。我们还介绍了DeepBot的实证结果以及我们正在进行的实验，因为我们已经获得了将其应用于混合标记社交机器人数据集的经验，包括超过10000个帐户。实验结果表明，DeepBot利用了一组小而可解释的特征，优于以前最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

How to Find Social Robots exactly?

With the rapid development of artificial intelligence and natural language processing, there are more and more social robots applied in the social networks such as Twitter, intended to lead public opinion or crawling private information illegally. The problem of detection social robots, which is automated social accounts governed by artificial intelligence software, pretend to be a human user. There are some technologies proposed to detect the social robots automatically applied to the real social network for verification. Hence, conventional social robot detecting technologies proposed before are applied to detect by the account's metadata or account posted tweet content respectively. With the help of pre-trained language model such as BERT, this paper propose a deep neural network model based on contextual long short-term memory (LSTM) architecture named DeepBot, which exploits tweet content and account's metadata features. The architecture of DeepBot contains three phases: (1) it uses the pretrained model such as BERT to extract the embedding vector from the tweet content of the specific account, and (2) it choose more discriminative account metadata to extract a metadata vector, and then (3) it combines the auxiliary embedding vector and metadata vector into decoder layer to train a detecting model. What's more, in this paper, we review the labelling social robots datasets proposed in public, and get a mixture datasets of labelling social datasets to verify and compare the experimental results of our proposed DeepBot and other conventional methods. We also present empirical results of DeepBot and our ongoing experimentation with it, as we have gained experience applying it to the mixture labeling social robot dataset, including over 10000 accounts. The experimental results show that DeepBot outperforms previous state-of-the-art methods, with leveraging a small and interpretable set of features.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2023 6th International Conference on Software Engineering and Information Management

自引率

0.00%

发文量