基于文本和用户行为的网络喷子分层检测

Ting Li, Ke Yu, Xiaofei Wu
{"title":"基于文本和用户行为的网络喷子分层检测","authors":"Ting Li, Ke Yu, Xiaofei Wu","doi":"10.1109/IC-NIDC54101.2021.9660415","DOIUrl":null,"url":null,"abstract":"The cyber trolls in social media have threatened users' personal rights and social order. By publishing offensive and disgusting comments on social media, cyber trolls try to shift the focus of the discussion, provoke others, and even trigger antagonistic behaviors among groups. Most of existing studies were based on English scenes. These methods mainly distinguished the cyber trolls from ordinary users according to whether the comments were offensive or not. But the studies ignored the diversity and concealment of cyber trolls, so it was difficult to identify them pertinently and finely. This paper builds a new Chinese cyber troll dataset and presents a hierarchical cyber troll detection method based on text and user behavior. Starting from the behavior motivation of cyber trolls, we divide users into two levels: inactive and active. For each level of users, this paper proposes some new behavior indicators based on the user statistical features, and selects the text features with significant influence from the comments. Next, these two types of features are input into the XGBoost model for detection. Finally, the detected cyber trolls at each level are combined as the final detection result. Experiments on our dataset show that our method is superior to other baseline methods.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical Cyber Troll Detection with Text and User Behavior\",\"authors\":\"Ting Li, Ke Yu, Xiaofei Wu\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660415\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The cyber trolls in social media have threatened users' personal rights and social order. By publishing offensive and disgusting comments on social media, cyber trolls try to shift the focus of the discussion, provoke others, and even trigger antagonistic behaviors among groups. Most of existing studies were based on English scenes. These methods mainly distinguished the cyber trolls from ordinary users according to whether the comments were offensive or not. But the studies ignored the diversity and concealment of cyber trolls, so it was difficult to identify them pertinently and finely. This paper builds a new Chinese cyber troll dataset and presents a hierarchical cyber troll detection method based on text and user behavior. Starting from the behavior motivation of cyber trolls, we divide users into two levels: inactive and active. For each level of users, this paper proposes some new behavior indicators based on the user statistical features, and selects the text features with significant influence from the comments. Next, these two types of features are input into the XGBoost model for detection. Finally, the detected cyber trolls at each level are combined as the final detection result. Experiments on our dataset show that our method is superior to other baseline methods.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660415\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660415","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

社交媒体上的网络喷子已经威胁到用户的个人权利和社会秩序。通过在社交媒体上发表冒犯性和令人厌恶的评论,网络喷子试图转移讨论的焦点,激怒他人,甚至引发群体之间的对抗行为。现有的研究大多基于英语场景。这些方法主要根据评论是否具有攻击性来区分网络喷子和普通用户。但这些研究忽略了网络喷子的多样性和隐蔽性,因此很难有针对性地准确识别它们。本文建立了一个新的中文网络喷子数据集,提出了一种基于文本和用户行为的分层网络喷子检测方法。从网络喷子的行为动机出发,我们将用户分为不活跃和活跃两个层次。对于每一层次的用户,本文基于用户统计特征提出了一些新的行为指标,并从评论中选择影响显著的文本特征。接下来,将这两种类型的特征输入到XGBoost模型中进行检测。最后,将各个层次检测到的网络喷子组合起来作为最终的检测结果。在我们的数据集上进行的实验表明,我们的方法优于其他基线方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hierarchical Cyber Troll Detection with Text and User Behavior
The cyber trolls in social media have threatened users' personal rights and social order. By publishing offensive and disgusting comments on social media, cyber trolls try to shift the focus of the discussion, provoke others, and even trigger antagonistic behaviors among groups. Most of existing studies were based on English scenes. These methods mainly distinguished the cyber trolls from ordinary users according to whether the comments were offensive or not. But the studies ignored the diversity and concealment of cyber trolls, so it was difficult to identify them pertinently and finely. This paper builds a new Chinese cyber troll dataset and presents a hierarchical cyber troll detection method based on text and user behavior. Starting from the behavior motivation of cyber trolls, we divide users into two levels: inactive and active. For each level of users, this paper proposes some new behavior indicators based on the user statistical features, and selects the text features with significant influence from the comments. Next, these two types of features are input into the XGBoost model for detection. Finally, the detected cyber trolls at each level are combined as the final detection result. Experiments on our dataset show that our method is superior to other baseline methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信