"Comments Matter and The More The Better!": Improving Rumor Detection with User Comments

Yang Xu, Jie Guo, Weidong Qiu, Zheng Huang, Enes ALTUNCU, Shujun Li
{"title":"\"Comments Matter and The More The Better!\": Improving Rumor Detection with User Comments","authors":"Yang Xu, Jie Guo, Weidong Qiu, Zheng Huang, Enes ALTUNCU, Shujun Li","doi":"10.1109/TrustCom56396.2022.00060","DOIUrl":null,"url":null,"abstract":"While many online platforms bring great benefits to their users by allowing user-generated content, they have also facilitated generation and spreading of harmful content such as rumors. Researcher have proposed different rumor detection methods based on features extracted from the original post and/or associated comments, but how comments affect the performance of such methods remains largely less understood. In this paper, we first propose a new BERT-based rumor detection method that can outperform other state-of-the-art methods, and then used it to study the role of comments in rumor detection. Our proposed method concatenates the original post and associated comments to form a single long text, which is then segmented into shorter chunks more suitable for BERT-based vectorization. Features extracted from all trunks are fed into a classifier based on an LSTM network or a transformer layer for the classification task. The experimental results on the PHEME and Ma-Weibo datasets proved the superior performance of our method. We conducted additional experiments on different settings of our proposed method to study different aspects of the role comments play in the rumor detection task. These additional experiments led to some very interesting findings, including the surprising result that fixed-length segmentation is better than natural segmentation, and the observation that including more comments can help improve the rumor detector’s performance. Some of these findings have profound operational implications for online platforms, e.g., commentators can contribute to rumor detection positively so online platforms can leverage the crowd intelligence to detect online rumors more effectively without applying overstrict content consensus policies.","PeriodicalId":276379,"journal":{"name":"2022 IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)","volume":"03 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TrustCom56396.2022.00060","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

While many online platforms bring great benefits to their users by allowing user-generated content, they have also facilitated generation and spreading of harmful content such as rumors. Researcher have proposed different rumor detection methods based on features extracted from the original post and/or associated comments, but how comments affect the performance of such methods remains largely less understood. In this paper, we first propose a new BERT-based rumor detection method that can outperform other state-of-the-art methods, and then used it to study the role of comments in rumor detection. Our proposed method concatenates the original post and associated comments to form a single long text, which is then segmented into shorter chunks more suitable for BERT-based vectorization. Features extracted from all trunks are fed into a classifier based on an LSTM network or a transformer layer for the classification task. The experimental results on the PHEME and Ma-Weibo datasets proved the superior performance of our method. We conducted additional experiments on different settings of our proposed method to study different aspects of the role comments play in the rumor detection task. These additional experiments led to some very interesting findings, including the surprising result that fixed-length segmentation is better than natural segmentation, and the observation that including more comments can help improve the rumor detector’s performance. Some of these findings have profound operational implications for online platforms, e.g., commentators can contribute to rumor detection positively so online platforms can leverage the crowd intelligence to detect online rumors more effectively without applying overstrict content consensus policies.
“评论很重要,而且越多越好!”:利用用户评论改进谣言检测
许多网络平台通过允许用户生成内容给用户带来巨大利益的同时,也为谣言等有害内容的产生和传播提供了便利。研究人员提出了基于从原始帖子和/或相关评论中提取的特征的不同谣言检测方法,但评论如何影响这些方法的性能仍然知之甚少。在本文中,我们首先提出了一种新的基于bert的谣言检测方法,该方法优于其他现有的方法,然后用它来研究评论在谣言检测中的作用。我们提出的方法将原始帖子和相关评论连接起来形成一个长文本,然后将其分割成更适合基于bert的矢量化的更短的块。从所有主干中提取的特征被馈送到基于LSTM网络或变压器层的分类器中进行分类任务。在PHEME和Ma-Weibo数据集上的实验结果证明了该方法的优越性。我们在我们提出的方法的不同设置上进行了额外的实验,以研究评论在谣言检测任务中所起作用的不同方面。这些额外的实验导致了一些非常有趣的发现,包括固定长度分割比自然分割更好的惊人结果,以及包含更多评论有助于提高谣言检测器性能的观察。其中一些发现对网络平台具有深远的操作意义,例如,评论者可以积极地促进谣言检测,因此在线平台可以利用人群智能更有效地检测网络谣言,而无需采用过于严格的内容共识政策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信