Describing rumours: a comparative evaluation of two handcrafted representations for rumour detection

Luisa Francini, P. Soda, R. Sicilia
{"title":"Describing rumours: a comparative evaluation of two handcrafted representations for rumour detection","authors":"Luisa Francini, P. Soda, R. Sicilia","doi":"10.1109/IDT52577.2021.9497615","DOIUrl":null,"url":null,"abstract":"Nowadays, people use more and more social media as a source of information, leading to an increased and uncontrolled spread of misinformation. For this reason, tools to detect unverified and instrumentally relevant news, named as rumours, are necessary. In this work we compare two state-of-the-art handcrafted representations, namely User-Network and Social-Content, designed for developing machine learning-based rumour detection systems, in order to analyse which descriptors best capture the information hidden in unknown rumours. To this end we set up an experimental assessment implementing a Leave-One-Topic-Out evaluation on 8 different topics retrieved from Twitter social microblog. The results obtained for both representations are low as we designed a simple and non optimised pipeline for a fair comparison. Besides this, we were able to find out that the User-Network set of feature results more stable to topic changes. As a further contribution, we introduce two new datasets labelled for rumour detection task on Twitter.","PeriodicalId":316100,"journal":{"name":"2021 International Conference on Information and Digital Technologies (IDT)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Digital Technologies (IDT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDT52577.2021.9497615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Nowadays, people use more and more social media as a source of information, leading to an increased and uncontrolled spread of misinformation. For this reason, tools to detect unverified and instrumentally relevant news, named as rumours, are necessary. In this work we compare two state-of-the-art handcrafted representations, namely User-Network and Social-Content, designed for developing machine learning-based rumour detection systems, in order to analyse which descriptors best capture the information hidden in unknown rumours. To this end we set up an experimental assessment implementing a Leave-One-Topic-Out evaluation on 8 different topics retrieved from Twitter social microblog. The results obtained for both representations are low as we designed a simple and non optimised pipeline for a fair comparison. Besides this, we were able to find out that the User-Network set of feature results more stable to topic changes. As a further contribution, we introduce two new datasets labelled for rumour detection task on Twitter.
描述谣言:谣言检测的两种手工表示的比较评价
如今,人们越来越多地使用社交媒体作为信息来源,导致错误信息的传播增加且不受控制。因此,有必要使用工具来检测未经证实的、与工具相关的新闻(称为谣言)。在这项工作中,我们比较了两种最先进的手工表示,即用户网络和社会内容,旨在开发基于机器学习的谣言检测系统,以分析哪种描述符最能捕获隐藏在未知谣言中的信息。为此,我们建立了一个实验评估,对从Twitter社交微博中检索的8个不同主题进行了Leave-One-Topic-Out评估。两种表示的结果都很低,因为我们设计了一个简单且未优化的管道进行公平比较。除此之外,我们可以发现User-Network的特征结果集对于主题的变化更加稳定。作为进一步的贡献,我们引入了两个新的数据集,标记为Twitter上的谣言检测任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信