Rumor Identification on Twitter Data for 2020 US Presidential Elections with BERT Model

A. Rahim
{"title":"Rumor Identification on Twitter Data for 2020 US Presidential Elections with BERT Model","authors":"A. Rahim","doi":"10.32350/umtair.11.03","DOIUrl":null,"url":null,"abstract":"Social Media platforms provide rich resources to its users to connect, share and search the information of their interest. It is becoming part of every day’s life and politics is no different. In fact, social media platforms are becoming more significant when it comes to governmental issues and political campaigns. As information spreads within seconds, it’s extremely challenging to control and monitor the authenticity of the information. Many attempts have been made in this regard, in this paper, we briefly overview some major efforts and discuss the patterns found in the rumors and fake news that can be found by latest machine learning techniques. We extracted the tweets data specifically with hashtag_donaldtrump during the high time of 2020 US presidential election and to test their authenticity and the similar data from fact check websites Snopes.com, factcheck.org and politifact.org. We applied the already established BERT model to train on checked data and tested on the one million tweets data. In doing so, we found a reliable accuracy and proposed the fact that once all the truthful information is saved and pretrained in the model, it is able to auto identify the validation of the information shared. Also, once established such kind of models are also helpful in finding the behavior of rumors and pattern showed for American politics.","PeriodicalId":198719,"journal":{"name":"UMT Artificial Intelligence Review","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"UMT Artificial Intelligence Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32350/umtair.11.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Social Media platforms provide rich resources to its users to connect, share and search the information of their interest. It is becoming part of every day’s life and politics is no different. In fact, social media platforms are becoming more significant when it comes to governmental issues and political campaigns. As information spreads within seconds, it’s extremely challenging to control and monitor the authenticity of the information. Many attempts have been made in this regard, in this paper, we briefly overview some major efforts and discuss the patterns found in the rumors and fake news that can be found by latest machine learning techniques. We extracted the tweets data specifically with hashtag_donaldtrump during the high time of 2020 US presidential election and to test their authenticity and the similar data from fact check websites Snopes.com, factcheck.org and politifact.org. We applied the already established BERT model to train on checked data and tested on the one million tweets data. In doing so, we found a reliable accuracy and proposed the fact that once all the truthful information is saved and pretrained in the model, it is able to auto identify the validation of the information shared. Also, once established such kind of models are also helpful in finding the behavior of rumors and pattern showed for American politics.
基于BERT模型的2020年美国总统大选Twitter数据谣言识别
社交媒体平台为用户提供了丰富的资源来连接、分享和搜索他们感兴趣的信息。它正在成为日常生活的一部分,政治也不例外。事实上,在涉及政府问题和政治竞选时,社交媒体平台正变得越来越重要。由于信息在几秒钟内传播,因此控制和监控信息的真实性极具挑战性。在这方面已经做了很多尝试,在本文中,我们简要概述了一些主要的努力,并讨论了最新的机器学习技术可以在谣言和假新闻中发现的模式。我们在2020年美国总统大选的高潮时期专门提取了hashtag_donaldtrump的推文数据,并测试了它们的真实性,以及来自事实核查网站Snopes.com、factcheck.org和politifact.org的类似数据。我们应用已经建立的BERT模型对校验数据进行训练,并对100万条tweets数据进行测试。在此过程中,我们发现了一个可靠的准确性,并提出一旦所有真实信息被保存并在模型中进行预训练,它就能够自动识别共享信息的有效性。此外,这种模型一旦建立,也有助于发现谣言的行为和模式,为美国政治所展示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信