使用朴素贝叶斯的假新闻检测

Nurshaheeda Shazleen Yuslee, N. Abdullah
{"title":"使用朴素贝叶斯的假新闻检测","authors":"Nurshaheeda Shazleen Yuslee, N. Abdullah","doi":"10.1109/ICSET53708.2021.9612540","DOIUrl":null,"url":null,"abstract":"The issue of fake news arises every year. Moreover, the enhancement and evolution of technologies enable the news to be manipulated by irresponsible people. However, it is not deniable that somehow this technology impacts our daily life. Nowadays, people get the latest news through the social media platforms as it is free, easy to access, and fast. However, not all the news on social media is reliable, and some fake news are spread to mislead the readers. Fake news can disseminate information to confuse people to believe things that are not true. In Natural Language Processing, text processing such as regular expression, removing the stop words and lemmatization are done before the data is being transformed into N-grams using TF-IDF and Count Vectorizer. Therefore, this paper aimed to review the fake news detection using the Naive Bayes algorithms. Results shows that Naive Bayes with n-gram gives a slight increase in the accuracy of TF-IDF and Count Vectorizer. It proves that TF-IDF Vectorizer can detect fake news better as it has higher precision of 94 % whereas Count Vectorizer can detect both fake news and real news in quite a balance.","PeriodicalId":433197,"journal":{"name":"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)","volume":"02 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Fake News Detection using Naive Bayes\",\"authors\":\"Nurshaheeda Shazleen Yuslee, N. Abdullah\",\"doi\":\"10.1109/ICSET53708.2021.9612540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The issue of fake news arises every year. Moreover, the enhancement and evolution of technologies enable the news to be manipulated by irresponsible people. However, it is not deniable that somehow this technology impacts our daily life. Nowadays, people get the latest news through the social media platforms as it is free, easy to access, and fast. However, not all the news on social media is reliable, and some fake news are spread to mislead the readers. Fake news can disseminate information to confuse people to believe things that are not true. In Natural Language Processing, text processing such as regular expression, removing the stop words and lemmatization are done before the data is being transformed into N-grams using TF-IDF and Count Vectorizer. Therefore, this paper aimed to review the fake news detection using the Naive Bayes algorithms. Results shows that Naive Bayes with n-gram gives a slight increase in the accuracy of TF-IDF and Count Vectorizer. It proves that TF-IDF Vectorizer can detect fake news better as it has higher precision of 94 % whereas Count Vectorizer can detect both fake news and real news in quite a balance.\",\"PeriodicalId\":433197,\"journal\":{\"name\":\"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)\",\"volume\":\"02 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSET53708.2021.9612540\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 11th International Conference on System Engineering and Technology (ICSET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSET53708.2021.9612540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

假新闻的问题每年都会出现。此外,技术的进步和发展使新闻能够被不负责任的人操纵。然而,不可否认的是,这项技术以某种方式影响着我们的日常生活。如今,人们通过社交媒体平台获得最新的新闻,因为它是免费的,易于访问,快速。然而,并不是所有社交媒体上的新闻都是可靠的,一些假新闻的传播误导了读者。假新闻可以传播信息,使人们相信不真实的事情。在自然语言处理中,在使用TF-IDF和Count Vectorizer将数据转换为n -gram之前,会进行文本处理,如正则表达式、删除停止词和词序化。因此,本文旨在回顾使用朴素贝叶斯算法的假新闻检测。结果表明,使用n-gram的朴素贝叶斯方法可以略微提高TF-IDF和计数矢量器的准确率。它证明了TF-IDF矢量器可以更好地检测假新闻,因为它具有高达94%的精度,而计数矢量器可以在相当平衡的情况下检测假新闻和真实新闻。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fake News Detection using Naive Bayes
The issue of fake news arises every year. Moreover, the enhancement and evolution of technologies enable the news to be manipulated by irresponsible people. However, it is not deniable that somehow this technology impacts our daily life. Nowadays, people get the latest news through the social media platforms as it is free, easy to access, and fast. However, not all the news on social media is reliable, and some fake news are spread to mislead the readers. Fake news can disseminate information to confuse people to believe things that are not true. In Natural Language Processing, text processing such as regular expression, removing the stop words and lemmatization are done before the data is being transformed into N-grams using TF-IDF and Count Vectorizer. Therefore, this paper aimed to review the fake news detection using the Naive Bayes algorithms. Results shows that Naive Bayes with n-gram gives a slight increase in the accuracy of TF-IDF and Count Vectorizer. It proves that TF-IDF Vectorizer can detect fake news better as it has higher precision of 94 % whereas Count Vectorizer can detect both fake news and real news in quite a balance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信