Content Based Offline Fake News Detection using Classification Technique

Meenu Gupta, Rakesh Kumar, Geet Pradhan, Dheeraj Kumawat
{"title":"Content Based Offline Fake News Detection using Classification Technique","authors":"Meenu Gupta, Rakesh Kumar, Geet Pradhan, Dheeraj Kumawat","doi":"10.1109/ICSES52305.2021.9633857","DOIUrl":null,"url":null,"abstract":"The phrase post-reality coined with the aid of using the dictionary of Oxford word in the Year 2016. The adjective name, referring to the describing conditions of which goal information have little impact on reframing public opinion instead of being attractive to non-public emotions and beliefs. This ends in incorrect information and social problems. Therefore, it's far essential to take the time to locate this information and save you them from spreading. In this paper, astrategyis used for device mastering, particularly surveyed reading, to reap fake information. Specifically, this work used a database of non-fiction tales to educate the device mastering version, the use of the Scikit-study which is a library in Python. Records were extracted by us from the database the use of textual content illustration fashions together with a bag of words, the term frequency Inverse document frequency, and the bi diagram frequency. After which we tested strategies of type, particularly the feasible type and the linear department of the name and content material, searching at whether it changed into a typical/no-click on feed, in a fake / real sequence. The end result of our take a look at is that line segregation works high-quality with the TF-IDF version withinside the content material segmentation process. The Bi-gram frequency version furnished an awful lot of decrease accuracy of theme separation as compared to the term bag of words and TF-IDF.","PeriodicalId":6777,"journal":{"name":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","volume":"7 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSES52305.2021.9633857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The phrase post-reality coined with the aid of using the dictionary of Oxford word in the Year 2016. The adjective name, referring to the describing conditions of which goal information have little impact on reframing public opinion instead of being attractive to non-public emotions and beliefs. This ends in incorrect information and social problems. Therefore, it's far essential to take the time to locate this information and save you them from spreading. In this paper, astrategyis used for device mastering, particularly surveyed reading, to reap fake information. Specifically, this work used a database of non-fiction tales to educate the device mastering version, the use of the Scikit-study which is a library in Python. Records were extracted by us from the database the use of textual content illustration fashions together with a bag of words, the term frequency Inverse document frequency, and the bi diagram frequency. After which we tested strategies of type, particularly the feasible type and the linear department of the name and content material, searching at whether it changed into a typical/no-click on feed, in a fake / real sequence. The end result of our take a look at is that line segregation works high-quality with the TF-IDF version withinside the content material segmentation process. The Bi-gram frequency version furnished an awful lot of decrease accuracy of theme separation as compared to the term bag of words and TF-IDF.
基于内容的离线假新闻分类检测
后现实(post-reality)一词是在2016年牛津词汇词典的帮助下创造的。形容词名称,指目标信息对重构公众舆论影响不大,而不是对非公众情绪和信念具有吸引力的描述条件。这最终导致了不正确的信息和社会问题。因此,花时间找到这些信息并防止它们传播是非常必要的。在本文中,我们使用了一种策略来掌握设备,特别是调查阅读,以收获虚假信息。具体来说,这项工作使用了一个非小说故事数据库来教育设备掌握版本,使用Scikit-study,这是一个Python库。我们利用文本内容说明的方式,结合词包、术语频率、逆文档频率和双图频率,从数据库中提取记录。之后,我们测试了类型策略,特别是可行类型和名称和内容材料的线性部门,搜索它是否按照假/真顺序变成了典型/无点击提要。我们研究的最终结果是,在内容材料分割过程中,行分离与TF-IDF版本一起高质量地工作。与词汇包和TF-IDF相比,双谱频率版本的主题分离精度降低了很多。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信