{"title":"Content Based Fake News Detection Using N-Gram Models","authors":"Hnin Ei Wynne, Zar Zar Wint","doi":"10.1145/3366030.3366116","DOIUrl":null,"url":null,"abstract":"Fake news is very popular these days because of the increasing popularity of social media. Detecting fake news is considered as one of the most dangerous types of deception because it is created with dishonest intention to misdirect the public. Many researchers proposed fake news detection systems considering many approaches; content, social-context, and propagation. When the news is detected fake or real, there is a limitation in the accuracy and understandability of language. In this paper, we propose the fake news detection system that considers the content of the online news articles. We investigate two machine learning algorithms with the use of word n-grams and character n-grams analysis. Experiments yield better results using character n-grams with Term-Frequency-Inverted Document Frequency (TF-IDF) and Gradient Boosting Classifier achieves an accuracy of 96%.","PeriodicalId":446280,"journal":{"name":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3366030.3366116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
Fake news is very popular these days because of the increasing popularity of social media. Detecting fake news is considered as one of the most dangerous types of deception because it is created with dishonest intention to misdirect the public. Many researchers proposed fake news detection systems considering many approaches; content, social-context, and propagation. When the news is detected fake or real, there is a limitation in the accuracy and understandability of language. In this paper, we propose the fake news detection system that considers the content of the online news articles. We investigate two machine learning algorithms with the use of word n-grams and character n-grams analysis. Experiments yield better results using character n-grams with Term-Frequency-Inverted Document Frequency (TF-IDF) and Gradient Boosting Classifier achieves an accuracy of 96%.