WELFAKE – WORD EMBEDDING OVER LINGUISTIC FEATURES FOR FAKE NEWS DETECTION
Barath M, Sangeethkumar C, Naveen N, Karthickram S, Partha Sarathi P
International Journal of Engineering Technology and Management Sciences, published 2022-11-28. DOI: 10.46647/ijetms.2022.v06i06.080
Abstract
News is a primary source of information that helps the public know what is happening around the world each day. As news reading has moved to digital platforms, a large amount of "fake news" has come into circulation. Fake news is false or misleading information presented as news. It often aims to damage the reputation of a person or entity, or to make money through advertising revenue.
People often accept fake news as genuine without any analysis or verification. Because a machine cannot interpret raw text directly, we train a machine learning (ML) model on our dataset. Our project is a two-phase benchmark model named WELFake, based on word embedding: every word is converted into a numerical value, and these values are then processed by machine learning to classify news according to matching properties. The first phase preprocesses the dataset and validates the veracity of the news content using linguistic features. The second phase merges the linguistic feature sets with word embedding (WE) and applies voting classification. Classification is based on word and meaning matching, and the matching percentage must exceed a threshold value that we fix. In this paper, we discuss how to choose the best algorithm based on our requirements and on accuracy so that the task is completed successfully.
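The two-phase flow described in the abstract can be illustrated with a minimal, standard-library-only sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the three surface features, the normalised bag-of-words vector standing in for a trained word embedding, the nearest-centroid voters, the reading of the "threshold" as a minimum share of votes, and the four-sentence toy corpus.

```python
import math

# Toy labelled corpus (hypothetical examples, NOT the WELFake dataset). 1 = fake, 0 = real.
TRAIN = [
    ("SHOCKING!!! Miracle cure doctors hate, click now!!!", 1),
    ("You WON'T believe this secret trick, share NOW!!!", 1),
    ("The city council approved the annual budget on Tuesday.", 0),
    ("Researchers published the survey results in a peer-reviewed journal.", 0),
]

def tokenize(text):
    """Lowercase, split on whitespace, and trim surrounding punctuation."""
    tokens = (w.lower().strip(".,!?'") for w in text.split())
    return [t for t in tokens if t]

def linguistic_features(text):
    """Phase 1 stand-in: a few hand-picked surface features (assumed, not the paper's set)."""
    words = text.split()
    letters = [c for c in text if c.isalpha()]
    return [
        text.count("!") / max(len(words), 1),                      # exclamation marks per word
        sum(c.isupper() for c in letters) / max(len(letters), 1),  # share of uppercase letters
        sum(len(w) for w in words) / max(len(words), 1) / 10.0,    # scaled mean word length
    ]

def build_vocab(texts):
    """Map every distinct training token to a vector index."""
    tokens = sorted({t for text in texts for t in tokenize(text)})
    return {t: i for i, t in enumerate(tokens)}

def word_vector(text, vocab):
    """Phase 2 stand-in: L2-normalised bag of words instead of a trained embedding."""
    vec = [0.0] * len(vocab)
    for t in tokenize(text):
        if t in vocab:
            vec[vocab[t]] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def centroid(rows):
    """Component-wise mean of equally long feature rows."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def distance(a, b):
    """Euclidean distance between two equally long vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

class NearestCentroid:
    """Very small classifier: predict the label whose mean feature vector is closest."""

    def __init__(self, featurize):
        self.featurize = featurize
        self.centroids = {}

    def fit(self, data):
        for label in {y for _, y in data}:
            rows = [self.featurize(text) for text, y in data if y == label]
            self.centroids[label] = centroid(rows)
        return self

    def predict(self, text):
        x = self.featurize(text)
        return min(self.centroids, key=lambda label: distance(x, self.centroids[label]))

def build_ensemble(data):
    """Train one voter per feature view: linguistic, word vector, and the merged set."""
    vocab = build_vocab([text for text, _ in data])
    views = [
        linguistic_features,
        lambda t: word_vector(t, vocab),
        lambda t: linguistic_features(t) + word_vector(t, vocab),  # merged feature set
    ]
    return [NearestCentroid(view).fit(data) for view in views]

def vote(classifiers, text, threshold=0.5):
    """Label the text fake (1) only if the share of 'fake' votes exceeds the threshold."""
    fake_share = sum(c.predict(text) for c in classifiers) / len(classifiers)
    return int(fake_share > threshold), fake_share

classifiers = build_ensemble(TRAIN)
label, share = vote(classifiers, "BREAKING!!! Secret trick they hide, click NOW!!!")
print(label, share)
```

In a realistic setting each stand-in would be swapped for a stronger component, for example a trained embedding model and conventional classifiers; the merge-then-vote structure is the part this sketch is meant to show.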