Suhaib Khater, Oraib Al-Sahlee, Daoud M. Daoud, M. El-Seoud
{"title":"Clickbait Detection","authors":"Suhaib Khater, Oraib Al-Sahlee, Daoud M. Daoud, M. El-Seoud","doi":"10.1145/3220267.3220287","DOIUrl":null,"url":null,"abstract":"Clickbait is a term that describes deceiving web content that uses ambiguity to provoke the user into clicking a link. It aims to increase the number of online readers in order to generate more advertising revenue. Clickbaits are heavily present on social media platforms wasting the time of users. We used supervised machine learning to create a model trained on 24 features extracted from a dataset of social media posts to classify the posts into two classes. This method achieved an F1-score of %79 and area under ROC curve of 0.7. The method used highlights the importance of using features extracted from different elements of a social media posts along with the traditional features extracted from the title and the article. In this research, we prove that it is possible to identify clickbaits using all parts of the post while having minimum number of features possible.","PeriodicalId":177522,"journal":{"name":"International Conference on Software and Information Engineering","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"51","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Software and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3220267.3220287","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 51
Abstract
Clickbait is a term that describes deceiving web content that uses ambiguity to provoke the user into clicking a link. It aims to increase the number of online readers in order to generate more advertising revenue. Clickbaits are heavily present on social media platforms wasting the time of users. We used supervised machine learning to create a model trained on 24 features extracted from a dataset of social media posts to classify the posts into two classes. This method achieved an F1-score of %79 and area under ROC curve of 0.7. The method used highlights the importance of using features extracted from different elements of a social media posts along with the traditional features extracted from the title and the article. In this research, we prove that it is possible to identify clickbaits using all parts of the post while having minimum number of features possible.