{"title":"Prediction of Twitter Message Deletion","authors":"A. Gazizullina, M. Mazzara","doi":"10.1109/DeSE.2019.00031","DOIUrl":null,"url":null,"abstract":"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"47 1","pages":"117-122"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.