{"title":"推特消息删除预测","authors":"A. Gazizullina, M. Mazzara","doi":"10.1109/DeSE.2019.00031","DOIUrl":null,"url":null,"abstract":"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"47 1","pages":"117-122"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Prediction of Twitter Message Deletion\",\"authors\":\"A. Gazizullina, M. Mazzara\",\"doi\":\"10.1109/DeSE.2019.00031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.\",\"PeriodicalId\":6632,\"journal\":{\"name\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"volume\":\"47 1\",\"pages\":\"117-122\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DeSE.2019.00031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.