推特消息删除预测

2019 12th International Conference on Developments in eSystems Engineering (DeSE) Pub Date : 2019-10-01 DOI:10.1109/DeSE.2019.00031

A. Gazizullina, M. Mazzara

{"title":"推特消息删除预测","authors":"A. Gazizullina, M. Mazzara","doi":"10.1109/DeSE.2019.00031","DOIUrl":null,"url":null,"abstract":"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"47 1","pages":"117-122"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Prediction of Twitter Message Deletion\",\"authors\":\"A. Gazizullina, M. Mazzara\",\"doi\":\"10.1109/DeSE.2019.00031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.\",\"PeriodicalId\":6632,\"journal\":{\"name\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"volume\":\"47 1\",\"pages\":\"117-122\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DeSE.2019.00031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

社交媒体是人们建立声誉或推广想法的一种方式。与其他社交媒体来源不同，Twitter是实时文本信息的生成器，主要用于分享想法、观点和突发新闻。它意味着简短、快速、引人注目的陈述，可以触及世界各地的数百万用户。发布不适当的内容可能会影响公众形象，名人，政治家以及普通Twitter用户的隐私。如果我们能够提前提醒用户即将发布的消息中存在潜在的漏洞，我们就可以保护他/她的身份不被泄露。因此，自动识别消息的内容，使其在未来被删除是一个很有前途的研究领域。在本文中，我们正在分析英语Twitter消息，目的是建立一个分类器来预测特定的帖子是否会被用户删除。我们使用循环神经网络(RNN)模型，该模型在进行分类时依赖于推文的基于上下文的信息。这项工作的另一个贡献是构建了一组丰富的功能，包括twitter元数据、用户信息和tweet文本，以训练基于twitter数据的经典机器学习算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Prediction of Twitter Message Deletion

Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 12th International Conference on Developments in eSystems Engineering (DeSE)

自引率

0.00%

发文量