推特消息删除预测

A. Gazizullina, M. Mazzara
{"title":"推特消息删除预测","authors":"A. Gazizullina, M. Mazzara","doi":"10.1109/DeSE.2019.00031","DOIUrl":null,"url":null,"abstract":"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.","PeriodicalId":6632,"journal":{"name":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","volume":"47 1","pages":"117-122"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Prediction of Twitter Message Deletion\",\"authors\":\"A. Gazizullina, M. Mazzara\",\"doi\":\"10.1109/DeSE.2019.00031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.\",\"PeriodicalId\":6632,\"journal\":{\"name\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"volume\":\"47 1\",\"pages\":\"117-122\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 12th International Conference on Developments in eSystems Engineering (DeSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DeSE.2019.00031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE.2019.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

社交媒体是人们建立声誉或推广想法的一种方式。与其他社交媒体来源不同,Twitter是实时文本信息的生成器,主要用于分享想法、观点和突发新闻。它意味着简短、快速、引人注目的陈述,可以触及世界各地的数百万用户。发布不适当的内容可能会影响公众形象,名人,政治家以及普通Twitter用户的隐私。如果我们能够提前提醒用户即将发布的消息中存在潜在的漏洞,我们就可以保护他/她的身份不被泄露。因此,自动识别消息的内容,使其在未来被删除是一个很有前途的研究领域。在本文中,我们正在分析英语Twitter消息,目的是建立一个分类器来预测特定的帖子是否会被用户删除。我们使用循环神经网络(RNN)模型,该模型在进行分类时依赖于推文的基于上下文的信息。这项工作的另一个贡献是构建了一组丰富的功能,包括twitter元数据、用户信息和tweet文本,以训练基于twitter数据的经典机器学习算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Prediction of Twitter Message Deletion
Social media are a way for people to build their reputation or to promote an idea. Twitter, in contrast with other social media sources, is a generator of real-time textual information, and it is mainly used to share ideas, opinions and breaking news. It is meant for short, quick, compelling statements that reach out millions of users around the world. Posting something inappropriate may affect the public image, privacy of celebrities, politicians as well as ordinary Twitter users. If we could in advance alarm the user of the potential vulnerability in the message to be posted we could protect his/her identity from being compromised. So, automatic identification of the message with the content causing it to be deleted in the future is a promising area of research. In this paper, we are analyzing Twitter messages in English language with the objective to build a classifier to predict whether a particular post will be deleted by the user or not. We apply the Recurrent Neural Networks (RNN) model that relies on the context-based information of tweets while doing the classification. An additional contribution of the work is the construction of a rich set of features including twitter metadata, user information and tweets’ text to train classical machine learning algorithms on Twitter data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信