Analyzing and Predicting Emoji Usages in Social Media

Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI:10.1145/3184558.3186344

Peijun Zhao, Jia Jia, Yongsheng An, Jie Liang, Lexing Xie, Jiebo Luo

{"title":"Analyzing and Predicting Emoji Usages in Social Media","authors":"Peijun Zhao, Jia Jia, Yongsheng An, Jie Liang, Lexing Xie, Jiebo Luo","doi":"10.1145/3184558.3186344","DOIUrl":null,"url":null,"abstract":"Emojis can be regarded as a language for graphical expression of emotions, and have been widely used in social media. They can express more delicate feelings beyond textual information and improve the effectiveness of computer-mediated communication. Recent advances in machine learning make it possible to automatic compose text messages with emojis. However, the usages of emojis can be complicated and subtle so that analyzing and predicting emojis is a challenging problem. In this paper, we first construct a benchmark dataset of emojis with tweets and systematically investigate emoji usages in terms of tweet content, tweet structure and user demographics. Inspired by the investigation results, we further propose a multitask multimodality gated recurrent unit (mmGRU) model to predict the categories and positions of emojis. The model leverages not only multimodality information such as text, image and user demographics, but also the strong correlations between emoji categories and their positions. Our experimental results show that the proposed method can significantly improve the accuracy for predicting emojis for tweets (+9.0% in F1-value for category and +4.6% in F1-value for position). Based on the experimental results, we further conduct a series of case studies to unveil how emojis are used in social media.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion Proceedings of the The Web Conference 2018","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3184558.3186344","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

Abstract

Emojis can be regarded as a language for graphical expression of emotions, and have been widely used in social media. They can express more delicate feelings beyond textual information and improve the effectiveness of computer-mediated communication. Recent advances in machine learning make it possible to automatic compose text messages with emojis. However, the usages of emojis can be complicated and subtle so that analyzing and predicting emojis is a challenging problem. In this paper, we first construct a benchmark dataset of emojis with tweets and systematically investigate emoji usages in terms of tweet content, tweet structure and user demographics. Inspired by the investigation results, we further propose a multitask multimodality gated recurrent unit (mmGRU) model to predict the categories and positions of emojis. The model leverages not only multimodality information such as text, image and user demographics, but also the strong correlations between emoji categories and their positions. Our experimental results show that the proposed method can significantly improve the accuracy for predicting emojis for tweets (+9.0% in F1-value for category and +4.6% in F1-value for position). Based on the experimental results, we further conduct a series of case studies to unveil how emojis are used in social media.

查看原文本刊更多论文

分析和预测社交媒体中表情符号的使用

表情符号是一种图形化的情感表达语言，在社交媒体中得到了广泛的应用。它们可以表达文本信息之外的更细腻的情感，提高计算机媒介交流的有效性。机器学习的最新进展使用表情符号自动编写文本信息成为可能。然而，表情符号的用法可能是复杂而微妙的，因此分析和预测表情符号是一个具有挑战性的问题。本文首先构建了包含推文的表情符号基准数据集，并从推文内容、推文结构和用户人口统计等方面系统地研究了表情符号的使用情况。受调查结果的启发，我们进一步提出了一个多任务多模态门控循环单元(mmGRU)模型来预测表情符号的类别和位置。该模型不仅利用了文本、图像和用户人口统计等多模态信息，还利用了表情符号类别及其位置之间的强相关性。我们的实验结果表明，该方法可以显著提高推文表情符号预测的准确率(类别F1-value +9.0%，位置F1-value +4.6%)。在实验结果的基础上，我们进一步进行了一系列的案例研究，以揭示表情符号在社交媒体中的使用方式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Companion Proceedings of the The Web Conference 2018

自引率

0.00%

发文量