Understanding the Sarcastic Nature of Emojis with SarcOji

Vandita Grover, H. Banati
Proceedings of the Fifth International Workshop on Emoji Understanding and Applications in Social Media
DOI: 10.18653/v1/2022.emoji-1.4 (https://doi.org/10.18653/v1/2022.emoji-1.4)

Abstract

Identifying sarcasm is a challenging research problem owing to its highly contextual nature. Researchers have tried numerous mechanisms to determine sarcasm, incorporating context, linguistic aspects, and supervised and semi-supervised techniques. It has also been noted that emojis in a text may hold key indicators of sarcasm. However, sarcasm datasets that contain emojis are scarce, which makes it challenging to study the sarcastic nature of emojis effectively. In this work, we present SarcOji, which has been compiled from five publicly available sarcasm datasets. SarcOji contains labeled English texts, all of which include emojis. We analyze SarcOji to determine whether there is an incongruence between the polarity of the text and the polarity of the emojis used in it. We also study the usage, occurrences, and positions of emojis in the context of sarcasm. With SarcOji, we demonstrate that the frequency of occurrence of an emoji and its position are strong indicators of sarcasm. The SarcOji dataset is now publicly available with several derived features, such as sentiment scores of the text and emojis, the most frequent emoji, and its position in the text. Compiling the SarcOji dataset is an initial step toward enabling the study of the role of emojis in communicating sarcasm. The dataset can also serve as a go-to resource for various emoji-based sarcasm detection techniques.
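As a rough illustration of two of the derived features mentioned above (the most frequent emoji and its position in the text), the following minimal Python sketch shows one way such features could be computed. It is not the authors' implementation: the function name `emoji_features`, the simplified emoji pattern `EMOJI_RE`, and the three-way start/middle/end position labels are all assumptions made for this example.

```python
import re
from collections import Counter

# Simplified emoji matcher (an assumption for this sketch): it covers common
# Unicode emoji blocks only and does not handle ZWJ sequences or modifiers.
EMOJI_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def emoji_features(text):
    """Return the most frequent emoji, its count, and a coarse position label.

    Returns None when the text contains no character matched by EMOJI_RE.
    The start/middle/end split into thirds is a hypothetical choice.
    """
    matches = list(EMOJI_RE.finditer(text))
    if not matches:
        return None

    # Count each emoji; ties resolve to the emoji encountered first.
    counts = Counter(m.group() for m in matches)
    top, freq = counts.most_common(1)[0]

    # Relative position of the first occurrence of the most frequent emoji.
    first = next(m.start() for m in matches if m.group() == top)
    rel = first / max(len(text) - 1, 1)

    if rel < 1 / 3:
        position = "start"
    elif rel < 2 / 3:
        position = "middle"
    else:
        position = "end"

    return {"emoji": top, "count": freq, "position": position}


if __name__ == "__main__":
    print(emoji_features("🙄 Oh great, Monday again 🙄 😂"))
```

A production version would likely rely on a maintained emoji inventory rather than hand-written code point ranges, since the ranges above miss some emojis and match some non-emoji symbols.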