一种基于回顾阅读的在线社交网络中文讽刺评论检测模型

IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS
Lei Zhang;Xiaoming Zhao;Qinfeng Mao;Yanbing Yang;Dong Li;Haizhou Wang
{"title":"一种基于回顾阅读的在线社交网络中文讽刺评论检测模型","authors":"Lei Zhang;Xiaoming Zhao;Qinfeng Mao;Yanbing Yang;Dong Li;Haizhou Wang","doi":"10.1109/TCSS.2024.3470317","DOIUrl":null,"url":null,"abstract":"Through the use of sarcastic sentences on social media, people can express their strong emotions. Therefore, the detection of sarcasm in social media has received more and more attention over the past years. Classifying a sentence as sarcastic or nonsarcastic heavily relies on the contextual information of the sentence. However, only focusing on the features of target text is the main solution of most existing research. Moreover, the scale of publicly available Chinese sarcasm dataset is very small and does not contain the contextual information. To address the issues mentioned above, we build a Chinese sarcasm dataset from Bilibili, which is one of the most widely used social network platforms in China and has a significant number of sarcastic comments and contextual information. As far as we know, our dataset is the first publicly available large-scale Chinese sarcasm dataset including contextual information. Additionally, we have proposed a novel retrospective reading method for detecting sarcasm that leverages contextual information to improve model's performance. The experimental results show the effectiveness of the proposed model and the significance of contextual information for Chinese sarcasm detection: achieving the highest F-score of 0.6942, outperforming existing state-of-the-art (SOTA) approaches. The study presented in this article offers approaches and ideas for future Chinese sarcasm detection studies.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 2","pages":"792-806"},"PeriodicalIF":4.5000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Retrospective-Reading Model for Detecting Chinese Sarcasm Comments of Online Social Network\",\"authors\":\"Lei Zhang;Xiaoming Zhao;Qinfeng Mao;Yanbing Yang;Dong Li;Haizhou Wang\",\"doi\":\"10.1109/TCSS.2024.3470317\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Through the use of sarcastic sentences on social media, people can express their strong emotions. Therefore, the detection of sarcasm in social media has received more and more attention over the past years. Classifying a sentence as sarcastic or nonsarcastic heavily relies on the contextual information of the sentence. However, only focusing on the features of target text is the main solution of most existing research. Moreover, the scale of publicly available Chinese sarcasm dataset is very small and does not contain the contextual information. To address the issues mentioned above, we build a Chinese sarcasm dataset from Bilibili, which is one of the most widely used social network platforms in China and has a significant number of sarcastic comments and contextual information. As far as we know, our dataset is the first publicly available large-scale Chinese sarcasm dataset including contextual information. Additionally, we have proposed a novel retrospective reading method for detecting sarcasm that leverages contextual information to improve model's performance. The experimental results show the effectiveness of the proposed model and the significance of contextual information for Chinese sarcasm detection: achieving the highest F-score of 0.6942, outperforming existing state-of-the-art (SOTA) approaches. The study presented in this article offers approaches and ideas for future Chinese sarcasm detection studies.\",\"PeriodicalId\":13044,\"journal\":{\"name\":\"IEEE Transactions on Computational Social Systems\",\"volume\":\"12 2\",\"pages\":\"792-806\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2024-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Computational Social Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10721207/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, CYBERNETICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10721207/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
引用次数: 0

摘要

通过在社交媒体上使用讽刺句子,人们可以表达自己强烈的情绪。因此,近年来,社交媒体中讽刺语的检测受到越来越多的关注。将一个句子划分为讽刺或非讽刺在很大程度上依赖于句子的上下文信息。然而,大多数现有研究的主要解决方案是只关注目标文本的特征。此外,公开的中文讽刺数据集规模很小,不包含上下文信息。为了解决上述问题,我们从Bilibili建立了一个中文讽刺数据集,Bilibili是中国使用最广泛的社交网络平台之一,拥有大量的讽刺评论和上下文信息。据我们所知,我们的数据集是第一个公开可用的包含上下文信息的大规模中文讽刺数据集。此外,我们提出了一种新的回顾性阅读方法来检测讽刺,利用上下文信息来提高模型的性能。实验结果表明了该模型的有效性和上下文信息对中文讽刺检测的重要性:达到了最高的f值0.6942,优于现有的最先进的(SOTA)方法。本文的研究为今后的汉语讽刺语检测研究提供了方法和思路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Novel Retrospective-Reading Model for Detecting Chinese Sarcasm Comments of Online Social Network
Through the use of sarcastic sentences on social media, people can express their strong emotions. Therefore, the detection of sarcasm in social media has received more and more attention over the past years. Classifying a sentence as sarcastic or nonsarcastic heavily relies on the contextual information of the sentence. However, only focusing on the features of target text is the main solution of most existing research. Moreover, the scale of publicly available Chinese sarcasm dataset is very small and does not contain the contextual information. To address the issues mentioned above, we build a Chinese sarcasm dataset from Bilibili, which is one of the most widely used social network platforms in China and has a significant number of sarcastic comments and contextual information. As far as we know, our dataset is the first publicly available large-scale Chinese sarcasm dataset including contextual information. Additionally, we have proposed a novel retrospective reading method for detecting sarcasm that leverages contextual information to improve model's performance. The experimental results show the effectiveness of the proposed model and the significance of contextual information for Chinese sarcasm detection: achieving the highest F-score of 0.6942, outperforming existing state-of-the-art (SOTA) approaches. The study presented in this article offers approaches and ideas for future Chinese sarcasm detection studies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Computational Social Systems
IEEE Transactions on Computational Social Systems Social Sciences-Social Sciences (miscellaneous)
CiteScore
10.00
自引率
20.00%
发文量
316
期刊介绍: IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信