Xin Guan , Jiuxin Cao , Hui Zhang , Biwei Cao , Bo Liu
{"title":"MIAN: Multi-head Incongruity Aware Attention Network with transfer learning for sarcasm detection","authors":"Xin Guan , Jiuxin Cao , Hui Zhang , Biwei Cao , Bo Liu","doi":"10.1016/j.eswa.2024.125702","DOIUrl":null,"url":null,"abstract":"<div><div>Sarcasm is a common rhetorical metaphor in social media platforms, that individuals express emotion contrary to the literal meaning. Capturing the incongruity in the texts is the critical factor in sarcasm detection. Although several studies have looked at the incongruity of a single text, there is currently a lack of studies on modeling the incongruity of contextual information. Inspired by <em>Multi-Head Attention</em> mechanism from Transformer, we propose a <em>Multi-head Incongruity Aware Attention Network</em>, which concentrates on both target semantic incongruity and contextual semantic incongruity. Specifically, we design a multi-head self-match network to capture target semantic incongruity in a single text. Moreover, a multi-head co-match network is applied to model the contextual semantic incongruity. Furthermore, due to the scarcity of sarcasm data and considering the correlation between tasks of sentiment analysis and sarcasm detection, we pre-train the language model with a great amount of sentiment analysis data, which enhances its ability to capture sentimental features in the text. The experimental results demonstrate that our model achieves state-of-the-art performance on four benchmark datasets, with an accuracy gain of 3.8% on Tweets Ghost, 1.1% on SARC Pol, 1.9% on Ciron and an F1-Score gain of 0.3% on FigLang Twitter.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"263 ","pages":"Article 125702"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424025697","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Sarcasm is a common rhetorical metaphor in social media platforms, that individuals express emotion contrary to the literal meaning. Capturing the incongruity in the texts is the critical factor in sarcasm detection. Although several studies have looked at the incongruity of a single text, there is currently a lack of studies on modeling the incongruity of contextual information. Inspired by Multi-Head Attention mechanism from Transformer, we propose a Multi-head Incongruity Aware Attention Network, which concentrates on both target semantic incongruity and contextual semantic incongruity. Specifically, we design a multi-head self-match network to capture target semantic incongruity in a single text. Moreover, a multi-head co-match network is applied to model the contextual semantic incongruity. Furthermore, due to the scarcity of sarcasm data and considering the correlation between tasks of sentiment analysis and sarcasm detection, we pre-train the language model with a great amount of sentiment analysis data, which enhances its ability to capture sentimental features in the text. The experimental results demonstrate that our model achieves state-of-the-art performance on four benchmark datasets, with an accuracy gain of 3.8% on Tweets Ghost, 1.1% on SARC Pol, 1.9% on Ciron and an F1-Score gain of 0.3% on FigLang Twitter.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.