MuLX-QA: Classifying Multi-Labels and Extracting Rationale Spans in Social Media Posts

IF 4.1 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web Pub Date : 2024-03-21 DOI:10.1145/3653303

Soham Poddar, Rajdeep Mukherjee, Azlaan Mustafa Samad, Niloy Ganguly, Saptarshi Ghosh

{"title":"MuLX-QA: Classifying Multi-Labels and Extracting Rationale Spans in Social Media Posts","authors":"Soham Poddar, Rajdeep Mukherjee, Azlaan Mustafa Samad, Niloy Ganguly, Saptarshi Ghosh","doi":"10.1145/3653303","DOIUrl":null,"url":null,"abstract":"While social media platforms play an important role in our daily lives in obtaining the latest news and trends from across the globe, they are known to be prone to widespread proliferation of harmful information in different forms leading to misconceptions among the masses. Accordingly, several prior works have attempted to tag social media posts with labels/classes reflecting their veracity, sentiments, hate content, etc. However, in order to have a convincing impact, it is important to additionally extract the post snippets on which the labelling decision is based. We call such a post snippet as the ‘rationale’. These rationales significantly improve human trust and debuggability of the predictions, especially when detecting misinformation or stigmas from social media posts. These rationale spans or snippets are also helpful in post-classification social analysis, such as for finding out the target communities in hate-speech, or for understanding the arguments or concerns against the intake of vaccines. Also it is observed that a post may express multiple notions of misinformation, hate, sentiment, etc. Thus, the task of determining (one or multiple) labels for a given piece of text, along with the text snippets explaining the rationale behind each of the identified labels is a challenging multi-label, multi-rationale classification task, which is still nascent in the literature. While transformer-based encoder-decoder generative models such as BART and T5 are well-suited for the task, in this work we show how a relatively simpler encoder-only discriminative question-answering (QA) model can be effectively trained using simple template-based questions to accomplish the task. We thus propose MuLX-QA and demonstrate its utility in producing (label, rationale span) pairs in two different settings: multi-class (on the HateXplain dataset related to hate speech on social media), and multi-label (on the CAVES dataset related to COVID-19 anti-vaccine concerns). MuLX-QA outperforms heavier generative models in both settings. We also demonstrate the relative advantage of our proposed model MuLX-QA over strong baselines when trained with limited data. We perform several ablation studies, and experiments to better understand the effect of training MuLX-QA with different question prompts, and draw interesting inferences. Additionally, we show that MuLX-QA is effective on social media posts in resource-poor non-English languages as well. Finally, we perform a qualitative analysis of our model predictions and compare them with those of our strongest baseline.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"20 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on the Web","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3653303","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

While social media platforms play an important role in our daily lives in obtaining the latest news and trends from across the globe, they are known to be prone to widespread proliferation of harmful information in different forms leading to misconceptions among the masses. Accordingly, several prior works have attempted to tag social media posts with labels/classes reflecting their veracity, sentiments, hate content, etc. However, in order to have a convincing impact, it is important to additionally extract the post snippets on which the labelling decision is based. We call such a post snippet as the ‘rationale’. These rationales significantly improve human trust and debuggability of the predictions, especially when detecting misinformation or stigmas from social media posts. These rationale spans or snippets are also helpful in post-classification social analysis, such as for finding out the target communities in hate-speech, or for understanding the arguments or concerns against the intake of vaccines. Also it is observed that a post may express multiple notions of misinformation, hate, sentiment, etc. Thus, the task of determining (one or multiple) labels for a given piece of text, along with the text snippets explaining the rationale behind each of the identified labels is a challenging multi-label, multi-rationale classification task, which is still nascent in the literature.

While transformer-based encoder-decoder generative models such as BART and T5 are well-suited for the task, in this work we show how a relatively simpler encoder-only discriminative question-answering (QA) model can be effectively trained using simple template-based questions to accomplish the task. We thus propose MuLX-QA and demonstrate its utility in producing (label, rationale span) pairs in two different settings: multi-class (on the HateXplain dataset related to hate speech on social media), and multi-label (on the CAVES dataset related to COVID-19 anti-vaccine concerns). MuLX-QA outperforms heavier generative models in both settings. We also demonstrate the relative advantage of our proposed model MuLX-QA over strong baselines when trained with limited data. We perform several ablation studies, and experiments to better understand the effect of training MuLX-QA with different question prompts, and draw interesting inferences. Additionally, we show that MuLX-QA is effective on social media posts in resource-poor non-English languages as well. Finally, we perform a qualitative analysis of our model predictions and compare them with those of our strongest baseline.

查看原文本刊更多论文

MuLX-QA：对社交媒体帖子中的多标签进行分类并提取理由跨度

虽然社交媒体平台在我们获取全球最新新闻和趋势的日常生活中发挥着重要作用，但众所周知，这些平台容易以不同形式广泛传播有害信息，导致大众产生误解。因此，以前的一些著作试图给社交媒体帖子贴标签/分类，以反映其真实性、情感、仇恨内容等。然而，为了产生令人信服的效果，还必须额外提取帖子片段，并据此做出贴标签的决定。我们称这样的帖子片段为 "理由"。这些理由大大提高了人类对预测的信任度和可调试性，尤其是在检测社交媒体帖子中的错误信息或污名时。这些理由跨度或片段还有助于分类后的社会分析，例如找出仇恨言论的目标群体，或了解反对接种疫苗的论点或担忧。此外，我们还发现，一个帖子可能表达了错误信息、仇恨、情绪等多种概念。因此，为给定文本确定（一个或多个）标签以及解释每个已识别标签背后原理的文本片段，是一项具有挑战性的多标签、多原理分类任务，目前在文献中尚属新生事物。虽然基于变换器的编码器-解码器生成模型（如 BART 和 T5）非常适合这项任务，但在这项工作中，我们展示了如何使用基于模板的简单问题有效地训练相对简单的纯编码器判别式问答（QA）模型来完成这项任务。因此，我们提出了 MuLX-QA，并展示了它在两种不同环境下生成（标签、理由跨度）对的实用性：多类（在与社交媒体上的仇恨言论有关的 HateXplain 数据集上）和多标签（在与 COVID-19 反疫苗问题有关的 CAVES 数据集上）。在这两种情况下，MuLX-QA 都优于较重的生成模型。我们还展示了我们提出的模型 MuLX-QA 在使用有限数据进行训练时相对于强基线的优势。我们进行了多项消融研究和实验，以更好地了解用不同问题提示训练 MuLX-QA 的效果，并得出了有趣的推论。此外，我们还表明，MuLX-QA 对资源贫乏的非英语语言社交媒体帖子也很有效。最后，我们对模型预测进行了定性分析，并将其与最强基线进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on the Web 工程技术-计算机：软件工程

CiteScore

4.90

自引率

0.00%

发文量

审稿时长

7.5 months

期刊介绍： Transactions on the Web (TWEB) is a journal publishing refereed articles reporting the results of research on Web content, applications, use, and related enabling technologies. Topics in the scope of TWEB include but are not limited to the following: Browsers and Web Interfaces; Electronic Commerce; Electronic Publishing; Hypertext and Hypermedia; Semantic Web; Web Engineering; Web Services; and Service-Oriented Computing XML. In addition, papers addressing the intersection of the following broader technologies with the Web are also in scope: Accessibility; Business Services Education; Knowledge Management and Representation; Mobility and pervasive computing; Performance and scalability; Recommender systems; Searching, Indexing, Classification, Retrieval and Querying, Data Mining and Analysis; Security and Privacy; and User Interfaces. Papers discussing specific Web technologies, applications, content generation and management and use are within scope. Also, papers describing novel applications of the web as well as papers on the underlying technologies are welcome.