A.K. Indira Kumar , Gayathri Sthanusubramoniani , Deepa Gupta , Aarathi Rajagopalan Nair , Yousef Ajami Alotaibi , Mohammed Zakariah
{"title":"Multi-task detection of harmful content in code-mixed meme captions using large language models with zero-shot, few-shot, and fine-tuning approaches","authors":"A.K. Indira Kumar , Gayathri Sthanusubramoniani , Deepa Gupta , Aarathi Rajagopalan Nair , Yousef Ajami Alotaibi , Mohammed Zakariah","doi":"10.1016/j.eij.2025.100683","DOIUrl":null,"url":null,"abstract":"<div><div>In today’s digital world, memes have become a common form of communication, shaping online conversations and reflecting social events. However, some memes can negatively impact people’s emotions, especially when they involve sensitive topics or mock certain groups or individuals. To address this issue, it is important to create a system that can identify and remove harmful memes before they cause further harm. Using Large Language Models for text classification in this system offers a promising approach, as these models are skilled at understanding complex language structures and recognizing patterns, including those in code-mixed language. This research focuses on evaluating how well different Large Language Models perform in identifying memes that promote cyberbullying. It covers tasks like cyberbullying detection, sentiment analysis, emotion recognition, sarcasm detection, and harmfulness evaluation. The results show significant improvements, with a 7.94% increase in accuracy for cyberbullying detection, a 2.68% improvement in harmfulness evaluation, and a 1.7% boost in sarcasm detection compared to previous top models. There is also a 1.07% improvement in emotion detection. These findings highlight the ability of Large Language Models to help tackle cyberbullying and create safer online spaces.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"30 ","pages":"Article 100683"},"PeriodicalIF":5.0000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525000763","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In today’s digital world, memes have become a common form of communication, shaping online conversations and reflecting social events. However, some memes can negatively impact people’s emotions, especially when they involve sensitive topics or mock certain groups or individuals. To address this issue, it is important to create a system that can identify and remove harmful memes before they cause further harm. Using Large Language Models for text classification in this system offers a promising approach, as these models are skilled at understanding complex language structures and recognizing patterns, including those in code-mixed language. This research focuses on evaluating how well different Large Language Models perform in identifying memes that promote cyberbullying. It covers tasks like cyberbullying detection, sentiment analysis, emotion recognition, sarcasm detection, and harmfulness evaluation. The results show significant improvements, with a 7.94% increase in accuracy for cyberbullying detection, a 2.68% improvement in harmfulness evaluation, and a 1.7% boost in sarcasm detection compared to previous top models. There is also a 1.07% improvement in emotion detection. These findings highlight the ability of Large Language Models to help tackle cyberbullying and create safer online spaces.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.