Hateful Meme Prediction Model Using Multimodal Deep Learning

2021 International Conference on Computing, Communication and Green Engineering (CCGE) Pub Date : 2021-09-23 DOI:10.1109/CCGE50943.2021.9776440

Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty

{"title":"Hateful Meme Prediction Model Using Multimodal Deep Learning","authors":"Md. Rekib Ahmed, Neeraj Bhadani, I. Chakraborty","doi":"10.1109/CCGE50943.2021.9776440","DOIUrl":null,"url":null,"abstract":"With the emergence of deep neural networks along with high-end computers that can process deep architectures, there has been a lot of research when Computer Vision and Natural Language Processing has been fused into a single problem. To enable students and researchers to deep dive into multimodal deep learning Facebook AI Research team published a dataset on hateful meme classification “The Hateful Meme Challenge Dataset” in May 2020 that gave us the motivation to test ourselves and an opportunity to learn more about the dataset. The rise of communication on the internet with memes as a medium, they have been used to convey incorrect information, political agendas and also has led to cyberbullying, trolling etc. This results in the need of creating an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper intends to adopt Unimodal Text and Image models using Bert, LSTM and VGG16, Resnet50, SE-Resnet50, XSE-Resnet architectures and combining them into Multimodal models for effective prediction of a hateful meme. The paper compares various architectures both unimodal models and multimodal models on the evaluation metrics AUC-ROC score, F1 score and accuracy score.)","PeriodicalId":130452,"journal":{"name":"2021 International Conference on Computing, Communication and Green Engineering (CCGE)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computing, Communication and Green Engineering (CCGE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCGE50943.2021.9776440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

With the emergence of deep neural networks along with high-end computers that can process deep architectures, there has been a lot of research when Computer Vision and Natural Language Processing has been fused into a single problem. To enable students and researchers to deep dive into multimodal deep learning Facebook AI Research team published a dataset on hateful meme classification “The Hateful Meme Challenge Dataset” in May 2020 that gave us the motivation to test ourselves and an opportunity to learn more about the dataset. The rise of communication on the internet with memes as a medium, they have been used to convey incorrect information, political agendas and also has led to cyberbullying, trolling etc. This results in the need of creating an automated tool that can detect such hateful content published on the internet and remove it at the root level before it does any harm. This paper intends to adopt Unimodal Text and Image models using Bert, LSTM and VGG16, Resnet50, SE-Resnet50, XSE-Resnet architectures and combining them into Multimodal models for effective prediction of a hateful meme. The paper compares various architectures both unimodal models and multimodal models on the evaluation metrics AUC-ROC score, F1 score and accuracy score.)

查看原文本刊更多论文

基于多模态深度学习的仇恨模因预测模型

随着深度神经网络的出现以及可以处理深度架构的高端计算机的出现，将计算机视觉和自然语言处理融合为一个问题的研究已经很多。为了让学生和研究人员深入研究多模态深度学习，Facebook人工智能研究团队于2020年5月发布了一个关于仇恨模因分类的数据集“仇恨模因挑战数据集”，这给了我们测试自己的动力，并有机会了解更多关于数据集的信息。以表情包为媒介的互联网交流的兴起，它们被用来传达不正确的信息、政治议程，也导致了网络欺凌、网络喷子等。这就需要创建一个自动化工具来检测互联网上发布的这种仇恨内容，并在其造成任何伤害之前从根本上将其删除。本文拟采用Bert、LSTM和VGG16、Resnet50、SE-Resnet50、XSE-Resnet架构的单模态文本和图像模型，并将它们组合成多模态模型，以有效预测仇恨模因。在评价指标AUC-ROC评分、F1评分和准确率评分上，比较了单模态模型和多模态模型的不同架构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Computing, Communication and Green Engineering (CCGE)

自引率

0.00%

发文量