AraProp at WANLP 2022 Shared Task: Leveraging Pre-Trained Language Models for Arabic Propaganda Detection

Gaurav Singh
{"title":"AraProp at WANLP 2022 Shared Task: Leveraging Pre-Trained Language Models for Arabic Propaganda Detection","authors":"Gaurav Singh","doi":"10.18653/v1/2022.wanlp-1.56","DOIUrl":null,"url":null,"abstract":"This paper presents the approach taken for the shared task on Propaganda Detection in Arabic at the Seventh Arabic Natural Language Processing Workshop (WANLP 2022). We participated in Sub-task 1 where the text of a tweet is provided, and the goal is to identify the different propaganda techniques used in it. This problem belongs to multi-label classification. For our solution, we approached leveraging different transformer based pre-trained language models with fine-tuning to solve this problem. We found that MARBERTv2 outperforms in terms of performance where F1-macro is 0.08175 and F1-micro is 0.61116 compared to other language models that we considered. Our method achieved rank 4 in the testing phase of the challenge.","PeriodicalId":355149,"journal":{"name":"Workshop on Arabic Natural Language Processing","volume":"105 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Arabic Natural Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2022.wanlp-1.56","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 2

Abstract

This paper presents the approach taken for the shared task on Propaganda Detection in Arabic at the Seventh Arabic Natural Language Processing Workshop (WANLP 2022). We participated in Sub-task 1, in which the text of a tweet is provided and the goal is to identify the different propaganda techniques used in it; this is a multi-label classification problem. For our solution, we fine-tuned several transformer-based pre-trained language models. Of the models we considered, MARBERTv2 performed best, achieving an F1-macro of 0.08175 and an F1-micro of 0.61116. Our method ranked 4th in the testing phase of the challenge.
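The paper's abstract does not include code, but the described setup maps onto a standard fine-tuning recipe. Below is a minimal sketch, assuming the Hugging Face transformers library and the public UBC-NLP/MARBERTv2 checkpoint, of fine-tuning a pre-trained model for multi-label classification of propaganda techniques; the label count, decision threshold, and example tweet are illustrative assumptions, not details taken from the paper.

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "UBC-NLP/MARBERTv2"   # public MARBERTv2 checkpoint
NUM_TECHNIQUES = 20                # assumed size of the task's label inventory

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME,
    num_labels=NUM_TECHNIQUES,
    problem_type="multi_label_classification",  # uses BCE-with-logits loss
)

# One hypothetical training example: a tweet and its multi-hot label vector.
tweet = "نص التغريدة هنا"              # placeholder Arabic tweet text
labels = torch.zeros(NUM_TECHNIQUES)
labels[[2, 7]] = 1.0                   # e.g. two techniques present in this tweet

inputs = tokenizer(tweet, truncation=True, return_tensors="pt")
outputs = model(**inputs, labels=labels.unsqueeze(0))
outputs.loss.backward()                # an optimizer step would follow

# At inference time, each technique is predicted independently by
# thresholding the per-label sigmoid probabilities.
probs = torch.sigmoid(outputs.logits)
predicted = (probs > 0.5).nonzero(as_tuple=True)[1].tolist()

Setting problem_type="multi_label_classification" switches the model's loss to binary cross-entropy over independent per-label logits, so each tweet can be assigned any subset of techniques, which matches the multi-label formulation of Sub-task 1.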