ZCU-NLP at MADAR 2019: Recognizing Arabic Dialects

WANLP@ACL 2019 Pub Date : 1900-01-01 DOI:10.18653/v1/W19-4623

P. Pribán, Stephen Eugene Taylor

引用次数: 5

Abstract

In this paper, we present our systems for the MADAR Shared Task: Arabic Fine-Grained Dialect Identification. The shared task consists of two subtasks. The goal of Subtask– 1 (S-1) is to detect an Arabic city dialect in a given text and the goal of Subtask–2 (S-2) is to predict the country of origin of a Twitter user by using tweets posted by the user. In S-1, our proposed systems are based on language modelling. We use language models to extract features that are later used as an input for other machine learning algorithms. We also experiment with recurrent neural networks (RNN), but these experiments showed that simpler machine learning algorithms are more successful. Our system achieves 0.658 macro F1-score and our rank is 6th out of 19 teams in S-1 and 7th in S-2 with 0.475 macro F1-score.

查看原文本刊更多论文

ZCU-NLP在MADAR 2019:识别阿拉伯语方言

在本文中，我们提出了我们的系统MADAR共享任务:阿拉伯语细粒度方言识别。共享任务由两个子任务组成。Subtask -1 (S-1)的目标是检测给定文本中的阿拉伯城市方言，Subtask -2 (S-2)的目标是通过使用用户发布的tweet来预测Twitter用户的原籍国。在S-1中，我们提出的系统是基于语言建模的。我们使用语言模型来提取特征，这些特征后来被用作其他机器学习算法的输入。我们也对循环神经网络(RNN)进行了实验，但这些实验表明，更简单的机器学习算法更成功。我们的系统宏f1得分为0.658，在S-1的19支队伍中排名第6，在S-2的排名第7，宏f1得分为0.475。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WANLP@ACL 2019

自引率

0.00%

发文量