The MADAR Shared Task on Arabic Fine-Grained Dialect Identification

WANLP@ACL 2019 Pub Date : 2019-08-01 DOI:10.18653/v1/W19-4622

Houda Bouamor, Sabit Hassan, Nizar Habash

引用次数: 96

Abstract

In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine-Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic Natural Language Processing Workshop, collocated with ACL 2019. The shared task includes two subtasks: the MADAR Travel Domain Dialect Identification subtask (Subtask 1) and the MADAR Twitter User Dialect Identification subtask (Subtask 2). This shared task is the first to target a large set of dialect labels at the city and country levels. The data for the shared task was created or collected under the Multi-Arabic Dialect Applications and Resources (MADAR) project. A total of 21 teams from 15 countries participated in the shared task.

查看原文本刊更多论文

阿拉伯语细粒度方言识别的MADAR共享任务

在本文中，我们介绍了MADAR阿拉伯语细粒度方言识别共享任务的结果和发现。这项共同任务是第四届阿拉伯语自然语言处理研讨会的一部分，与ACL 2019同期举行。共享任务包括两个子任务:MADAR旅游领域方言识别子任务(subtask 1)和MADAR推特用户方言识别子任务(subtask 2)。这个共享任务是第一个针对城市和国家级别的大量方言标签的任务。共享任务的数据是在多阿拉伯语方言应用和资源(MADAR)项目下创建或收集的。共有来自15个国家的21支队伍参与了这项共同任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WANLP@ACL 2019

自引率

0.00%

发文量