No Army, No Navy: BERT Semi-Supervised Learning of Arabic Dialects

WANLP@ACL 2019 Pub Date : 2019-08-01 DOI:10.18653/v1/W19-4637

Chiyu Zhang, Muhammad Abdul-Mageed

引用次数: 30

Abstract

We present our deep leaning system submitted to MADAR shared task 2 focused on twitter user dialect identification. We develop tweet-level identification models based on GRUs and BERT in supervised and semi-supervised set-tings. We then introduce a simple, yet effective, method of porting tweet-level labels at the level of users. Our system ranks top 1 in the competition, with 71.70% macro F1 score and 77.40% accuracy.

查看原文本刊更多论文

没有陆军，没有海军:BERT半监督学习阿拉伯方言

我们将我们的深度学习系统提交给MADAR共享任务2，重点关注twitter用户方言识别。我们在监督和半监督设置中开发了基于gru和BERT的推文级识别模型。然后，我们将介绍一种简单而有效的方法，在用户级别上移植tweet级别的标签。我们的系统在比赛中排名第一，宏观F1得分为71.70%，准确率为77.40%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WANLP@ACL 2019

自引率

0.00%

发文量