HaSpeeDe 2 @ EVALITA2020: Overview of the EVALITA 2020 Hate Speech Detection Task

EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020 Pub Date : 1900-01-01 DOI:10.4000/BOOKS.AACCADEMIA.6897

M. Sanguinetti, G. Comandini, Elisa Di Nuovo, Simona Frenda, M. Stranisci, C. Bosco, Tommaso Caselli, V. Patti, Irene Russo

引用次数: 54

Abstract

The Hate Speech Detection (HaSpeeDe 2) task is the second edition of a shared task on the detection of hateful content in Italian Twitter messages. HaSpeeDe 2 is composed of a Main task (hate speech detection) and two Pilot tasks, (stereotype and nominal utterance detection). Systems were challenged along two dimensions: (i) time, with test data coming from a different time period than the training data, and (ii) domain, with test data coming from the news domain (i.e., news headlines). Overall, 14 teams participated in the Main task, the best systems achieved a macro F1-score of 0.8088 and 0.7744 on the indomain in the out-of-domain test sets, respectively; 6 teams submitted their results for Pilot task 1 (stereotype detection), the best systems achieved a macro F1-score of 0.7719 and 0.7203 on in-domain and outof-domain test sets. We did not receive any submission for Pilot task 2.

查看原文本刊更多论文

HaSpeeDe 2 @ EVALITA2020: EVALITA2020仇恨言论检测任务概述

仇恨言论检测(HaSpeeDe 2)任务是关于检测意大利Twitter消息中仇恨内容的共享任务的第二版。HaSpeeDe 2由一个主任务(仇恨言语检测)和两个先导任务(刻板印象和名义话语检测)组成。系统在两个维度上受到挑战:(i)时间，测试数据来自与训练数据不同的时间段;(ii)领域，测试数据来自新闻领域(即新闻标题)。总体而言，有14个团队参与了Main任务，其中最好的系统在域外测试集中分别获得了0.8088和0.7744的宏观f1分数;6个团队提交了他们的实验任务1(刻板印象检测)的结果，最好的系统在域内和域外测试集中获得了0.7719和0.7203的宏观f1分数。我们没有收到任何关于试点任务2的提交。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020

自引率

0.00%

发文量