On Gender Biases in Offensive Language Classification Models

Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP) Pub Date : 1900-01-01 DOI:10.18653/v1/2022.gebnlp-1.19

Sanjana Marcé, Adam Poliak

引用次数: 1

Abstract

We explore whether neural Natural Language Processing models trained to identify offensive language in tweets contain gender biases. We add historically gendered and gender ambiguous American names to an existing offensive language evaluation set to determine whether models? predictions are sensitive or robust to gendered names. While we see some evidence that these models might be prone to biased stereotypes that men use more offensive language than women, our results indicate that these models? binary predictions might not greatly change based upon gendered names.

查看原文本刊更多论文

论攻击性语言分类模型中的性别偏见

我们探索神经自然语言处理模型是否训练识别推文中的攻击性语言包含性别偏见。我们将历史上性别化和性别模糊的美国名字添加到现有的冒犯性语言评估集中，以确定模型是否?预测对性别名字是敏感的或可靠的。虽然我们看到一些证据表明，这些模特可能倾向于有偏见的刻板印象，即男性比女性使用更多的攻击性语言，但我们的研究结果表明，这些模特?二元预测可能不会因为性别名字而发生很大变化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP)

自引率

0.00%

发文量