E-VAN : Enhanced Variational AutoEncoder Network for Mitigating Gender Bias in Static Word Embeddings

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI:10.1145/3582768.3582804

Swati Tyagi, Jiaheng Xie, Rick Andrews

引用次数: 0

Abstract

Recent research has shown that pre-trained context-independent word embeddings display biases such as racial bias, gender bias, etc. Using a novel, tunable algorithm, this study attempts to mitigate the hidden gender bias in static embeddings. In order to train the model, an enhanced variational autoencoder (E-VAN) is used to learn the latent space of the embedding. Then the latent distributions are used while adaptively resampling and re-weighting the rare/under-represented data. While the word embeddings retain semantic information, E-VAN effectively mitigates unwanted biased gendered associations. Our method E-VAN outperforms previous state-of-the-art methods in both quantitative and human evaluation.

查看原文本刊更多论文

E-VAN:用于减少静态词嵌入中性别偏见的增强变分自编码器网络

最近的研究表明，预先训练的上下文无关词嵌入会显示出种族偏见、性别偏见等偏见。本研究使用一种新颖的、可调的算法，试图减轻静态嵌入中隐藏的性别偏见。为了训练模型，使用了一种增强的变分自编码器(E-VAN)来学习嵌入的潜在空间。然后利用潜在分布自适应重采样和重加权稀有/代表性不足的数据。当词嵌入保留语义信息时，E-VAN有效地减轻了不必要的偏见性别关联。我们的方法E-VAN优于以前的最先进的方法在定量和人的评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval

自引率

0.00%

发文量