Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases

Yingji Li, Mengnan Du, Xin Wang, Y. Wang
{"title":"Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases","authors":"Yingji Li, Mengnan Du, Xin Wang, Y. Wang","doi":"10.48550/arXiv.2307.01595","DOIUrl":null,"url":null,"abstract":"As the representation capability of Pre-trained Language Models (PLMs) improve, there is growing concern that they will inherit social biases from unprocessed corpora. Most previous debiasing techniques used Counterfactual Data Augmentation (CDA) to balance the training corpus. However, CDA slightly modifies the original corpus, limiting the representation distance between different demographic groups to a narrow range. As a result, the debiasing model easily fits the differences between counterfactual pairs, which affects its debiasing performance with limited text resources. In this paper, we propose an adversarial training-inspired two-stage debiasing model using Contrastive learning with Continuous Prompt Augmentation (named CCPA) to mitigate social biases in PLMs’ encoding. In the first stage, we propose a data augmentation method based on continuous prompt tuning to push farther the representation distance between sample pairs along different demographic groups. In the second stage, we utilize contrastive learning to pull closer the representation distance between the augmented sample pairs and then fine-tune PLMs’ parameters to get debiased encoding. Our approach guides the model to achieve stronger debiasing performance by adding difficulty to the training process. Extensive experiments show that CCPA outperforms baselines in terms of debiasing performance. Meanwhile, experimental results on the GLUE benchmark show that CCPA retains the language modeling capability of PLMs.","PeriodicalId":352845,"journal":{"name":"Annual Meeting of the Association for Computational Linguistics","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annual Meeting of the Association for Computational Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2307.01595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

As the representation capability of Pre-trained Language Models (PLMs) improves, there is growing concern that they will inherit social biases from unprocessed corpora. Most previous debiasing techniques use Counterfactual Data Augmentation (CDA) to balance the training corpus. However, CDA only slightly modifies the original corpus, limiting the representation distance between different demographic groups to a narrow range. As a result, the debiasing model easily fits the differences between counterfactual pairs, which hurts its debiasing performance when text resources are limited. In this paper, we propose an adversarial-training-inspired two-stage debiasing model using Contrastive learning with Continuous Prompt Augmentation (named CCPA) to mitigate social biases in PLMs’ encodings. In the first stage, we propose a data augmentation method based on continuous prompt tuning that pushes the representations of sample pairs from different demographic groups farther apart. In the second stage, we use contrastive learning to pull the representations of the augmented sample pairs closer together, fine-tuning the PLM’s parameters to obtain debiased encodings. By adding difficulty to the training process, our approach guides the model toward stronger debiasing performance. Extensive experiments show that CCPA outperforms baselines in debiasing performance. Meanwhile, experimental results on the GLUE benchmark show that CCPA retains the language modeling capability of PLMs.
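The abstract only sketches the mechanics, so below is a minimal, hypothetical PyTorch illustration of the two stages. It is not the authors' released implementation: the toy mean-pooling encoder stands in for a real PLM, `counterfactual_pair_batch` is an assumed stand-in for a CDA-style data loader, and the cosine-similarity push loss and InfoNCE-style pull loss are plausible instantiations of the objectives the abstract describes, not necessarily the paper's exact losses.

```python
# Hypothetical sketch of CCPA's two stages. A toy mean-pooling encoder
# stands in for a real PLM; the loss forms are assumptions, not the paper's.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
DIM = 32  # embedding width of the toy encoder


class ToyEncoder(nn.Module):
    """Stand-in for a PLM encoder: linear projection + mean pooling."""

    def __init__(self, dim=DIM):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, token_embs):  # (B, L, DIM) -> (B, DIM)
        return self.proj(token_embs).mean(dim=1)


def counterfactual_pair_batch(batch=8, length=6, dim=DIM):
    """Assumed loader: token embeddings of a sentence and its counterfactual
    (e.g., gender-swapped) counterpart; CDA pairs differ only slightly."""
    base = torch.randn(batch, length, dim)
    return base, base + 0.1 * torch.randn_like(base)


def with_prompts(prompts, token_embs):
    """Prepend the continuous prompt tokens to every sentence in the batch."""
    return torch.cat([prompts.expand(token_embs.size(0), -1, -1), token_embs], dim=1)


encoder = ToyEncoder()
prompts = nn.Parameter(torch.randn(1, 4, DIM))  # 4 trainable prompt tokens

# Stage 1: freeze the PLM, tune only the prompts to PUSH the pairs apart.
for p in encoder.parameters():
    p.requires_grad_(False)
opt = torch.optim.Adam([prompts], lr=1e-2)
for _ in range(50):
    a, b = counterfactual_pair_batch()
    za = encoder(with_prompts(prompts, a))
    zb = encoder(with_prompts(prompts, b))
    # Minimizing similarity between the pair maximizes their distance.
    push_loss = F.cosine_similarity(za, zb, dim=-1).mean()
    opt.zero_grad()
    push_loss.backward()
    opt.step()

# Stage 2: freeze the prompts, fine-tune the PLM to PULL the pairs together.
prompts.requires_grad_(False)
for p in encoder.parameters():
    p.requires_grad_(True)
opt = torch.optim.AdamW(encoder.parameters(), lr=1e-3)
for _ in range(50):
    a, b = counterfactual_pair_batch()
    za = F.normalize(encoder(with_prompts(prompts, a)), dim=-1)
    zb = F.normalize(encoder(with_prompts(prompts, b)), dim=-1)
    logits = za @ zb.t() / 0.05  # InfoNCE-style: diagonal entries are positives
    pull_loss = F.cross_entropy(logits, torch.arange(za.size(0)))
    opt.zero_grad()
    pull_loss.backward()
    opt.step()
```

The design point the sketch preserves is the adversarial alternation: stage 1 optimizes only the continuous prompts to widen the gap between counterfactual pairs, and stage 2 holds those prompts fixed while the encoder learns to close the enlarged gap, which is what makes the debiasing task harder than plain CDA fine-tuning.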