Incorporating Fairness in Large Scale NLU Systems

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining Pub Date : 2023-02-27 DOI:10.1145/3539597.3575785

Rahul Gupta, Lisa Bauer, Kai-Wei Chang, J. Dhamala, A. Galstyan, Palash Goyal, Qian Hu, Avni Khatri, Rohit Parimi, Charith S. Peris, Apurv Verma, R. Zemel, Premkumar Natarajan

{"title":"Incorporating Fairness in Large Scale NLU Systems","authors":"Rahul Gupta, Lisa Bauer, Kai-Wei Chang, J. Dhamala, A. Galstyan, Palash Goyal, Qian Hu, Avni Khatri, Rohit Parimi, Charith S. Peris, Apurv Verma, R. Zemel, Premkumar Natarajan","doi":"10.1145/3539597.3575785","DOIUrl":null,"url":null,"abstract":"NLU models power several user facing experiences such as conversations agents and chat bots. Building NLU models typically consist of 3 stages: a) building or finetuning a pre-trained model b) distilling or fine-tuning the pre-trained model to build task specific models and, c) deploying the task-specific model to production. In this presentation, we will identify fairness considerations that can be incorporated in the aforementioned three stages in the life-cycle of NLU model building: (i) selection/building of a large scale language model, (ii) distillation/fine-tuning the large model into task specific model and, (iii) deployment of the task specific model. We will present select metrics that can be used to quantify fairness in NLU models and fairness enhancement techniques that can be deployed in each of these stages. Finally, we will share some recommendations to successfully implement fairness considerations when building an industrial scale NLU system.","PeriodicalId":227804,"journal":{"name":"Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3539597.3575785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

NLU models power several user facing experiences such as conversations agents and chat bots. Building NLU models typically consist of 3 stages: a) building or finetuning a pre-trained model b) distilling or fine-tuning the pre-trained model to build task specific models and, c) deploying the task-specific model to production. In this presentation, we will identify fairness considerations that can be incorporated in the aforementioned three stages in the life-cycle of NLU model building: (i) selection/building of a large scale language model, (ii) distillation/fine-tuning the large model into task specific model and, (iii) deployment of the task specific model. We will present select metrics that can be used to quantify fairness in NLU models and fairness enhancement techniques that can be deployed in each of these stages. Finally, we will share some recommendations to successfully implement fairness considerations when building an industrial scale NLU system.

查看原文本刊更多论文

大规模NLU系统中公平性的引入

NLU模型支持多种面向用户的体验，如对话代理和聊天机器人。构建NLU模型通常包括3个阶段:a)构建或微调预训练模型;b)提取或微调预训练模型以构建特定于任务的模型;c)将特定于任务的模型部署到生产环境中。在本次演讲中，我们将确定可以纳入NLU模型构建生命周期中上述三个阶段的公平性考虑因素:(i)选择/构建大规模语言模型，(ii)将大型模型蒸馏/微调为特定任务模型，(iii)部署特定任务模型。我们将提出可用于量化NLU模型中的公平性的选择指标，以及可在每个阶段部署的公平性增强技术。最后，我们将分享一些建议，以便在构建工业规模的NLU系统时成功实现公平性考虑。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining

自引率

0.00%

发文量