对协变量移位具有鲁棒性的公平人工智能的Lq正则化

Statistical Analysis and Data Mining: The ASA Data Science Journal Pub Date : 2023-02-22 DOI:10.1002/sam.11616

Seonghyeon Kim, Sara Kim, Kunwoong Kim, Yongdai Kim

{"title":"对协变量移位具有鲁棒性的公平人工智能的Lq正则化","authors":"Seonghyeon Kim, Sara Kim, Kunwoong Kim, Yongdai Kim","doi":"10.1002/sam.11616","DOIUrl":null,"url":null,"abstract":"It is well recognized that historical biases exist in training data against a certain sensitive group (e.g., non‐White, women) which are socially unacceptable, and these unfair biases are inherited in trained artificial intelligence (AI) models. Various learning algorithms have been proposed to remove or alleviate unfair biases in trained AI models. In this paper, we consider another type of bias in training data so‐called covariate shift in view of fair AI. Here, covariate shift means that training data do not represent the population of interest well. Covariate shift occurs when special sampling designs (e.g., stratified sampling) are used when collecting training data, or the population where training data are collected is different from the population of interest. When covariate shift exists, fair AI models on training data may not be fair in test data. To ensure fairness on test data, we develop computationally efficient learning algorithms robust to covariate shifts. In particular, we propose a robust fairness constraint based on the Lq norm which is a generic algorithm to be applied to various fairness AI problems without much hampering. By analyzing multiple benchmark datasets, we show that our proposed robust fairness AI algorithm improves existing fair AI algorithms much in terms of the fairness‐accuracy tradeoff to covariate shift and has significant computational advantages compared to other robust fair AI algorithms.","PeriodicalId":342679,"journal":{"name":"Statistical Analysis and Data Mining: The ASA Data Science Journal","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lq regularization for fair artificial intelligence robust to covariate shift\",\"authors\":\"Seonghyeon Kim, Sara Kim, Kunwoong Kim, Yongdai Kim\",\"doi\":\"10.1002/sam.11616\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is well recognized that historical biases exist in training data against a certain sensitive group (e.g., non‐White, women) which are socially unacceptable, and these unfair biases are inherited in trained artificial intelligence (AI) models. Various learning algorithms have been proposed to remove or alleviate unfair biases in trained AI models. In this paper, we consider another type of bias in training data so‐called covariate shift in view of fair AI. Here, covariate shift means that training data do not represent the population of interest well. Covariate shift occurs when special sampling designs (e.g., stratified sampling) are used when collecting training data, or the population where training data are collected is different from the population of interest. When covariate shift exists, fair AI models on training data may not be fair in test data. To ensure fairness on test data, we develop computationally efficient learning algorithms robust to covariate shifts. In particular, we propose a robust fairness constraint based on the Lq norm which is a generic algorithm to be applied to various fairness AI problems without much hampering. By analyzing multiple benchmark datasets, we show that our proposed robust fairness AI algorithm improves existing fair AI algorithms much in terms of the fairness‐accuracy tradeoff to covariate shift and has significant computational advantages compared to other robust fair AI algorithms.\",\"PeriodicalId\":342679,\"journal\":{\"name\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/sam.11616\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Analysis and Data Mining: The ASA Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/sam.11616","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

众所周知，训练数据中存在针对特定敏感群体(如非白人、女性)的历史偏见，这是社会不可接受的，这些不公平的偏见在训练过的人工智能(AI)模型中是遗传的。人们提出了各种学习算法来消除或减轻训练有素的人工智能模型中的不公平偏见。在本文中，我们考虑了训练数据中的另一种类型的偏差，即所谓的协变量移位。在这里，协变量移位意味着训练数据不能很好地代表感兴趣的总体。当在收集训练数据时使用特殊的抽样设计(例如，分层抽样)，或者收集训练数据的总体不同于感兴趣的总体时，协变量移位就会发生。当协变量移位存在时，对训练数据公平的AI模型可能对测试数据不公平。为了确保测试数据的公平性，我们开发了计算效率高的学习算法，对协变量移位具有鲁棒性。特别地，我们提出了一种基于Lq范数的鲁棒公平性约束，这是一种通用算法，可以应用于各种公平性人工智能问题而不会受到太多阻碍。通过分析多个基准数据集，我们表明，我们提出的鲁棒公平AI算法在公平-准确性权衡到协变量移位方面大大改进了现有的公平AI算法，并且与其他鲁棒公平AI算法相比具有显着的计算优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Lq regularization for fair artificial intelligence robust to covariate shift

It is well recognized that historical biases exist in training data against a certain sensitive group (e.g., non‐White, women) which are socially unacceptable, and these unfair biases are inherited in trained artificial intelligence (AI) models. Various learning algorithms have been proposed to remove or alleviate unfair biases in trained AI models. In this paper, we consider another type of bias in training data so‐called covariate shift in view of fair AI. Here, covariate shift means that training data do not represent the population of interest well. Covariate shift occurs when special sampling designs (e.g., stratified sampling) are used when collecting training data, or the population where training data are collected is different from the population of interest. When covariate shift exists, fair AI models on training data may not be fair in test data. To ensure fairness on test data, we develop computationally efficient learning algorithms robust to covariate shifts. In particular, we propose a robust fairness constraint based on the Lq norm which is a generic algorithm to be applied to various fairness AI problems without much hampering. By analyzing multiple benchmark datasets, we show that our proposed robust fairness AI algorithm improves existing fair AI algorithms much in terms of the fairness‐accuracy tradeoff to covariate shift and has significant computational advantages compared to other robust fair AI algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Statistical Analysis and Data Mining: The ASA Data Science Journal

自引率

0.00%

发文量