Low-level Bias discovery and Mitigation for Image Classification

2022 IEEE International Conference on Signal Processing and Communications (SPCOM) Pub Date : 2022-07-11 DOI:10.1109/SPCOM55316.2022.9840811

Vartika Sengar, S. VivekB., Gaurab Bhattacharya, J. Gubbi, Arpan Pal, P. Balamuralidhar

{"title":"Low-level Bias discovery and Mitigation for Image Classification","authors":"Vartika Sengar, S. VivekB., Gaurab Bhattacharya, J. Gubbi, Arpan Pal, P. Balamuralidhar","doi":"10.1109/SPCOM55316.2022.9840811","DOIUrl":null,"url":null,"abstract":"Identification of bias and its mitigation in a classifier is a fundamental sanity check required in trustworthy AI systems. There have been many methods for mitigation of bias in literature that use bias as apriori information. In this work, we propose a system that can detect the low-level bias (e.g., color, texture) and mitigate the same. A novel auto-encoder architecture to explain the predictions made by a deep neural network is built that helps in identification of the bias. The auto-encoder is trained to produce a generalized representation of the input image by decomposing it into a set of latent embeddings. These embeddings are learned by specializing the group of higher dimensional feature maps to learn the disentangled color and shape concepts. The shape embeddings are trained to reconstruct discrete wavelet transform components of an image and the color embeddings are trained to capture the color information. The feature specialization is done by reconstructing the RGB image using the shape embeddings modulated by color embeddings. We have shown that these representations can be used to detect low level bias in a classification task. Post detection of bias, we also propose a method to de-bias the classifier by training it with counterfactual images generated by manipulating the representations learned by the auto-encoder. We have shown that our proposed method of bias discovery and mitigation is able to achieve state-of-the-art results on ColorMNIST and the newly proposed BiasedShape dataset.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM55316.2022.9840811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Identification of bias and its mitigation in a classifier is a fundamental sanity check required in trustworthy AI systems. There have been many methods for mitigation of bias in literature that use bias as apriori information. In this work, we propose a system that can detect the low-level bias (e.g., color, texture) and mitigate the same. A novel auto-encoder architecture to explain the predictions made by a deep neural network is built that helps in identification of the bias. The auto-encoder is trained to produce a generalized representation of the input image by decomposing it into a set of latent embeddings. These embeddings are learned by specializing the group of higher dimensional feature maps to learn the disentangled color and shape concepts. The shape embeddings are trained to reconstruct discrete wavelet transform components of an image and the color embeddings are trained to capture the color information. The feature specialization is done by reconstructing the RGB image using the shape embeddings modulated by color embeddings. We have shown that these representations can be used to detect low level bias in a classification task. Post detection of bias, we also propose a method to de-bias the classifier by training it with counterfactual images generated by manipulating the representations learned by the auto-encoder. We have shown that our proposed method of bias discovery and mitigation is able to achieve state-of-the-art results on ColorMNIST and the newly proposed BiasedShape dataset.

查看原文本刊更多论文

图像分类中的低水平偏差发现与消除

在分类器中识别偏见并减轻其影响是值得信赖的人工智能系统所需的基本完整性检查。在文献中，有许多方法可以利用偏见作为先验信息来减轻偏见。在这项工作中，我们提出了一个可以检测低水平偏差(例如，颜色，纹理)并减轻其影响的系统。一种新的自编码器架构来解释由深度神经网络做出的预测，有助于识别偏差。训练自编码器通过将输入图像分解成一组潜在嵌入来产生输入图像的广义表示。这些嵌入是通过专门化高维特征映射来学习解纠缠的颜色和形状概念来学习的。训练形状嵌入来重建图像的离散小波变换分量，训练颜色嵌入来捕获图像的颜色信息。特征专门化是通过颜色嵌入调制的形状嵌入重构RGB图像来实现的。我们已经证明，这些表征可以用来检测分类任务中的低水平偏差。在检测到偏差后，我们还提出了一种方法，通过操纵自编码器学习的表示生成的反事实图像来训练分类器来消除偏差。我们已经证明，我们提出的偏差发现和缓解方法能够在ColorMNIST和新提出的BiasedShape数据集上获得最先进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Signal Processing and Communications (SPCOM)

自引率

0.00%

发文量