{"title":"特权与非特权群体:年龄属性对公平影响的实证研究","authors":"Max Hort, Federica Sarro","doi":"10.1145/3524491.3527308","DOIUrl":null,"url":null,"abstract":"Recent advances in software fairness investigate bias in the treatment of different population groups, which are devised based on attributes such as gender, race and age. Groups are divided into privileged groups (favourable treatment) and unprivileged groups (unfavourable treatment). To truthfully represent the real world and to measure the degree of bias according to age (young vs. old), one needs to pick a threshold to separate those groups. In this study we investigate two popular datasets (i.e., German and Bank) and the bias observed when using every possible age threshold in order to divide the population into “young” and “old” groups, in combination with three different Machine Learning models (i.e., Logistic Regression, Decision Tree, Support Vector Machine). Our results show that age thresholds do not only impact the intensity of bias in these datasets, but also the direction (i.e., which population group receives a favourable outcome). For the two investigated datasets, we present a selection of suitable age thresholds. We also found strong and very strong correlations between the dataset bias and the respective bias of trained classification models, in 83% of the cases studied. CCS CONCEPTS • Social and professional topics → User characteristics; • General and reference → Empirical studies. ACM Reference Format: Max Hort and Federica Sarro. 2022. Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness. In International Workshop on Equirable Data and Technology (FairWare ’22), May 9, 2022, Pittsburgh, PA, USA. ACM, New York, NY, USA, 8 pages. https://doi.org/10.1145/3524491.3527308","PeriodicalId":287874,"journal":{"name":"2022 IEEE/ACM International Workshop on Equitable Data & Technology (FairWare)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness\",\"authors\":\"Max Hort, Federica Sarro\",\"doi\":\"10.1145/3524491.3527308\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent advances in software fairness investigate bias in the treatment of different population groups, which are devised based on attributes such as gender, race and age. Groups are divided into privileged groups (favourable treatment) and unprivileged groups (unfavourable treatment). To truthfully represent the real world and to measure the degree of bias according to age (young vs. old), one needs to pick a threshold to separate those groups. In this study we investigate two popular datasets (i.e., German and Bank) and the bias observed when using every possible age threshold in order to divide the population into “young” and “old” groups, in combination with three different Machine Learning models (i.e., Logistic Regression, Decision Tree, Support Vector Machine). Our results show that age thresholds do not only impact the intensity of bias in these datasets, but also the direction (i.e., which population group receives a favourable outcome). For the two investigated datasets, we present a selection of suitable age thresholds. We also found strong and very strong correlations between the dataset bias and the respective bias of trained classification models, in 83% of the cases studied. CCS CONCEPTS • Social and professional topics → User characteristics; • General and reference → Empirical studies. ACM Reference Format: Max Hort and Federica Sarro. 2022. Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness. In International Workshop on Equirable Data and Technology (FairWare ’22), May 9, 2022, Pittsburgh, PA, USA. ACM, New York, NY, USA, 8 pages. https://doi.org/10.1145/3524491.3527308\",\"PeriodicalId\":287874,\"journal\":{\"name\":\"2022 IEEE/ACM International Workshop on Equitable Data & Technology (FairWare)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/ACM International Workshop on Equitable Data & Technology (FairWare)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3524491.3527308\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM International Workshop on Equitable Data & Technology (FairWare)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3524491.3527308","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
软件公平的最新进展调查了在对待不同人群时的偏见,这些群体是根据性别、种族和年龄等属性设计的。群体分为特权群体(优惠待遇)和非特权群体(不利待遇)。为了真实地呈现现实世界,并根据年龄(年轻与年老)衡量偏见的程度,需要选择一个阈值来区分这些群体。在本研究中,我们研究了两个流行的数据集(即德国和银行),并结合三种不同的机器学习模型(即逻辑回归,决策树,支持向量机),使用每个可能的年龄阈值将人口划分为“年轻”和“年老”群体时观察到的偏差。我们的研究结果表明,年龄阈值不仅影响这些数据集中的偏差强度,而且影响方向(即哪个人群获得有利的结果)。对于两个调查数据集,我们提出了合适的年龄阈值的选择。我们还发现,在83%的研究案例中,数据集偏差与训练分类模型的各自偏差之间存在很强和非常强的相关性。•社会和专业主题→用户特征;•一般和参考→实证研究。ACM参考格式:Max Hort and Federica Sarro。2022。特权与非特权群体:年龄属性对公平影响的实证研究。公平数据和技术国际研讨会(FairWare ' 22), 2022年5月9日,美国宾夕法尼亚州匹兹堡。ACM,纽约,美国,8页。https://doi.org/10.1145/3524491.3527308
Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness
Recent advances in software fairness investigate bias in the treatment of different population groups, which are devised based on attributes such as gender, race and age. Groups are divided into privileged groups (favourable treatment) and unprivileged groups (unfavourable treatment). To truthfully represent the real world and to measure the degree of bias according to age (young vs. old), one needs to pick a threshold to separate those groups. In this study we investigate two popular datasets (i.e., German and Bank) and the bias observed when using every possible age threshold in order to divide the population into “young” and “old” groups, in combination with three different Machine Learning models (i.e., Logistic Regression, Decision Tree, Support Vector Machine). Our results show that age thresholds do not only impact the intensity of bias in these datasets, but also the direction (i.e., which population group receives a favourable outcome). For the two investigated datasets, we present a selection of suitable age thresholds. We also found strong and very strong correlations between the dataset bias and the respective bias of trained classification models, in 83% of the cases studied. CCS CONCEPTS • Social and professional topics → User characteristics; • General and reference → Empirical studies. ACM Reference Format: Max Hort and Federica Sarro. 2022. Privileged and Unprivileged Groups: An Empirical Study on the Impact of the Age Attribute on Fairness. In International Workshop on Equirable Data and Technology (FairWare ’22), May 9, 2022, Pittsburgh, PA, USA. ACM, New York, NY, USA, 8 pages. https://doi.org/10.1145/3524491.3527308