Statistical Discrimination in Stable Matchings

Rémi Castera, P. Loiseau, Bary S. R. Pradelski
{"title":"Statistical Discrimination in Stable Matchings","authors":"Rémi Castera, P. Loiseau, Bary S. R. Pradelski","doi":"10.1145/3490486.3538364","DOIUrl":null,"url":null,"abstract":"Statistical discrimination results when a decision-maker observes an imperfect estimate of the quality of each candidate dependent on which demographic group they belong to [1,8]. Imperfect estimates have been modelled via noise, where the variance depends on the candidate's group ([4,6,7]). Prior literature, however, is limited to simple selection problems, where a single decision-maker tries to choose the best candidates among the applications they received. In this paper, we initiate the study of statistical discrimination in matching, where multiple decision-makers are simultaneously facing selection problems from the same pool of candidates. We consider the college admission problem as first introduced in [5] and recently extended to a model with a continuum of students [3]. We propose a model where two colleges A and B observe noisy estimates of each candidate's quality, where Ws, the vector of estimates for student s, is assumed to be a bivariate normal random variable. In this setting, the estimation noise controls a new key feature of the problem, namely correlation, ρ, between the estimates of the two colleges: if the noise is high, the correlation is low and if the noise is low the correlation is high. We assume that the population of students is divided into two groups G1 and G2, and that members of these two groups are subject to different correlation levels between their grades at colleges A and B. Concretely, for each student s, their grade vector (WAs, WBs) is drawn according to a centered bivariate normal distribution with variance 1 and covariance ρGs, where Gs is the group student s belongs to. We consider the stable matching induced by this distribution and characterize how key outcome characteristics vary with the parameters, in particular with the group-dependent correlation coefficient. Our results summarize as follows: We show that the probability that a student is assigned to their first choice is independent of the student's group, but that it decreases when the correlation of either group decreases. This means that higher measurement noise (inducing lower correlation) on one group hurts not only the students of that group, but the students of all groups. We show that the probability that a student is assigned to their second choice and the probability that they remain unassigned both depend on the student's group, which reveals the presence of statistical discrimination coming from the correlation effect alone. Specifically, we find that the probability that a student remains unmatched is decreasing when the correlation of their group decreases (higher measurement noise) and when the correlation of the other group increases. In other words, the higher the measurement noise of their own group, the better off students are with regard to getting assigned a college at all. This is somewhat counter-intuitive, but is explained by the observation that with high noise (i.e., low correlation) the fact that a student is rejected from one college gives only little information about the outcome at the other college. That is, a student has an independent second chance for admission. These two comparative static results give insights on the effect of correlation on the stable matching outcome for different demographic groups and show that indeed, statistical discrimination is an important theory to understand discrimination in matching problems. We also analyze a number of special cases of our model, in particular the case of a single group, to show that even in this case correlation affects the outcome. It is interesting to notice that the effect of correlation on the number of students getting their first choice in our model is the same as in [2], i.e., a higher correlation leads to more students getting their first choice. Our work is the first to investigate statistical discrimination in the context of matching. Overall we find that group-dependent measurement noises of the candidates quality---and the resulting group-dependent correlation between the colleges' estimates---play an important role in leading to unequal outcomes for different demographic groups, and in particular underrepresentation of one of the groups. Of course, we do not argue that statistical discrimination is the only possible cause of discrimination. In particular, if there is bias in the quality estimates for one group, then it will naturally also hurt the representation of that group. We do not model bias since our primary purpose is to isolate the effect of statistical discrimination. Throughout the paper, we make a number of other simplifying assumptions (e.g., focusing on two colleges) whose purpose is also to simplify our results and isolate the effect of correlation. Our analysis, however, extends to more general contexts.","PeriodicalId":209859,"journal":{"name":"Proceedings of the 23rd ACM Conference on Economics and Computation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 23rd ACM Conference on Economics and Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3490486.3538364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Statistical discrimination results when a decision-maker observes an imperfect estimate of the quality of each candidate dependent on which demographic group they belong to [1,8]. Imperfect estimates have been modelled via noise, where the variance depends on the candidate's group ([4,6,7]). Prior literature, however, is limited to simple selection problems, where a single decision-maker tries to choose the best candidates among the applications they received. In this paper, we initiate the study of statistical discrimination in matching, where multiple decision-makers are simultaneously facing selection problems from the same pool of candidates. We consider the college admission problem as first introduced in [5] and recently extended to a model with a continuum of students [3]. We propose a model where two colleges A and B observe noisy estimates of each candidate's quality, where Ws, the vector of estimates for student s, is assumed to be a bivariate normal random variable. In this setting, the estimation noise controls a new key feature of the problem, namely correlation, ρ, between the estimates of the two colleges: if the noise is high, the correlation is low and if the noise is low the correlation is high. We assume that the population of students is divided into two groups G1 and G2, and that members of these two groups are subject to different correlation levels between their grades at colleges A and B. Concretely, for each student s, their grade vector (WAs, WBs) is drawn according to a centered bivariate normal distribution with variance 1 and covariance ρGs, where Gs is the group student s belongs to. We consider the stable matching induced by this distribution and characterize how key outcome characteristics vary with the parameters, in particular with the group-dependent correlation coefficient. Our results summarize as follows: We show that the probability that a student is assigned to their first choice is independent of the student's group, but that it decreases when the correlation of either group decreases. This means that higher measurement noise (inducing lower correlation) on one group hurts not only the students of that group, but the students of all groups. We show that the probability that a student is assigned to their second choice and the probability that they remain unassigned both depend on the student's group, which reveals the presence of statistical discrimination coming from the correlation effect alone. Specifically, we find that the probability that a student remains unmatched is decreasing when the correlation of their group decreases (higher measurement noise) and when the correlation of the other group increases. In other words, the higher the measurement noise of their own group, the better off students are with regard to getting assigned a college at all. This is somewhat counter-intuitive, but is explained by the observation that with high noise (i.e., low correlation) the fact that a student is rejected from one college gives only little information about the outcome at the other college. That is, a student has an independent second chance for admission. These two comparative static results give insights on the effect of correlation on the stable matching outcome for different demographic groups and show that indeed, statistical discrimination is an important theory to understand discrimination in matching problems. We also analyze a number of special cases of our model, in particular the case of a single group, to show that even in this case correlation affects the outcome. It is interesting to notice that the effect of correlation on the number of students getting their first choice in our model is the same as in [2], i.e., a higher correlation leads to more students getting their first choice. Our work is the first to investigate statistical discrimination in the context of matching. Overall we find that group-dependent measurement noises of the candidates quality---and the resulting group-dependent correlation between the colleges' estimates---play an important role in leading to unequal outcomes for different demographic groups, and in particular underrepresentation of one of the groups. Of course, we do not argue that statistical discrimination is the only possible cause of discrimination. In particular, if there is bias in the quality estimates for one group, then it will naturally also hurt the representation of that group. We do not model bias since our primary purpose is to isolate the effect of statistical discrimination. Throughout the paper, we make a number of other simplifying assumptions (e.g., focusing on two colleges) whose purpose is also to simplify our results and isolate the effect of correlation. Our analysis, however, extends to more general contexts.
稳定匹配中的统计歧视
当决策者观察到每个候选人的素质取决于他们属于哪个人口群体时,统计歧视就会产生[1,8]。不完美的估计已经通过噪声建模,其中方差取决于候选人的群体([4,6,7])。然而,先前的文献仅限于简单的选择问题,即单个决策者试图从他们收到的申请中选择最佳候选人。在本文中,我们开始研究匹配中的统计歧视,其中多个决策者同时面临从同一候选人池中选择问题。我们认为大学录取问题在b[5]中首次引入,最近扩展到一个具有连续学生b[3]的模型。我们提出了一个模型,其中两个学院a和B观察每个候选人质量的噪声估计,其中Ws,学生s的估计向量,被假设为一个二元正态随机变量。在这种情况下,估计噪声控制了问题的一个新的关键特征,即两个学院的估计之间的相关性ρ:如果噪声高,相关性就低,如果噪声低,相关性就高。我们假设学生总体分为G1和G2两组,这两组的成员在A和b大学的成绩具有不同的相关水平。具体来说,对于每个学生s,他们的成绩向量(WAs, WBs)根据方差为1,协方差为ρGs的中心二元正态分布绘制,其中g为学生s所属的组。我们考虑了由这种分布引起的稳定匹配,并描述了关键结果特征如何随参数变化,特别是随组相关系数变化。我们的结果总结如下:我们表明,学生被分配到第一选择的概率与学生所在的组无关,但当任何一组的相关性降低时,它就会降低。这意味着对一个组较高的测量噪声(导致较低的相关性)不仅伤害了该组的学生,而且伤害了所有组的学生。我们表明,学生被分配到第二选择的概率和他们未被分配的概率都取决于学生的群体,这揭示了仅来自相关效应的统计歧视的存在。具体来说,我们发现,当一个学生所在组的相关性降低(测量噪声较高),而另一个组的相关性增加时,这个学生保持不匹配的概率就会降低。换句话说,他们所在群体的测量噪声越高,学生被分配到大学的可能性就越大。这有点违反直觉,但可以通过观察来解释,即在高噪声(即低相关性)的情况下,学生被一所大学拒绝的事实只能提供很少关于另一所大学结果的信息。也就是说,学生有独立的第二次入学机会。这两个比较静态结果揭示了相关性对不同人口群体稳定匹配结果的影响,表明统计歧视确实是理解匹配问题中歧视的重要理论。我们还分析了我们模型的一些特殊情况,特别是单个组的情况,以表明即使在这种情况下,相关性也会影响结果。有趣的是,在我们的模型中,相关性对获得第一选择的学生数量的影响与[2]中相同,即相关性越高,获得第一选择的学生越多。我们的工作是第一个调查匹配背景下的统计歧视。总体而言,我们发现候选人质量的群体依赖测量噪声-以及由此产生的大学估计之间的群体依赖相关性-在导致不同人口群体的不平等结果方面发挥了重要作用,特别是其中一个群体的代表性不足。当然,我们并不认为统计上的歧视是造成歧视的唯一可能原因。特别是,如果对一个群体的质量估计存在偏差,那么它自然也会损害该群体的代表性。我们没有建立偏差模型,因为我们的主要目的是隔离统计歧视的影响。在整篇论文中,我们做了一些其他的简化假设(例如,专注于两所大学),其目的也是为了简化我们的结果并隔离相关性的影响。然而,我们的分析扩展到更一般的情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信