Abstrakt: |
Statistical discrimination results when a decision-maker observes an imperfect estimate of the quality of each candidate dependent on which demographic group they belong to [1,8]. Imperfect estimates have been modelled via noise, where the variance depends on the candidate's group ([4,6,7]). Prior literature, however, is limited to simple selection problems, where a single decision-maker tries to choose the best candidates among the applications they received. In this paper, we initiate the study of statistical discrimination in matching, where multiple decision-makers are simultaneously facing selection problems from the same pool of candidates. We consider the college admission problem as first introduced in [5] and recently extended to a model with a continuum of students [3]. We propose a model where two colleges A and B observe noisy estimates of each candidate's quality, where Ws, the vector of estimates for student s, is assumed to be a bivariate normal random variable. In this setting, the estimation noise controls a new key feature of the problem, namely correlation, ρ, between the estimates of the two colleges: if the noise is high, the correlation is low and if the noise is low the correlation is high. We assume that the population of students is divided into two groups G1 and G2, and that members of these two groups are subject to different correlation levels between their grades at colleges A and B. Concretely, for each student s, their grade vector (WAs, WBs) is drawn according to a centered bivariate normal distribution with variance 1 and covariance ρGs, where Gs is the group student s belongs to. We consider the stable matching induced by this distribution and characterize how key outcome characteristics vary with the parameters, in particular with the group-dependent correlation coefficient. Our results summarize as follows: We show that the probability that a student is assigned to their first choice is independent of the student's group, but that it decreases when the correlation of either group decreases. This means that higher measurement noise (inducing lower correlation) on one group hurts not only the students of that group, but the students of all groups. We show that the probability that a student is assigned to their second choice and the probability that they remain unassigned both depend on the student's group, which reveals the presence of statistical discrimination coming from the correlation effect alone. Specifically, we find that the probability that a student remains unmatched is decreasing when the correlation of their group decreases (higher measurement noise) and when the correlation of the other group increases. In other words, the higher the measurement noise of their own group, the better off students are with regard to getting assigned a college at all. This is somewhat counter-intuitive, but is explained by the observation that with high noise (i.e., low correlation) the fact that a student is rejected from one college gives only little information about the outcome at the other college. That is, a student has an independent second chance for admission. These two comparative static results give insights on the effect of correlation on the stable matching outcome for different demographic groups and show that indeed, statistical discrimination is an important theory to understand discrimination in matching problems. We also analyze a number of special cases of our model, in particular the case of a single group, to show that even in this case correlation affects the outcome. It is interesting to notice that the effect of correlation on the number of students getting their first choice in our model is the same as in [2], i.e., a higher correlation leads to more students getting their first choice. Our work is the first to investigate statistical discrimination in the context of matching. Overall we find that group-dependent measurement noises of the candidates quality---and the resulting group-dependent correlation between the colleges' estimates---play an important role in leading to unequal outcomes for different demographic groups, and in particular underrepresentation of one of the groups. Of course, we do not argue that statistical discrimination is the only possible cause of discrimination. In particular, if there is bias in the quality estimates for one group, then it will naturally also hurt the representation of that group. We do not model bias since our primary purpose is to isolate the effect of statistical discrimination. Throughout the paper, we make a number of other simplifying assumptions (e.g., focusing on two colleges) whose purpose is also to simplify our results and isolate the effect of correlation. Our analysis, however, extends to more general contexts. [ABSTRACT FROM AUTHOR] |