{"title":"Exploring the Impact of Deleting (or Retaining) a Biased Item: A Procedure Based on Classification Accuracy.","authors":"Meltem Ozcan, Mark H C Lai","doi":"10.1177/10731911241298081","DOIUrl":null,"url":null,"abstract":"<p><p>Psychological test scores are commonly used in high-stakes settings to classify individuals. While measurement invariance across groups is necessary for valid and meaningful inferences of group differences, full measurement invariance rarely holds in practice. The classification accuracy analysis framework aims to quantify the degree and practical impact of noninvariance. However, how to best navigate the next steps remains unclear, and methods devised to account for noninvariance at the group level may be insufficient when the goal is classification. Furthermore, deleting a biased item may improve fairness but negatively affect performance, and replacing the test can be costly. We propose item-level effect size indices that allow test users to make more informed decisions by quantifying the impact of deleting (or retaining) an item on test performance and fairness, provide an illustrative example, and introduce <i>unbiasr</i>, an R package implementing the proposed methods.</p>","PeriodicalId":8577,"journal":{"name":"Assessment","volume":" ","pages":"10731911241298081"},"PeriodicalIF":3.5000,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Assessment","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1177/10731911241298081","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, CLINICAL","Score":null,"Total":0}
Abstract
Psychological test scores are commonly used in high-stakes settings to classify individuals. While measurement invariance across groups is necessary for valid and meaningful inferences of group differences, full measurement invariance rarely holds in practice. The classification accuracy analysis framework aims to quantify the degree and practical impact of noninvariance. However, how to best navigate the next steps remains unclear, and methods devised to account for noninvariance at the group level may be insufficient when the goal is classification. Furthermore, deleting a biased item may improve fairness but negatively affect performance, and replacing the test can be costly. We propose item-level effect size indices that allow test users to make more informed decisions by quantifying the impact of deleting (or retaining) an item on test performance and fairness, provide an illustrative example, and introduce unbiasr, an R package implementing the proposed methods.
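The following base-R sketch is a minimal, self-contained illustration of the classification accuracy idea the abstract refers to: selection decisions made from an observed composite are evaluated against true latent standing, with and without a biased item in the composite. All parameter values, group settings, and helper names below are hypothetical illustrations chosen for this sketch; this is not the unbiasr API.

```r
## Toy simulation of classification accuracy with one biased item.
## Hypothetical setup -- NOT the unbiasr package's interface.
set.seed(1)
n <- 1e5

# Simulate item responses for one group under a single-factor model
sim_group <- function(n, loadings, intercepts, latent_mean) {
  eta <- rnorm(n, mean = latent_mean, sd = 1)          # latent trait
  items <- sapply(seq_along(loadings), function(j)
    intercepts[j] + loadings[j] * eta + rnorm(n, sd = 0.6))
  list(eta = eta, items = items)
}

# Classification accuracy indices for a given pair of cutoffs
accuracy <- function(eta, total, eta_cut, total_cut) {
  selected  <- total >= total_cut   # decision from observed scores
  qualified <- eta >= eta_cut       # "truth" from the latent trait
  c(sensitivity   = mean(selected[qualified]),
    specificity   = mean(!selected[!qualified]),
    prop_selected = mean(selected))
}

loadings <- c(.8, .7, .75, .6, .65)
int_ref  <- rep(0, 5)
int_foc  <- c(0, 0, -0.5, 0, 0)  # item 3: intercept bias against focal group

ref <- sim_group(n, loadings, int_ref, latent_mean = 0)
foc <- sim_group(n, loadings, int_foc, latent_mean = 0)

eta_cut <- qnorm(.75)  # top 25% on the latent trait count as "qualified"

# Accuracy per group for a composite built from the retained items
report <- function(ref, foc, keep) {
  tot_ref <- rowSums(ref$items[, keep, drop = FALSE])
  tot_foc <- rowSums(foc$items[, keep, drop = FALSE])
  cut <- quantile(c(tot_ref, tot_foc), .75)  # select top ~25% overall
  rbind(reference = accuracy(ref$eta, tot_ref, eta_cut, cut),
        focal     = accuracy(foc$eta, tot_foc, eta_cut, cut))
}

report(ref, foc, keep = 1:5)            # full test, biased item retained
report(ref, foc, keep = c(1, 2, 4, 5))  # biased item deleted
```

In this toy setup, deleting item 3 should bring the focal group's selection rate and sensitivity closer to the reference group's, while the shorter composite is a somewhat noisier measure of the trait for both groups, which is exactly the fairness-versus-performance trade-off the proposed indices are meant to quantify.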
Journal Description
Assessment publishes articles in the domain of applied clinical assessment. The journal emphasizes information relevant to the use of assessment measures, including test development, validation, and interpretation practices. Its scope includes research that can inform assessment practices in mental health, forensic, medical, and other applied settings. Papers that focus on the assessment of cognitive and neuropsychological functioning, personality, and psychopathology are invited. Most papers published in Assessment report the results of original empirical research; however, integrative review articles and scholarly case studies will also be considered.