Hwanggyu Lim, Danqi Zhu, Edison M. Choe, KyungT. Han, Chris
{"title":"Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework","authors":"Hwanggyu Lim, Danqi Zhu, Edison M. Choe, KyungT. Han, Chris","doi":"10.1111/jedm.12415","DOIUrl":null,"url":null,"abstract":"<p>This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of implementation. The performance of GRDIF was assessed through a simulation study and compared with existing DIF detection methods, including the generalized Mantel-Haenszel, Lasso-DIF, and alignment methods. Results showed that the GRDIF framework demonstrated well-controlled Type I error rates close to the nominal level of .05 and satisfactory power in detecting uniform, nonuniform, and mixed DIF across different simulated conditions. Each of the three GRDIF statistics, <span></span><math>\n <semantics>\n <mrow>\n <mi>G</mi>\n <mi>R</mi>\n <mi>D</mi>\n <mi>I</mi>\n <msub>\n <mi>F</mi>\n <mi>R</mi>\n </msub>\n </mrow>\n <annotation>$GRDI{{F}_R}$</annotation>\n </semantics></math>, <span></span><math>\n <semantics>\n <mrow>\n <mi>G</mi>\n <mi>R</mi>\n <mi>D</mi>\n <mi>I</mi>\n <msub>\n <mi>F</mi>\n <mi>S</mi>\n </msub>\n </mrow>\n <annotation>$GRDI{{F}_S}$</annotation>\n </semantics></math>, and <span></span><math>\n <semantics>\n <mrow>\n <mi>G</mi>\n <mi>R</mi>\n <mi>D</mi>\n <mi>I</mi>\n <msub>\n <mi>F</mi>\n <mrow>\n <mi>R</mi>\n <mi>S</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$GRDI{{F}_{RS}}$</annotation>\n </semantics></math>, effectively detected the specific type of DIF for which it was designed, with <span></span><math>\n <semantics>\n <mrow>\n <mi>G</mi>\n <mi>R</mi>\n <mi>D</mi>\n <mi>I</mi>\n <msub>\n <mi>F</mi>\n <mrow>\n <mi>R</mi>\n <mi>S</mi>\n </mrow>\n </msub>\n </mrow>\n <annotation>$GRDI{{F}_{RS}}$</annotation>\n </semantics></math> exhibiting the most robust performance across all types of DIF. The GRDIF framework outperformed other DIF detection methods under various conditions, suggesting its potential for practical applications, particularly in large-scale assessments involving multiple groups. Additionally, an empirical study demonstrated the efficacy and utility of the GRDIF framework in conducting DIF analysis with a high-stakes assessment data set.</p>","PeriodicalId":47871,"journal":{"name":"Journal of Educational Measurement","volume":"61 4","pages":"656-681"},"PeriodicalIF":1.6000,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jedm.12415","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Educational Measurement","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jedm.12415","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHOLOGY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of implementation. The performance of GRDIF was assessed through a simulation study and compared with existing DIF detection methods, including the generalized Mantel-Haenszel, Lasso-DIF, and alignment methods. Results showed that the GRDIF framework demonstrated well-controlled Type I error rates close to the nominal level of .05 and satisfactory power in detecting uniform, nonuniform, and mixed DIF across different simulated conditions. Each of the three GRDIF statistics, , , and , effectively detected the specific type of DIF for which it was designed, with exhibiting the most robust performance across all types of DIF. The GRDIF framework outperformed other DIF detection methods under various conditions, suggesting its potential for practical applications, particularly in large-scale assessments involving multiple groups. Additionally, an empirical study demonstrated the efficacy and utility of the GRDIF framework in conducting DIF analysis with a high-stakes assessment data set.
本研究提出了项目反应理论中残余差异项目功能(RDIF)检测框架的广义版本GRDIF,用于分析多群体中的差异项目功能(DIF)。GRDIF框架保留了原始RDIF框架的优点,如计算效率和易于实现。通过仿真研究评估GRDIF的性能,并与现有的DIF检测方法(包括广义Mantel-Haenszel、Lasso-DIF和对准方法)进行比较。结果表明,GRDIF框架在不同模拟条件下检测均匀、非均匀和混合DIF时具有良好的I型错误率控制,接近0.05的标称水平,并且具有令人满意的能力。GRDIF的三个统计量分别为GRDIF R $GRDI{{F}_R}$,G r d I f s $ grdi {{f} _s}$,和GRDI F RS $GRDI{{F}_{RS}}$,有效地检测了所设计的特定类型的DIF;其中GRDI F RS $GRDI{{F}_{RS}}$在所有类型的DIF中表现出最稳健的性能。GRDIF框架在各种条件下优于其他DIF检测方法,表明其具有实际应用潜力,特别是在涉及多群体的大规模评估中。此外,一项实证研究证明了GRDIF框架在使用高风险评估数据集进行DIF分析时的有效性和实用性。
期刊介绍:
The Journal of Educational Measurement (JEM) publishes original measurement research, provides reviews of measurement publications, and reports on innovative measurement applications. The topics addressed will interest those concerned with the practice of measurement in field settings, as well as be of interest to measurement theorists. In addition to presenting new contributions to measurement theory and practice, JEM also serves as a vehicle for improving educational measurement applications in a variety of settings.