项目反应理论与经典测试理论:项目/人统计的实证比较。

Journal of applied measurement Pub Date : 2020-01-01

Nicholas Marosszeky, E Arthur Shores, Michael P Jones, Rassoul Sadeghi

{"title":"项目反应理论与经典测试理论:项目/人统计的实证比较。","authors":"Nicholas Marosszeky, E Arthur Shores, Michael P Jones, Rassoul Sadeghi","doi":"","DOIUrl":null,"url":null,"abstract":"Streiner, Norman and Cairney (2015) \"Health Measurement Scales: A practical guide to their development and use\", now in its fifth edition, is one of the foundational texts of the health outcomes movement. It states that \"the differences between scales constructed with IRT and CTT are trivial.\" (Streiner, Norman and Cairney, 2015, p. 299) This statement is representative of the view which emphasizes the equivalence of True-Score Theory (TST) (also known as Classical Test Theory [CTT]) and the Rasch Measurement Model [RMM]). This view is widely held and has been one factor in limiting the application of RMM in the development of health outcome measures. However, this equivalence view relies heavily on a paper by Fan (1998) which examined the item statistics derived from TST, IRT (Item Response Theory) and the RMM for a large educational dataset. While subject to a number of theoretical and practical criticisms from a RMM perspective this paper has not been replicated with a large sample. This paper by replicating and extending the paper by Fan (1998) challenges the finding that item difficulty indexes derived from high and low ability samples using TST techniques are invariant. They are not. On the other hand, item locations derived from the RMM have a high degree of invariance. This secondary data analysis, by working through the methods used by Fan (1998) also demonstrates that a reliance on the magnitude of correlational coefficients cannot be used to determine the invariance of item difficulty indexes. An investigation into the linearity of the correlations using scatter plots is also required. Finally, an item analysis derived from the item difficulty indexes which displays a picture of the test as a whole shows that, for this large sample, the differences between scales constructed with TST and the RMM are not trivial.","PeriodicalId":73608,"journal":{"name":"Journal of applied measurement","volume":"21 4","pages":"456-480"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Psychometric Replication of Fan (1998) Item Response Theory and Classical Test Theory: An Empirical Comparison of their Item/Person Statistics.\",\"authors\":\"Nicholas Marosszeky, E Arthur Shores, Michael P Jones, Rassoul Sadeghi\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Streiner, Norman and Cairney (2015) \\\"Health Measurement Scales: A practical guide to their development and use\\\", now in its fifth edition, is one of the foundational texts of the health outcomes movement. It states that \\\"the differences between scales constructed with IRT and CTT are trivial.\\\" (Streiner, Norman and Cairney, 2015, p. 299) This statement is representative of the view which emphasizes the equivalence of True-Score Theory (TST) (also known as Classical Test Theory [CTT]) and the Rasch Measurement Model [RMM]). This view is widely held and has been one factor in limiting the application of RMM in the development of health outcome measures. However, this equivalence view relies heavily on a paper by Fan (1998) which examined the item statistics derived from TST, IRT (Item Response Theory) and the RMM for a large educational dataset. While subject to a number of theoretical and practical criticisms from a RMM perspective this paper has not been replicated with a large sample. This paper by replicating and extending the paper by Fan (1998) challenges the finding that item difficulty indexes derived from high and low ability samples using TST techniques are invariant. They are not. On the other hand, item locations derived from the RMM have a high degree of invariance. This secondary data analysis, by working through the methods used by Fan (1998) also demonstrates that a reliance on the magnitude of correlational coefficients cannot be used to determine the invariance of item difficulty indexes. An investigation into the linearity of the correlations using scatter plots is also required. Finally, an item analysis derived from the item difficulty indexes which displays a picture of the test as a whole shows that, for this large sample, the differences between scales constructed with TST and the RMM are not trivial.\",\"PeriodicalId\":73608,\"journal\":{\"name\":\"Journal of applied measurement\",\"volume\":\"21 4\",\"pages\":\"456-480\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of applied measurement\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of applied measurement","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

斯特雷纳、诺曼和凯恩(2015)《健康测量量表:编制和使用的实用指南》现已出版第五版，是健康成果运动的基础文本之一。它指出“用IRT和CTT构建的量表之间的差异是微不足道的。”(Streiner, Norman and Cairney, 2015, p. 299)这一说法代表了强调真分数理论(TST)(也称为经典测试理论[CTT])和Rasch测量模型[RMM]等价的观点。这一观点被广泛接受，并且是限制在制定保健结果措施中应用RMM的一个因素。然而，这种等效性观点在很大程度上依赖于范(1998)的一篇论文，该论文对大型教育数据集的项目统计数据进行了研究，这些统计数据来自TST、IRT(项目反应理论)和RMM。虽然从RMM的角度来看，这篇论文受到了许多理论和实践的批评，但没有得到大样本的复制。本文复制并扩展了Fan(1998)的论文，挑战了使用TST技术从高能力和低能力样本中得出的项目难度指数不变的发现。事实并非如此。另一方面，从RMM派生的项目位置具有高度的不变性。通过Fan(1998)使用的方法进行的二次数据分析也表明，不能使用对相关系数大小的依赖来确定项目难度指数的不变性。还需要使用散点图对相关性的线性进行调查。最后，从项目难度指数中得出的项目分析显示了测试的整体情况，对于这个大样本，用TST和RMM构建的量表之间的差异并非微不足道。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

本刊更多论文

A Psychometric Replication of Fan (1998) Item Response Theory and Classical Test Theory: An Empirical Comparison of their Item/Person Statistics.

Streiner, Norman and Cairney (2015) "Health Measurement Scales: A practical guide to their development and use", now in its fifth edition, is one of the foundational texts of the health outcomes movement. It states that "the differences between scales constructed with IRT and CTT are trivial." (Streiner, Norman and Cairney, 2015, p. 299) This statement is representative of the view which emphasizes the equivalence of True-Score Theory (TST) (also known as Classical Test Theory [CTT]) and the Rasch Measurement Model [RMM]). This view is widely held and has been one factor in limiting the application of RMM in the development of health outcome measures. However, this equivalence view relies heavily on a paper by Fan (1998) which examined the item statistics derived from TST, IRT (Item Response Theory) and the RMM for a large educational dataset. While subject to a number of theoretical and practical criticisms from a RMM perspective this paper has not been replicated with a large sample. This paper by replicating and extending the paper by Fan (1998) challenges the finding that item difficulty indexes derived from high and low ability samples using TST techniques are invariant. They are not. On the other hand, item locations derived from the RMM have a high degree of invariance. This secondary data analysis, by working through the methods used by Fan (1998) also demonstrates that a reliance on the magnitude of correlational coefficients cannot be used to determine the invariance of item difficulty indexes. An investigation into the linearity of the correlations using scatter plots is also required. Finally, an item analysis derived from the item difficulty indexes which displays a picture of the test as a whole shows that, for this large sample, the differences between scales constructed with TST and the RMM are not trivial.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of applied measurement

自引率

0.00%

发文量