{"title":"用修改后的nedelsky程序通过公共项目测试等效验证标准设置。","authors":"R M Smith, L J Gross","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>It is often impossible to validate cut scores set using judged item review methods due to the fact that many high stakes testing programs attempt to limit the number of common items across consecutively administered forms. However, over time, with a stable item pool, secondary links through other test administrations allow the use of common item equating to test the stability of the judged cut scores. In this study five forms of a basic science examination administered over a three year period in a national board testing program were analyzed to determine the stability of judged cut scores. The stability was determined by comparison of the judged cut scores with the equated cut scores derived by the Rasch common item equating technique. The results indicate cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over five administrations.</p>","PeriodicalId":79673,"journal":{"name":"Journal of outcome measurement","volume":"1 2","pages":"164-72"},"PeriodicalIF":0.0000,"publicationDate":"1997-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Validating standard setting with a modified nedelsky procedure through common item test equating.\",\"authors\":\"R M Smith, L J Gross\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>It is often impossible to validate cut scores set using judged item review methods due to the fact that many high stakes testing programs attempt to limit the number of common items across consecutively administered forms. However, over time, with a stable item pool, secondary links through other test administrations allow the use of common item equating to test the stability of the judged cut scores. In this study five forms of a basic science examination administered over a three year period in a national board testing program were analyzed to determine the stability of judged cut scores. The stability was determined by comparison of the judged cut scores with the equated cut scores derived by the Rasch common item equating technique. The results indicate cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over five administrations.</p>\",\"PeriodicalId\":79673,\"journal\":{\"name\":\"Journal of outcome measurement\",\"volume\":\"1 2\",\"pages\":\"164-72\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1997-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of outcome measurement\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of outcome measurement","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Validating standard setting with a modified nedelsky procedure through common item test equating.
It is often impossible to validate cut scores set using judged item review methods due to the fact that many high stakes testing programs attempt to limit the number of common items across consecutively administered forms. However, over time, with a stable item pool, secondary links through other test administrations allow the use of common item equating to test the stability of the judged cut scores. In this study five forms of a basic science examination administered over a three year period in a national board testing program were analyzed to determine the stability of judged cut scores. The stability was determined by comparison of the judged cut scores with the equated cut scores derived by the Rasch common item equating technique. The results indicate cut scores derived from the modified Nedelsky procedure were within equating error of the Rasch equated cut scores over five administrations.