{"title":"An Analysis of TOEFL CBT Writing Prompt Difficulty and Comparability for Different Gender Groups. Research Reports. Report 76. RR-04-05.","authors":"H. Breland, Yong-Won Lee, M. Najarian, E. Muraki","doi":"10.1002/J.2333-8504.2004.TB01932.X","DOIUrl":null,"url":null,"abstract":"This investigation of the comparability of writing assessment prompts was conducted in two phases. In an exploratory Phase I, 47 writing prompts administered in the computer-based Test of English as a Foreign Language™ (TOEFL® CBT) from July through December 1998 were examined. Logistic regression procedures were used to estimate prompt difficulty and gender effects. A panel of experts reviewed selected prompts, and a taxonomy of prompt characteristics was developed and related to prompt difficulty and gender differences. In Phase II, 87 prompts administered from July 1998 through March 2000 were analyzed. All of the prompts used in Phase I, together with 40 new prompts, were analyzed using the larger Phase II database. Recommendations are made for statistical quality control procedures to identify less comparable prompts.","PeriodicalId":347951,"journal":{"name":"Educational Testing Service","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"40","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Educational Testing Service","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/J.2333-8504.2004.TB01932.X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 40
Abstract
This investigation of the comparability of writing assessment prompts was conducted in two phases. In an exploratory Phase I, 47 writing prompts administered in the computer-based Test of English as a Foreign Language™ (TOEFL® CBT) from July through December 1998 were examined. Logistic regression procedures were used to estimate prompt difficulty and gender effects. A panel of experts reviewed selected prompts, and a taxonomy of prompt characteristics was developed and related to prompt difficulty and gender differences. In Phase II, 87 prompts administered from July 1998 through March 2000 were analyzed. All of the prompts used in Phase I, together with 40 new prompts, were analyzed using the larger Phase II database. Recommendations are made for statistical quality control procedures to identify less comparable prompts.
对写作评估提示可比性的调查分两个阶段进行。在探索性的第一阶段,研究人员对1998年7月至12月期间参加托福®CBT (computer-based Test of English as a Foreign Language™)考试的47个写作题目进行了测试。使用逻辑回归程序来估计提示难度和性别的影响。一个专家小组审查了选定的提示语,并制定了提示语特征分类,并与提示语困难和性别差异有关。在第二阶段,分析了从1998年7月到2000年3月管理的87个提示。第一阶段使用的所有提示符,以及40个新的提示符,都使用第二阶段更大的数据库进行了分析。对统计质量控制程序提出了建议,以确定较少可比性的提示。