{"title":"Using Bayesian Networks for Cognitive Assessment of Student Understanding of Buoyancy: A Granular Hierarchy Model","authors":"L. Wang, Sun Xiao Jian, Yan Lou Liu, Tao Xin","doi":"10.1080/08957347.2023.2172014","DOIUrl":"https://doi.org/10.1080/08957347.2023.2172014","url":null,"abstract":"ABSTRACT Cognitive diagnostic assessment based on Bayesian networks (BN) is developed in this paper to evaluate student understanding of the physical concept of buoyancy. we propose a three-order granular-hierarchy BN model which accounts for both fine-grained attributes and high-level proficiencies. Conditional independence in the BN structure is tested and utilized to validate the proposed model. The proficiency relationships are verified and the initial Q-matrix is refined. Then, an optimized granular hierarchy model is constructed based on the updated Q-matrix. All variants of the constructed models are evaluated on the basis of the prediction accuracy and the goodness-of-fit test. The experimental results demonstrate that the optimized granular-hierarchy model has the best prediction and model-fitting performance. In general, the BN method not only can provide more flexible modeling approach, but also can help validate or refine the proficiency model and the Q-matrix and this method has its unique advantage in cognitive diagnosis.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49350798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Are Large Admissions Test Coaching Effects Widespread? A Longitudinal Analysis of Admissions Test Scores","authors":"Jeffrey A. Dahlke, P. Sackett, N. Kuncel","doi":"10.1080/08957347.2023.2172018","DOIUrl":"https://doi.org/10.1080/08957347.2023.2172018","url":null,"abstract":"ABSTRACT We examine longitudinal data from 120,384 students who took a version of the PSAT/SAT in the 9th, 10th, 11th, and 12th grades. We investigate score changes over time and show that socioeconomic status (SES) is related to the degree of score improvement. We note that the 9th and 10th grade PSAT are low-stakes tests, while the operational SAT is a high-stakes test. We posit that investments in coaching would be uncommon for early PSAT administrations, and would be concentrated on efforts to prepare for the operational SAT. We compare score improvements between 9th and 10th grade with improvements between 10th and 12th grade, examining results separately by level of SES. We find similar levels of score improvement in low-stakes and high-stakes settings, with 3.4% of high-SES and 1.1% of low-SES students showing larger-than-expected score improvements, which is inconsistent with claims that high-SES students have routine access to highly effective coaching.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42421288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dissecting knowledge, guessing, and blunder in multiple choice assessments.","authors":"Rashid M Abu-Ghazalah, David N Dubins, Gregory M K Poon","doi":"10.1080/08957347.2023.2172017","DOIUrl":"10.1080/08957347.2023.2172017","url":null,"abstract":"<p><p>Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly account for guessing, knowledge and blunder using eight assessments (>9,000 responses) from an undergraduate biotechnology curriculum. A Bayesian implementation of the models, aimed at assessing their robustness to prior beliefs in examinee knowledge, showed that explicit estimators of knowledge are markedly sensitive to prior beliefs with scores as sole input. To overcome this limitation, we examined self-ranked confidence as a proxy knowledge indicator. For our test set, three levels of confidence resolved test performance. Responses rated as least confident were correct more frequently than expected from random selection, reflecting partial knowledge, but were balanced by blunder among the most confident responses. By translating evidence-based guessing and blunder rates to pass marks that statistically qualify a desired level of examinee knowledge, our approach finds practical utility in test analysis and design.</p>","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.1,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10201919/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9522330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personality Aspects and the Underprediction of Women’s Academic Performance","authors":"You Zhou, P. Sackett, Thomas Brothen","doi":"10.1080/08957347.2022.2155652","DOIUrl":"https://doi.org/10.1080/08957347.2022.2155652","url":null,"abstract":"ABSTRACT We sought to replicate prior findings that admissions tests’ underprediction of female college performance was driven in part by the omission of Big 5 personality factors from the predictive model, using 5,400 college students. We investigated gender differences in an elaborated model subdividing the Big 5 into ten aspects. We found differences at the aspect level that were not found at the factor level, and some aspects had unique relationships with academic outcomes. The findings demonstrated the effect of omitted variables on predictive bias.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46522162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Examination of Individual Ability Estimation and Classification Accuracy Under Rapid Guessing Misidentifications","authors":"Joseph A. Rios","doi":"10.1080/08957347.2022.2155653","DOIUrl":"https://doi.org/10.1080/08957347.2022.2155653","url":null,"abstract":"ABSTRACT To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and individual scores are reported. To address this limitation, the present simulation study investigates the effect of RG misclassifications on individual examinee ability estimate bias and classification accuracy when using effort-moderated (EM) scoring. This objective is accomplished by manipulating simulee ability level, RG rate, as well as misclassification type and percentage. Results showed that EM scoring significantly improved ability inferences for examinees engaging in RG; however, the effectiveness of this approach was largely dependent on misclassification type. Specifically, across ability levels, bias tended to be on average lower when falsely classifying effortful responses as RG. Although EM scoring improved bias, it was susceptible to elevated false-positive classifications of ability under high RG.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42107151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparison of Methods for Identifying Differential Step Functioning with Polytomous Item Response Data","authors":"Holmes W. Finch","doi":"10.1080/08957347.2022.2155650","DOIUrl":"https://doi.org/10.1080/08957347.2022.2155650","url":null,"abstract":"ABSTRACT Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous items when the conditional likelihood of responses to specific categories differ between groups. DSF impacts estimation of the measured trait and reduces the effectiveness of standard DIF detection methods. The purpose of this simulation study was to extend upon earlier work by comparing several methods for detecting the presence of DSF in polytomous items, including an approach based on the lasso estimation of the generalized partial credit model. Results show that the lasso GPCM technique controlled the Type I error rate while yielding power rates somewhat lower than logistic regression and the MIMIC model, which were not able to control the Type I error rate in some conditions. An empirical example is also presented, and implications of this study for practice are discussed.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47299711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance Decline as an Indicator of Generalized Test-Taking Disengagement","authors":"S. Wise, G. Kingsbury","doi":"10.1080/08957347.2022.2155651","DOIUrl":"https://doi.org/10.1080/08957347.2022.2155651","url":null,"abstract":"ABSTRACT In achievement testing we assume that students will demonstrate their maximum performance as they encounter test items. Sometimes, however, student performance can decline during a test event, which implies that the test score does not represent maximum performance. This study describes a method for identifying significant performance decline and investigated its utility as an indicator of generalized test-taking disengagement. Analysis of data from a computerized adaptive interim achievement test showed that performance decline classifications exhibited characteristics similar to those from disengagement classifications based on rapid guessing. More importantly, performance decline was found to identify disengagement by many students who would not have been identified as disengaged based on rapid-guessing behavior.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42114164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"When Should Individual Ability Estimates Be Reported if Rapid Guessing Is Present?","authors":"Joseph A. Rios","doi":"10.1080/08957347.2022.2103138","DOIUrl":"https://doi.org/10.1080/08957347.2022.2103138","url":null,"abstract":"<p><b>ABSTRACT</b></p><p>Testing programs are confronted with the decision of whether to report individual scores for examinees that have engaged in rapid guessing (RG). As noted by the <i>Standards for Educational and Psychological Testing</i>, this decision should be based on a documented criterion that determines score exclusion. To this end, a number of heuristic criteria (e.g., exclude all examinees with RG rates of 10%) have been adopted in the literature. Given that these criteria lack strong methodological support, the objective of this simulation study was to evaluate their appropriateness in terms of individual ability estimate and classification accuracy when manipulating both assessment and RG characteristics. The findings provide evidence that employing a common criterion for all examinees may be an ineffective strategy because a given RG percentage may have differing degrees of biasing effects based on test difficulty, examinee ability, and RG pattern. These results suggest that practitioners may benefit from establishing context-specific exclusion criteria that consider test purpose, score use, and targeted examinee trait levels.</p>","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Not-reached Items: An Issue of Time and of test-taking Disengagement? the Case of PISA 2015 Reading Data","authors":"Elodie Pools","doi":"10.1080/08957347.2022.2103136","DOIUrl":"https://doi.org/10.1080/08957347.2022.2103136","url":null,"abstract":"ABSTRACT Many low-stakes assessments, such as international large-scale surveys, are administered during time-limited testing sessions and some test-takers are not able to endorse the last items of the test, resulting in not-reached (NR) items. However, because the test has no consequence for the respondents, these NR items can also stem from quitting the test. This article, by means of mixture modeling, investigates heterogeneity in the onset of NR items in reading in PISA 2015. Test-taking behavior, assessed by the response times on the first items of the test, and the risk of NR item onset are modeled simultaneously in a 3-class model that distinguishes rapid, slow and typical respondents. Results suggest that NR items can come from a lack of time or from disengaged behaviors and that the relationship between the number of NR items and ability estimate can be affected by these non-effortful NR responses.","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45573999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Response Demands of Reading Comprehension Test Items: A Review of Item Difficulty Modeling Studies","authors":"Steve Ferrara, J. Steedle, R. Frantz","doi":"10.1080/08957347.2022.2103135","DOIUrl":"https://doi.org/10.1080/08957347.2022.2103135","url":null,"abstract":"ABSTRACT Item difficulty modeling studies involve (a) hypothesizing item features, or item response demands, that are likely to predict item difficulty with some degree of accuracy; and (b) entering the features as independent variables into a regression equation or other statistical model to predict difficulty. In this review, we report findings from 13 empirical item difficulty modeling studies of reading comprehension tests. We define reading comprehension item response demands as reading passage variables (e.g., length, complexity), passage-by-item variables (e.g., degree of correspondence between item and text, type of information requested), and item stem and response option variables. We report on response demand variables that are related to item difficulty and illustrate how they can be used to manage item difficulty in construct-relevant ways so that empirical item difficulties are within a targeted range (e.g., located within the Proficient or other proficiency level range on a test’s IRT scale, where intended).","PeriodicalId":51609,"journal":{"name":"Applied Measurement in Education","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49021008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}