Journal of Educational and Behavioral Statistics: Latest Articles, Page 9

Using Sequence Mining Techniques for Understanding Incorrect Behavioral Patterns on Interactive Tasks

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-05-03 DOI: 10.3102/10769986211010467

Esther Ulitzsch, Qiwei He, S. Pohl

Abstract: Interactive tasks designed to elicit real-life problem-solving behavior are rapidly becoming more widely used in educational assessment. Incorrect responses to such tasks can occur for a variety of different reasons such as low proficiency levels, low metacognitive strategies, or motivational issues. We demonstrate how behavioral patterns associated with incorrect responses can, in part, be understood, supporting insights into the different sources of failure on a task. To this end, we make use of sequence mining techniques that leverage the information contained in time-stamped action sequences commonly logged in assessments with interactive tasks for (a) investigating what distinguishes incorrect behavioral patterns from correct ones and (b) identifying subgroups of examinees with similar incorrect behavioral patterns. Analyzing a task from the Programme for the International Assessment of Adult Competencies 2012 assessment, we find incorrect behavioral patterns to be more heterogeneous than correct ones. We identify multiple subgroups of incorrect behavioral patterns, which point toward different levels of effort and lack of different subskills needed for solving the task. Albeit focusing on a single task, meaningful patterns of major differences in how examinees approach a given task that generalize across multiple tasks are uncovered. Implications for the construction and analysis of interactive tasks as well as the design of interventions for complex problem-solving skills are derived.

Vol. 47, pp. 3–35.

Citations: 15

A Case Study of Nonresponse Bias Analysis in Educational Assessment Surveys

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-04-09 DOI: 10.3102/10769986221141074

Yajuan Si, R. Little, Ya Mo, N. Sedransk

Citations: 3

Introduction to JEBS Special Issue on NAEP Linked Aggregate Scores

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-04-01 DOI: 10.3102/10769986211001480

D. McCaffrey, S. Culpepper

Abstract: The Stanford Education Data Archive (SEDA) was created by Sean Reardon, Andrew Ho, Demetra Kalogrides, and their colleagues using annual state summative test score data retrieved from the EDFacts Restricted-Use Files and publicly available NAEP data from the National Center for Education Statistics. SEDA provides test score data on a common scale across all states for mathematics and reading language arts for students in Grades 3 through 8 for almost all schools, districts, and counties in the United States. An online tool (edopportunity.org) allows users to visually compare schools and districts from anywhere in the country. Data also include various covariates at each of these levels, and all the data can be downloaded for free for analysis. These data have the potential to be a very valuable resource for researchers, educators, policy makers, and possibly even the general public. The catch is that there is no common standardized test administered to students in Grades 3 through 8 in all schools and school districts in all states. NAEP is only administered in a relatively small sample of schools in each state and only to students in Grades 4 and 8 and only every other year. The school data in SEDA are derived from the annual tests administered by each state in accordance with federal regulations. Reardon, Ho, Kalogrides, and colleagues start with aggregate data of the numbers of students in each school or district meeting various performance levels on their state standardized tests. State tests are on different scales and test somewhat different content. They also use different cutoffs for performance levels that are not common across states. Reardon, Ho, Kalogrides, and colleagues convert these frequencies to means and standard deviations for the scores in each school or district using the Heteroskedastic Ordered Probit model that was developed into a series of papers in JEBS (Lockwood et al., 2018; Reardon et al., 2017; Shear & Reardon, 2021). They then link these means and standard deviations to the NAEP scale using methods described in Reardon et al. (2021). Reardon, Ho, Kalogrides, and colleagues stitched together a collection of methods to create a national data source of …

Vol. 46, No. 2, pp. 135–137.

Citations: 1

Validation Methods for Aggregate-Level Test Scale Linking: A Rejoinder

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-04-01 DOI: 10.3102/1076998621994540

Andrew D. Ho, Sean F. Reardon, Demetra Kalogrides

Abstract: In this issue, Reardon, Kalogrides, and Ho developed precision-adjusted random effects models to estimate aggregate-level linking error, for populations and subpopulations, for averages and progress over time. We are grateful to past editor Dan McCaffrey for selecting our paper as the focal article for a set of commentaries from our colleagues Daniel Bolt, Mark Davison, Alina von Davier, Tim Moses, and Neil Dorans. These commentaries reinforce important cautions and identify promising directions for future research. In this rejoinder, we clarify aspects of our originally proposed method. (1) Validation methods provide evidence of benefits and risks that different experts may weigh differently for different purposes. (2) Our proposed method differs from “standard mapping” procedures using the National Assessment of Educational Progress not only by using a linear (vs. equipercentile) link but also by targeting direct validity evidence about counterfactual aggregate scores. (3) Multilevel approaches that assume common score scales across states are indeed a promising next step for validation, and we hope that states enable researchers to use more of their common-core-era consortium test data for this purpose. Finally, we apply our linking method to an extended panel of data from 2009 to 2017 to show that linking recovery has remained stable.

Vol. 46, pp. 209–218.

Citations: 1

A Rating Scale Mixture Model to Account for the Tendency to Middle and Extreme Categories

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-03-31 DOI: 10.3102/1076998621992554

R. Colombi, S. Giordano, G. Tutz

Citations: 2

Detecting Noneffortful Responses Based on a Residual Method Using an Iterative Purification Process

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-03-29 DOI: 10.3102/1076998621994366

Yue Liu, Hongyun Liu

Abstract: The prevalence and serious consequences of noneffortful responses from unmotivated examinees are well-known in educational measurement. In this study, we propose to apply an iterative purification process based on a response time residual method with fixed item parameter estimates to detect noneffortful responses. The proposed method is compared with the traditional residual method and noniterative method with fixed item parameters in two simulation studies in terms of noneffort detection accuracy and parameter recovery. The results show that when severity of noneffort is high, the proposed method leads to a much higher true positive rate with a small increase of false discovery rate. In addition, parameter estimation is significantly improved by the strategies of fixing item parameters and iteratively cleansing. These results suggest that the proposed method is a potential solution to reduce the impact of data contamination due to severe low test-taking effort and to obtain more accurate parameter estimates. An empirical study is also conducted to show the differences in the detection rate and parameter estimates among different approaches.

Vol. 46, pp. 717–752.

Citations: 2

Item Characteristic Curve Asymmetry: A Better Way to Accommodate Slips and Guesses Than a Four-Parameter Model?

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-03-29 DOI: 10.3102/10769986211003283

Xiangyi Liao, D. Bolt

Abstract: Four-parameter models have received increasing psychometric attention in recent years, as a reduced upper asymptote for item characteristic curves can be appealing for measurement applications such as adaptive testing and person-fit assessment. However, applications can be challenging due to the large number of parameters in the model. In this article, we demonstrate in the context of mathematics assessments how the slip and guess parameters of a four-parameter model may often be empirically related. This observation also has a psychological explanation to the extent that both asymptote parameters may be manifestations of a single item complexity characteristic. The relationship between lower and upper asymptotes motivates the consideration of an asymmetric item response theory model as a three-parameter alternative to the four-parameter model. Using actual response data from mathematics multiple-choice tests, we demonstrate the empirical superiority of a three-parameter asymmetric model in several standardized tests of mathematics. To the extent that a model of asymmetry ultimately portrays slips and guesses not as purely random but rather as proficiency-related phenomena, we argue that the asymmetric approach may also have greater psychological plausibility.

Vol. 46, pp. 753–775.

Citations: 6

Monitoring Item Performance With CUSUM Statistics in Continuous Testing

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-03-08 DOI: 10.3102/1076998621994563

Yi-Hsuan Lee, C. Lewis

Citations: 3

Jenss–Bayley Latent Change Score Model With Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-02-27 DOI: 10.3102/10769986221099919

Jin Liu

Abstract: Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the change in rate-of-change at the individual level, have been developed to address this challenge. We extend an existing LCSM with the Jenss–Bayley growth curve and propose a novel expression for change scores that allows for (1) unequally spaced study waves and (2) individual measurement occasions around each wave. We also extend the existing model to estimate the individual ratio of the growth acceleration (that largely determines the trajectory shape and is viewed as the most important parameter in the Jenss–Bayley model). We present the proposed model by a simulation study and a real-world data analysis. Our simulation study demonstrates that the proposed model can estimate the parameters unbiasedly and precisely and exhibit target confidence interval coverage. The simulation study also shows that the proposed model with the novel expression for the change scores outperforms the existing model. An empirical example using longitudinal reading scores shows that the model can estimate the individual ratio of the growth acceleration and generate individual rate-of-change in practice. We also provide the corresponding code for the proposed model.

Vol. 47, pp. 507–543.

Citations: 3

Estimating Difference-Score Reliability in Pretest–Posttest Settings

IF 2.4, Zone 3 (Psychology)

Journal of Educational and Behavioral Statistics Pub Date : 2021-02-15 DOI: 10.3102/1076998620986948

Zhengguo Gu, W. Emons, K. Sijtsma

Citations: 2