Journal of applied measurement: Latest Articles, Page 2

Bootstrap Estimate of Bias for Intraclass Correlation.

Journal of applied measurement Pub Date : 2020-01-01

Xiaofeng Steven Liu, Kelvin Terrell Pompey

Citations: 0

Evaluating the Impact of Multidimensionality on Type I and Type II Error Rates using the Q-Index Item Fit Statistic for the Rasch Model.

Journal of applied measurement Pub Date : 2020-01-01 DOI: 10.31219/osf.io/kh7vq

Samantha Estrada

Abstract: To understand the role of fit statistics in Rasch measurement is simple: applied researchers can only benefit from the desirable properties of the Rasch model when the data fit the model. The purpose of the current study was to assess the robustness of the Q-Index (Ostini and Nering, 2006) and to compare its performance with the currently popular fit statistics known as MSQ Infit, MSQ Outfit, and standardized Infit and Outfit (ZSTDs) under varying conditions of test length, sample size, item difficulty distribution (normal and uniform), and dimensionality, using a Monte Carlo simulation. The Type I and Type II error rates were also examined across fit indices. This study provides applied researchers with guidelines on the robustness and appropriateness of the Q-Index, which is an alternative to the currently available item fit statistics. The Q-Index was slightly more sensitive to the levels of multidimensionality set in the study, while MSQ Infit, MSQ Outfit, and standardized Infit and Outfit (ZSTDs) failed to identify the multidimensional conditions. The Type I error rate of the Q-Index was lower than that of the other fit indices; however, the Type II error rate was higher than the anticipated beta = .20 across all fit indices.

Volume 21, Issue 4, pp. 496-514
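
The error rates mentioned in the abstract can be illustrated with a minimal sketch of how empirical Type I and Type II error rates are typically tallied in a Monte Carlo item-fit study. This is not the study's code: the function name `error_rates`, the data layout, and the toy flags are all hypothetical, and the flags would in practice come from whichever fit statistic and cutoff is being evaluated.

```python
def error_rates(flagged, misfitting):
    """Tally empirical Type I and Type II error rates.

    flagged[r][i] -- True if item i was flagged as misfitting in replication r
    misfitting[i] -- True if item i was generated to violate the model
    (Both structures are hypothetical inputs for illustration.)
    """
    false_alarms = 0   # model-consistent items that were flagged
    fit_decisions = 0  # total decisions made on model-consistent items
    misses = 0         # model-violating items that were NOT flagged
    misfit_decisions = 0
    for replication in flagged:
        for was_flagged, is_misfit in zip(replication, misfitting):
            if is_misfit:
                misfit_decisions += 1
                misses += not was_flagged
            else:
                fit_decisions += 1
                false_alarms += was_flagged
    type1 = false_alarms / fit_decisions
    type2 = misses / misfit_decisions
    return type1, type2


# Toy example: 4 items, the last one simulated as misfitting, 2 replications.
misfitting = [False, False, False, True]
flagged = [
    [False, False, True, True],    # one false alarm, misfit item detected
    [False, False, False, False],  # no false alarms, misfit item missed
]
type1, type2 = error_rates(flagged, misfitting)
print(f"Type I rate: {type1:.3f}, Type II rate: {type2:.3f}")
```

Under this convention, a Type II rate above .20 means the statistic missed the simulated misfit in more than 20% of the decisions on misfitting items.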

Citations: 0

Examining the Pre-service School Principals' Impromptu Speech Skills with a Many-Facet Rasch Model.

Journal of applied measurement Pub Date : 2020-01-01

Mingchuan Hsieh, Akihito Kamata

Citations: 0

Assessing Differential Statement Functioning in Polytomous Multidimensional Pairwise Comparison Items.

Journal of applied measurement Pub Date : 2020-01-01

Xue-Lan Qiu

Citations: 0

A Psychometric Replication of Fan (1998) Item Response Theory and Classical Test Theory: An Empirical Comparison of their Item/Person Statistics.

Journal of applied measurement Pub Date : 2020-01-01

Nicholas Marosszeky, E Arthur Shores, Michael P Jones, Rassoul Sadeghi

Abstract: Streiner, Norman and Cairney (2015), "Health Measurement Scales: A practical guide to their development and use", now in its fifth edition, is one of the foundational texts of the health outcomes movement. It states that "the differences between scales constructed with IRT and CTT are trivial" (Streiner, Norman and Cairney, 2015, p. 299). This statement is representative of the view which emphasizes the equivalence of True-Score Theory (TST), also known as Classical Test Theory (CTT), and the Rasch Measurement Model (RMM). This view is widely held and has been one factor in limiting the application of the RMM in the development of health outcome measures. However, this equivalence view relies heavily on a paper by Fan (1998), which examined the item statistics derived from TST, IRT (Item Response Theory), and the RMM for a large educational dataset. While subject to a number of theoretical and practical criticisms from an RMM perspective, this paper has not been replicated with a large sample. This paper, by replicating and extending the paper by Fan (1998), challenges the finding that item difficulty indexes derived from high and low ability samples using TST techniques are invariant. They are not. On the other hand, item locations derived from the RMM have a high degree of invariance. This secondary data analysis, by working through the methods used by Fan (1998), also demonstrates that the magnitude of correlation coefficients alone cannot be used to determine the invariance of item difficulty indexes. An investigation into the linearity of the correlations using scatter plots is also required. Finally, an item analysis derived from the item difficulty indexes, which displays a picture of the test as a whole, shows that, for this large sample, the differences between scales constructed with TST and the RMM are not trivial.

Volume 21, Issue 4, pp. 456-480
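
The point about correlation magnitude versus linearity can be sketched numerically. The example below is an illustration under strong simplifying assumptions, not the authors' analysis: each ability group is collapsed to a single ability value (`theta_low`, `theta_high`), the item difficulties are an arbitrary grid, and responses are assumed to follow the Rasch model exactly. All names are hypothetical.

```python
import math


def rasch_p(theta, b):
    """Rasch probability of a correct response for ability theta, difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def pearson(x, y):
    """Plain Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (c - my) for a, c in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((c - my) ** 2 for c in y)
    return sxy / math.sqrt(sxx * syy)


def logit(p):
    return math.log(p / (1.0 - p))


# Assumed setup: 13 items spread from -3 to +3 logits, two ability groups.
difficulties = [-3.0 + 0.5 * k for k in range(13)]
theta_low, theta_high = -1.0, 1.0

# Expected proportion correct per item (the CTT difficulty index) in each group.
p_low = [rasch_p(theta_low, b) for b in difficulties]
p_high = [rasch_p(theta_high, b) for b in difficulties]

# Correlation on the proportion-correct scale vs. on the logit scale.
r_p = pearson(p_low, p_high)
r_logit = pearson([logit(p) for p in p_low], [logit(p) for p in p_high])

print(f"r on proportion-correct scale: {r_p:.3f}")
print(f"r on logit scale:              {r_logit:.3f}")
```

In this setup the proportion-correct indexes from the two groups are strongly correlated yet related by a curve, whereas the same values on the logit scale fall on a straight line. A large correlation coefficient on its own therefore says little about invariance, which is why a scatter plot is needed.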

Citations: 0

A-priori Weighting of Items with the Rasch Model.

Journal of applied measurement Pub Date : 2020-01-01

David Andrich, Sonia Sappl

Citations: 0

Trade-Offs in the Implementation of Observational Ratings Systems.

Journal of applied measurement Pub Date : 2020-01-01

Stephen M Ponisciak, Rob Meyer, Anna Brown, Tracy Schatzberg

Citations: 0

Response Differences in Appraisals of Working Conditions among Elementary and High School Teachers.

Journal of applied measurement Pub Date : 2020-01-01

Richard G Lambert, C Missy Moore, Christopher McCarthy, Bryndle L Bottoms

Abstract: Research using the National Teacher and Principal Survey (NTPS) has consistently demonstrated that teachers' reported working conditions are related to both intentions to leave the profession and attrition (Tickle, Chang, and Kim, 2011). However, limited research evaluates teacher appraisals of job-related demands and resources as an antecedent to job dissatisfaction. We tested for differential item functioning (DIF) using a partial credit model approach within a Rasch modeling context to examine whether elementary and secondary teachers with similar overall stress levels respond to the NTPS Demands and Resources items in similar ways. For the Demands items, seven of the items displayed differences that were negligible, four were intermediate, and three items indicated large DIF contrasts. For the Resources items, 10 items displayed differences that were negligible, two were intermediate, and zero items indicated large DIF contrasts. These results indicate elementary and secondary teachers exhibit different appraisal patterns, suggesting implications for the development and use of survey data in public school settings in general, and for the use of the NTPS data in particular.

Volume 21, Issue 3, pp. 347-360

Citations: 0

Alignment of a Language Instrument Scores to CEFR Levels: Methodological and Empirical Considerations.

Journal of applied measurement Pub Date : 2020-01-01

Georgios D Sideridis, Abdulrahman Al-Samrani, Bjorn Norrbom

Abstract: The purpose of the present report was to assess congruence between a language-based national examination (termed the English placement test, EPT) and the Common European Framework of Reference for Languages (CEFR) levels. To this end, a series of methodological steps was put forth to accumulate evidence suggesting that language performance based on the EPT instrument can be split into meaningful subgroups based on theoretical considerations (expert judgement on difficulty level and CEFR correspondence) and empirical considerations (i.e., how well these levels and subgroups emerged). Participants were 2642 high school graduates who took the EPT instrument as part of their entry criteria to the university; for the purposes of the present study, only the structure subscale is presented. Items were classified as reflecting specific CEFR levels, and a person-based analysis attempted to classify individuals sharing the same behavioral patterns. Results using a latent class analysis (LCA) indicated that Pre-A1, A1, A2, B1, and B2 levels were present with regard to the structure domain of language. Results showed a strong alignment between the EPT structure domain and CEFR guidelines using various methodological approaches.

Volume 21, Issue 1, pp. 68-90

Citations: 0

Comparing Causes of Dependency: Shared Latent Trait or Dependence on Observed Response.

Journal of applied measurement Pub Date : 2020-01-01

Christine E DeMars

Citations: 0