Measurement-Interdisciplinary Research and Perspectives: Latest Articles

Exploring Rater Accuracy Using Unfolding Models Combined with Topic Models: Incorporating Supervised Latent Dirichlet Allocation
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2022-01-02 DOI: 10.1080/15366367.2021.1915094
Jordan M. Wheeler, G. Engelhard, Jue Wang
Abstract: Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure raters' scoring accuracy by capturing the discrepancy between criterion and operational ratings, placing essays on an unfolding continuum with an ideal-point location. Essay unfolding locations indicate how difficult it is for raters to score an essay accurately. This study aims to explore a substantive interpretation of the unfolding scale based on a supervised Latent Dirichlet Allocation (sLDA) model. We investigate the relationship between latent topics extracted using sLDA and unfolding locations with a sample of essays (n = 100) obtained from an integrated writing assessment. Results show that (a) three latent topics moderately explain (r² = 0.561) essay locations defined by the unfolding scale and (b) failing to use and/or cite the source articles led to essays that are difficult to score accurately.
Pages: 34-46
Citations: 1
Now in JMP® Pro: Structural Equation Modeling
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2022-01-02 DOI: 10.1080/15366367.2022.2014446
Pages: 1
Citations: 0
Using SAS PROC IRT for Multidimensional Item Response Theory Analysis
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2022-01-02 DOI: 10.1080/15366367.2021.1976090
Ki Cole, Insu Paek
Abstract: Statistical Analysis Software (SAS) is a widely used tool for data management and analysis across a variety of fields. The item response theory procedure (PROC IRT) performs unidimensional and multidimensional item response theory (IRT) analysis for dichotomous and polytomous data. This review summarizes the features of PROC IRT specifically for multidimensional data, with examples provided for simple structure data, complex structure data, and bifactor data. Instructive examples for dichotomous data (using the Rasch and 2-parameter logistic models) and polytomous data (using the graded response model) are given. Explanations of the syntax are also presented.
Pages: 49-55
Citations: 1
A Comparison of Common IRT Model-selection Methods with Mixed-Format Tests
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2021.1878779
Yong Luo
Abstract: To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight model-selection methods with mixed-format data to select the correct combination of IRT models. Findings of the simulation study indicate that DIC, WAIC, and LOO had excellent statistical power to choose the correct IRT model combination. They performed comparably with LRT, slightly better than AIC, and considerably better than BIC, AICc, and SABIC. In addition, the performances of the three Bayesian methods were more stable than those of AIC and LRT regardless of the sample size and ability distribution. The eight model-selection methods were applied to a real dataset for demonstration purposes.
Pages: 199-212
Citations: 1
Now in JMP® Pro: Structural Equation Modeling
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2021.1982169
Pages: 1
Citations: 0
Resources for Identifying Measurement Instruments for Social Science Research
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2021.1950486
R. Schumacker, Stefanie A. Wind, Lauren F. Holmes
Abstract: A variety of resources are available from which researchers can identify measurement instruments, including peer-reviewed journal articles, collections of technical information about published instruments, and electronic databases that are sponsored by universities, testing organizations, and other groups. Although these resources are widespread, many researchers are not aware of them. We provide a brief overview of several selected resources that researchers can use to identify measurement instruments for social science research.
Pages: 250-257
Citations: 0
Applying the Rasch Model in Social Sciences Using R and BlueSky Statistics
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2021.1940667
David Torres Irribarra
Pages: 246-249
Citations: 0
The 2013-15 Decline in NAEP Mathematics: What it Teaches Us about NAEP and the Common Core
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2021.1873062
Gregory Camilli
Abstract: After 25 years with small to moderate gains in performance in mathematics, scores on the National Assessment of Educational Progress (NAEP) main assessment declined between 2013 and 2015 in Grades 4 and 8. Previous research has suggested the decline may be linked to the implementation of the Common Core State Standards (CCSS). In this article, the decline in the NAEP composite score is shown to be driven primarily by losses in the content strands of Geometry and of Data Analysis, Statistics, and Probability. A gain in fractions achievement is also evident in an item-level examination of the NAEP results, but not in reported NAEP scores. These effects are discussed with respect to the CCSS, the rationale for evaluating national progress, and a potential redesign of the NAEP assessment.
Pages: 236-245
Citations: 0
Evaluating Six Approaches to Handling Zero-Frequency Scores under Equipercentile Equating
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-10-02 DOI: 10.1080/15366367.2020.1855034
Ting Sun, S. Y. Kim
Abstract: In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in equipercentile equating when an observed-score distribution involves zero-frequency scores. A simulation study involving two levels of test length (30 and 50 items), five levels of sample size (100, 500, 1000, 3000, and 5000), and two levels of similarity in score distributions between two forms was conducted to assess these methods in terms of equating accuracy. Results revealed that presmoothing was the most accurate method in estimating the equipercentile equating relationship when the population distributions for two forms differ with respect to the form of score distributions. When the populations have a similar score distribution, the presmoothing method was also found to be the most accurate method with longer tests (50 items). Furthermore, the performance of these methods does not vary as a function of the number of zero-frequency scores. This study informs practitioners of approaches to handling a zero-frequency issue with equipercentile equating that lead to more accurate equating results.
Pages: 213-235
Citations: 0
An Investigation of Item Calibration Methods in Multistage Testing
IF 1
Measurement-Interdisciplinary Research and Perspectives Pub Date : 2021-07-03 DOI: 10.1080/15366367.2021.1878778
L. Cai, Anthony D. Albano, L. Roussos
Abstract: Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item calibration with MST. This study used simulated data based on an operational program to investigate the performance of four item calibration methods under a 1–3 MST design. Conditions included routing module length, routing rule, and sample size. Calibration methods were evaluated based on item and person parameter recovery and classification accuracy. Results indicated that calibration with fixed common item parameters and concurrent calibration assuming a single ability distribution similarly outperformed both separate calibration with linking and concurrent calibration with the multiple-group procedure.
Pages: 163-178
Citations: 2