{"title":"Assessing the Assessment: Rubrics Training for Pre-Service and New In-Service Teachers.","authors":"Michael G. Lovorn, A. Rezaei","doi":"10.7275/SJT6-5K13","DOIUrl":"https://doi.org/10.7275/SJT6-5K13","url":null,"abstract":"","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74739725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Best Practices in Using Large, Complex Samples: The Importance of Using Appropriate Weights and Design Effect Compensation.","authors":"J. Osborne","doi":"10.7275/2KYG-M659","DOIUrl":"https://doi.org/10.7275/2KYG-M659","url":null,"abstract":"Large surveys often use probability sampling in order to obtain representative samples, and these data sets are valuable tools for researchers in all areas of science. Yet many researchers are not formally prepared to appropriately utilize these resources. Indeed, users of one popular dataset were generally found not to have modeled the analyses to take account of the complex sample (Johnson & Elliott, 1998) even when publishing in highly-regarded journals. It is well known that failure to appropriately model the complex sample can substantially bias the results of the analysis. Examples presented in this paper highlight the risk of error of inference and mis-estimation of parameters from failure to analyze these data sets appropriately.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79354428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Graphical Transition Table for Communicating Status and Growth.","authors":"Adam E. Wyse, Ji Zeng, Joseph A. Martineau","doi":"10.7275/T9R9-D719","DOIUrl":"https://doi.org/10.7275/T9R9-D719","url":null,"abstract":"This paper introduces a simple and intuitive graphical display for transition table based accountability models that can be used to communicate information about students’ status and growth simultaneously. This graphical transition table includes the use of shading to convey year to year transitions and different sized letters for performance categories to depict yearly status. Examples based on Michigan’s transition table used on their Michigan Educational Assessment Program (MEAP) assessments are provided to illustrate the utility of the graphical transition table in practical contexts. Additional potential applications of the graphical transition table are also suggested.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81305010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Too Reliable to Be True? Response Bias as a Potential Source of Inflation in Paper-and-Pencil Questionnaire Reliability.","authors":"Eyal Péer, Eyal Gamliel","doi":"10.7275/E482-N724","DOIUrl":"https://doi.org/10.7275/E482-N724","url":null,"abstract":"When respondents answer paper-and-pencil (PP) questionnaires, they sometimes modify their responses to correspond to previously answered items. As a result, this response bias might artificially inflate the reliability of PP questionnaires. We compared the internal consistency of PP questionnaires to computerized questionnaires that presented a different number of items on a computer screen simultaneously. Study 1 showed that a PP questionnaire’s internal consistency was higher than that of the same questionnaire presented on a computer screen with one, two or four questions per screen. Study 2 replicated these findings to show that internal consistency was also relatively high when all questions were shown on one screen. This suggests that the differences found in Study 1 were not due to the difference in presentation medium. Thus, this paper suggests that reliability measures of PP questionnaires might be inflated because of a response bias resulting from participants cross-checking their answers against ones given to previous questions.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85264898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Is a Picture Is Worth a Thousand Words? Creating Effective Questionnaires with Pictures.","authors":"Laura Reynolds-Keefer, Robert Johnson","doi":"10.7275/BGPE-A067","DOIUrl":"https://doi.org/10.7275/BGPE-A067","url":null,"abstract":"In developing attitudinal instruments for young children, researchers, program evaluators, and clinicians often use response scales with pictures or images (e.g., smiley faces) as anchors. This article considers highlights connections between word-based and picture based Likert scales and highlights the value in translating conventions used in word-based Likert scales to those with pictures or images.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87150928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Applying Tests of Equivalence for Multiple Group Comparisons: Demonstration of the Confidence Interval Approach.","authors":"Shayna A. Rusticus, C. Lovato","doi":"10.7275/D5WF-5P77","DOIUrl":"https://doi.org/10.7275/D5WF-5P77","url":null,"abstract":"Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not equivalent to showing that the groups are comparable. This paper provides a comparison of the confidence interval approach to equivalency testing and the more traditional analysis of variance (ANOVA) method using both continuous and rating scale data from three geographically separate medical education teaching sites. Equivalency testing is recommended as a better alternative to demonstrating comparability through its examination of whether mean differences between two groups are small enough that these differences can be considered practically unimportant and thus, the groups can be treated as equivalent.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85520336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Evaluating the Quantity-Quality Trade-off in the Selection of Anchor Items: a Vertical Scaling Approach","authors":"Florian Pibal, H. Cesnik","doi":"10.7275/NNCY-EW26","DOIUrl":"https://doi.org/10.7275/NNCY-EW26","url":null,"abstract":"When administering tests across grades, vertical scaling is often employed to place scores from different tests on a common overall scale so that test-takers’ progress can be tracked. In order to be able to link the results across grades, however, common items are needed that are included in both test forms. In the literature there seems to be no clear agreement about the ideal number of common items. In line with some scholars, we argue that a greater number of anchor items bear a higher risk of unwanted effects like displacement, item drift, or undesired fit statistics and that having fewer psychometrically well-functioning anchor items can sometimes be more desirable. In order to demonstrate this, a study was conducted that included the administration of a reading-comprehension test to 1,350 test-takers across grades 6 to 8. In employing a step-by-step approach, we found that the paradox of high item drift in test administrations across grades can be mitigated and eventually even be eliminated. At the same time, a positive side effect was an increase in the explanatory power of the empirical data. Moreover, it was found that scaling adjustment can be used to evaluate the effectiveness of a vertical scaling approach and, in certain cases, can lead to more accurate results than the use of calibrated anchor items.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91005580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Do more online instructional ratings lead to better prediction of instructor quality","authors":"S. Sanders, Bhavneet Walia, Joel Potter, Kenneth W. Linna","doi":"10.7275/NHNN-1N13","DOIUrl":"https://doi.org/10.7275/NHNN-1N13","url":null,"abstract":"Online instructional ratings are taken by many with a grain of salt. This study analyzes the ability of said ratings to estimate the official (university-administered) instructional ratings of the same respective university instructors. Given self-selection among raters, we further test whether more online ratings of instructors lead to better prediction of official ratings in terms of both R-squared value and root mean squared error. We lastly test and correct for heteroskedastic error terms in the regression analysis to allow for the first robust estimations on the topic. Despite having a starkly different distribution of values, online ratings explain much of the variation in official ratings. This conclusion strengthens, and root mean squared error typically falls, as one considers regression subsets over which instructors have a larger number of online ratings. Though (public) online ratings do not mimic the results of (semi-private) official ratings, they provide a reliable source of information for predicting official ratings. There is strong evidence that this reliability increases in online rating usage.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85329773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Termination Criteria for Computerized Classification Testing.","authors":"Nathan A. Thompson","doi":"10.7275/WQ8M-ZK25","DOIUrl":"https://doi.org/10.7275/WQ8M-ZK25","url":null,"abstract":"Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as “pass” and “fail.” Like adaptive testing for point estimation of ability, the key component is the termination criterion, namely the algorithm that decides whether to classify the examinee and end the test or to continue and administer another item. This paper applies a newly suggested termination criterion, the generalized likelihood ratio (GLR), to CCT. It also explores the role of the indifference region in the specification of likelihood-ratio based termination criteria, comparing the GLR to the sequential probability ratio test. Results from simulation studies suggest that the GLR is always at least as efficient as existing methods.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74696461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"FORMATIVE USE OF ASSESSMENT INFORMATION: IT'S A PROCESS, SO LET'S SAY WHAT WE MEAN","authors":"Robert Good","doi":"10.7275/3YVY-AT83","DOIUrl":"https://doi.org/10.7275/3YVY-AT83","url":null,"abstract":"The term formative assessment is often used to describe a type of assessment. The purpose of this paper is to challenge the use of this phrase given that formative assessment as a noun phrase ignores the well-established understanding that it is a process more than an object. A model that combines content, context, and strategies is presented as one way to view the process nature of assessing formatively. The alternate phrase formative use of assessment information is suggested as a more appropriate way to describe how content, context, and strategies can be used together in order to close the gap between where a student is performing currently and the intended learning goal. Let’s start with an elementary grammar review: adjectives modify nouns; adverbs modify verbs, adjectives, and other adverbs. Applied to recent assessment literature, the term formative assessment would therefore contain the adjective formative modifying the noun assessment, creating a noun phrase representing a thing or object. Indeed, formative assessment as a noun phrase is regularly juxtaposed to summative assessment in both purpose and timing. Formative assessment is commonly understood to occur during instruction with the intent to identify relative strengths and weaknesses and guide instruction, while summative assessment occurs after a unit of instruction with the intent of measuring performance levels of the skills and content related to the unit of instruction (Stiggins, Arter, Chappuis, & Chappuis, 2006). Distinguishing formative and summative assessments in this manner may have served an important introductory purpose, however using formative as a descriptor of a type of assessment has had ramifi cations that merit critical consideration. Given that formative assessment has received considerable attention in the literature over the last 20 or so years, this article contends that it is time to move beyond the well-established broad distinctions between formative and summative assessments and consider the subtle – yet important – distinction between the term formative assessment as an object and the intended meaning. The focus here is to suggest that if we want to realize the true potential of formative practices in our classrooms, then we need to start saying what we mean.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75781333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}