Keynote: Rethinking measurement for accountable assessment

Mark R. Wilson
{"title":"Keynote: Rethinking measurement for accountable assessment","authors":"Mark R. Wilson","doi":"10.37517/978-1-74286-638-3_13","DOIUrl":null,"url":null,"abstract":"The underlying model for most formal educational measurement (e.g. standardised tests) is based on a very simple model: the student takes a test (possibly alongside other students). The complications of there being an instructional plan, actual instruction, interpretation of the outcome, and formulation of next steps, are all bypassed in considering how to model the process of measurement. There are some standard exceptions, of course: a pre-test/post-test context will involve two measurements, and attention to gain score, or similar. However, if we wish to design measurement to hold to Lehrer’s (2021) definition of ‘accountable assessment’ – as ‘actionable information for improving classroom instruction’ – then this narrow conceptualisation must be extended. In this presentation, I will posit a simple model that reflects the simple one-test context described above, and then elaborate on it by adding in a) a framework for design of the assessments that is keyed to educational interpretation, b) further rounds of data collection that can indicate changes in a student’s underlying ability, and c) provision for varied assessment modes that will allow for i) classroom-independent tasks that operate at the summative and meso levels, and ii) classroom-dependent tasks that operate at the micro level. The former are designed to provide a basis for triangulating student responses across different contexts, and the latter are designed to closely track the variation of student performance over time in a classroom instructional context. This framing will be exemplified in a in a K–5 elementary school that is seeking to improve the quality of instruction and students’ understandings of measure and arithmetic. The different levels of data collection will be instantiated by two different pieces of software, which operate at the micro level and the meso/summative levels respectively.","PeriodicalId":413895,"journal":{"name":"Research Conference 2021: Excellent progress for every student: Proceedings and Program","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Conference 2021: Excellent progress for every student: Proceedings and Program","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37517/978-1-74286-638-3_13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The underlying model for most formal educational measurement (e.g. standardised tests) is based on a very simple model: the student takes a test (possibly alongside other students). The complications of there being an instructional plan, actual instruction, interpretation of the outcome, and formulation of next steps, are all bypassed in considering how to model the process of measurement. There are some standard exceptions, of course: a pre-test/post-test context will involve two measurements, and attention to gain score, or similar. However, if we wish to design measurement to hold to Lehrer’s (2021) definition of ‘accountable assessment’ – as ‘actionable information for improving classroom instruction’ – then this narrow conceptualisation must be extended. In this presentation, I will posit a simple model that reflects the simple one-test context described above, and then elaborate on it by adding in a) a framework for design of the assessments that is keyed to educational interpretation, b) further rounds of data collection that can indicate changes in a student’s underlying ability, and c) provision for varied assessment modes that will allow for i) classroom-independent tasks that operate at the summative and meso levels, and ii) classroom-dependent tasks that operate at the micro level. The former are designed to provide a basis for triangulating student responses across different contexts, and the latter are designed to closely track the variation of student performance over time in a classroom instructional context. This framing will be exemplified in a in a K–5 elementary school that is seeking to improve the quality of instruction and students’ understandings of measure and arithmetic. The different levels of data collection will be instantiated by two different pieces of software, which operate at the micro level and the meso/summative levels respectively.
主题演讲:重新思考可问责评估的测量方法
大多数正规教育测量(如标准化测试)的基本模型是基于一个非常简单的模型:学生参加测试(可能与其他学生一起)。在考虑如何对测量过程建模时,会忽略教学计划、实际指导、结果解释以及后续步骤的制定等复杂问题。当然,也有一些标准的例外:测试前/测试后的环境将涉及两个测量,以及对获得分数的关注,或类似的。然而,如果我们希望设计测量来坚持Lehrer(2021)对“问责评估”的定义-作为“改善课堂教学的可操作信息”-那么这个狭隘的概念必须扩展。在本课程中,我将假设一个简单的模型,该模型反映了上述简单的一个测试环境,然后详细说明它通过添加)的框架设计的评估的教育解释,b)进一步的数据收集可以表明改变一个学生的潜在能力,和c)提供不同的评估模式,允许我)classroom-independent任务操作总结性和内消旋的水平,ii)在微观层面上运作的课堂相关任务。前者旨在为学生在不同情境下的反应提供三角测量基础,后者旨在密切跟踪学生在课堂教学情境中随时间变化的表现。这一框架将在一所K-5小学中得到例证,该小学正在寻求提高教学质量和学生对测量和算术的理解。不同级别的数据收集将由两个不同的软件实例化,它们分别在微观层面和中观/总结层面运行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信