Signal, error, or bias? exploring the uses of scores from observation systems

IF 1.6 3区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

Educational Assessment Evaluation and Accountability Pub Date : 2024-01-30 DOI:10.1007/s11092-024-09427-8

Mark White, Kirsti Klette

{"title":"Signal, error, or bias? exploring the uses of scores from observation systems","authors":"Mark White, Kirsti Klette","doi":"10.1007/s11092-024-09427-8","DOIUrl":null,"url":null,"abstract":"<p>Scores from observational measures of teaching have recently been put to many uses within school systems, including communicating a standard of practice and providing teacher feedback, identifying teachers for professional development, monitoring system equity, and making employment decisions. In each of these uses, observation scores are interpreted as representing some aspect of the enacted instruction or teachers’ capacity to enact instruction, as seen through the observation systems lens for understanding teaching quality. The quality of these interpretations, or the extent to which observation scores are composed of a signal that accurately reflects the interpretation, has important implications for the overall validity of uses of observation systems. Starting from an explicit conceptualization of instruction, this paper combines generalizability theory and hierarchical linear modelling approaches to decompose observation scores to explore the extent to which scores from observation systems are composed of signal, error, and bias across four different uses (i.e., teacher feedback, professional development, monitoring system equity, and employment decisions) of scores. We show that the quality of observation scores may depend more on what scores are interpreted as representing (i.e., the proposed use) than on the specific observation rubric being used. Further, we show that rater errors and biases are a major threat to any attempt to interpret observation scores as capturing the observation system’s understanding of teaching quality. We discuss implications for using scores from observation systems.</p>","PeriodicalId":46725,"journal":{"name":"Educational Assessment Evaluation and Accountability","volume":"15 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Educational Assessment Evaluation and Accountability","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1007/s11092-024-09427-8","RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}

引用次数: 0

Abstract

Scores from observational measures of teaching have recently been put to many uses within school systems, including communicating a standard of practice and providing teacher feedback, identifying teachers for professional development, monitoring system equity, and making employment decisions. In each of these uses, observation scores are interpreted as representing some aspect of the enacted instruction or teachers’ capacity to enact instruction, as seen through the observation systems lens for understanding teaching quality. The quality of these interpretations, or the extent to which observation scores are composed of a signal that accurately reflects the interpretation, has important implications for the overall validity of uses of observation systems. Starting from an explicit conceptualization of instruction, this paper combines generalizability theory and hierarchical linear modelling approaches to decompose observation scores to explore the extent to which scores from observation systems are composed of signal, error, and bias across four different uses (i.e., teacher feedback, professional development, monitoring system equity, and employment decisions) of scores. We show that the quality of observation scores may depend more on what scores are interpreted as representing (i.e., the proposed use) than on the specific observation rubric being used. Further, we show that rater errors and biases are a major threat to any attempt to interpret observation scores as capturing the observation system’s understanding of teaching quality. We discuss implications for using scores from observation systems.

Abstract Image

查看原文本刊更多论文

信号、误差还是偏差？探索观察系统评分的用途

近来，教学观测的分数在学校系统中被广泛应用，包括传达实践标准和提供教师反馈、确定教师的专业发展、监督系统的公平性以及做出聘用决定。在上述每一种用途中，观察分数都被解释为代表了教学的某些方面或教师开展教学的能力，正如通过观察系统的视角来理解教学质量一样。这些解释的质量，或者说观察分数在多大程度上是由能准确反映解释的信号组成的，对观察系统使用的整体有效性有着重要影响。本文从明确的教学概念出发，结合可概括性理论和分层线性建模方法，对观察评分进行分解，以探讨观察系统的评分在四种不同用途（即教师反馈、专业发展、监测系统公平性和就业决策）中由信号、误差和偏差组成的程度。我们的研究表明，观察评分的质量可能更多地取决于评分被解释为代表什么（即建议的用途），而不是所使用的具体观察评分标准。此外，我们还表明，任何试图将观察评分解释为观察系统对教学质量的理解的做法，都会受到评分者误差和偏见的严重威胁。我们讨论了使用观察系统评分的意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Educational Assessment Evaluation and Accountability EDUCATION & EDUCATIONAL RESEARCH-

CiteScore

9.40

自引率

2.60%

发文量

期刊介绍： The main objective of this international journal is to advance knowledge and dissemination of research on and about assessment, evaluation and accountability of all kinds and on various levels as well as in all fields of education. The journal provides readers with an understanding of the rich contextual nature of evaluation, assessment and accountability in education. The journal is theory-oriented and methodology-based and seeks to connect research, policy making and practice. The journal publishes outstanding empirical works, peer-reviewed by eminent scholars around the world.Aims and Scope in more detail: The main objective of this international journal is to advance knowledge and dissemination of research on and about evaluation, assessment and accountability: - of all kinds (e.g. person, programme, organisation), - on various levels (state, regional, local), - in all fields of education (primary, secondary, higher education/tertiary, as well as non-school sector) and across all different life phases (e.g. adult education/andragogy/Human Resource Management/professional development).The journal provides readers with an understanding of the rich contextual nature of evaluation, assessment and accountability in education. The journal is theory-oriented and methodology-based and seeks to connect research, policy making and practice. Therefore, the journal explores and discusses: - theories of evaluation, assessment and accountability, - function, role, aims and purpose of evaluation, assessment and accountability, - impact of evaluation, assessment and accountability, - methodology, design and methods of evaluation, assessment and accountability, - principles, standards and quality of evaluation, assessment and accountability, - issues of planning, coordinating, conducting, reporting of evaluation, assessment and accountability.The journal also covers the quality of different instruments or procedures or approaches which are used for evaluation, assessment and accountability.The journal only includes research findings from evaluation, assessment and accountability, if the design or approach of it is meta-reflected in the article.The journal publishes outstanding empirical works, peer-reviewed by eminent scholars around the world.