Beyond normalized gain: Improved comparison of physics educational outcomes

IF 2.6 2区 教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH
Elaine Christman, Paul Miller, John Stewart
{"title":"Beyond normalized gain: Improved comparison of physics educational outcomes","authors":"Elaine Christman, Paul Miller, John Stewart","doi":"10.1103/physrevphyseducres.20.010123","DOIUrl":null,"url":null,"abstract":"This study proposes methods of reporting results of physics conceptual evaluations that more fully characterize the range of outcomes experienced by students with differing levels of prior preparation, allowing for more meaningful comparison of the outcomes of educational interventions within and across institutions. Factors leading to variation in post-test scores on the Force and Motion Conceptual Evaluation (FMCE) across different instructors, semesters, and course models in a sample collected in introductory calculus-based mechanics at a large, eastern land-grant university were examined. The sample was collected over nine years and contains a total of <math display=\"inline\" xmlns=\"http://www.w3.org/1998/Math/MathML\"><mi>N</mi><mo>=</mo><mn>4409</mn></math> matched pretest and post-test records. The data showed a systematic semester-by-semester variation in both pretest scores and ACT or SAT mathematics percentile scores. Neither the normalized gain nor Cohen’s <math display=\"inline\" xmlns=\"http://www.w3.org/1998/Math/MathML\"><mi>d</mi></math> removed the semester-to-semester variation observed in post-test scores. The local average curve plotting post-test scores against pretest scores, which we call a conceptual growth curve, allowed for the characterization of outcomes for students with different pretest scores. Regression models were used to produce an approximation to this curve. By using either the full curve or a mathematical approximation developed through linear regression, the post-test score that would be observed if a class enrolled students with a given level of prior preparation measured by pretest scores can be predicted. This predicted post-test score can then be used to calculate the predicted normalized gain if desired. These methods rely on using the natural variation of incoming student preparation at one institution to predict how a class would perform if it enrolled students with different prior preparation. The study presents an example of converting the outcomes at an institution with a weakly prepared student population to the outcomes which would have been observed if the course enrolled a more prepared student population; converting the outcomes for a different student population dramatically changed the interpretation of how the class studied was functioning.","PeriodicalId":54296,"journal":{"name":"Physical Review Physics Education Research","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Review Physics Education Research","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1103/physrevphyseducres.20.010123","RegionNum":2,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes methods of reporting results of physics conceptual evaluations that more fully characterize the range of outcomes experienced by students with differing levels of prior preparation, allowing for more meaningful comparison of the outcomes of educational interventions within and across institutions. Factors leading to variation in post-test scores on the Force and Motion Conceptual Evaluation (FMCE) across different instructors, semesters, and course models in a sample collected in introductory calculus-based mechanics at a large, eastern land-grant university were examined. The sample was collected over nine years and contains a total of N=4409 matched pretest and post-test records. The data showed a systematic semester-by-semester variation in both pretest scores and ACT or SAT mathematics percentile scores. Neither the normalized gain nor Cohen’s d removed the semester-to-semester variation observed in post-test scores. The local average curve plotting post-test scores against pretest scores, which we call a conceptual growth curve, allowed for the characterization of outcomes for students with different pretest scores. Regression models were used to produce an approximation to this curve. By using either the full curve or a mathematical approximation developed through linear regression, the post-test score that would be observed if a class enrolled students with a given level of prior preparation measured by pretest scores can be predicted. This predicted post-test score can then be used to calculate the predicted normalized gain if desired. These methods rely on using the natural variation of incoming student preparation at one institution to predict how a class would perform if it enrolled students with different prior preparation. The study presents an example of converting the outcomes at an institution with a weakly prepared student population to the outcomes which would have been observed if the course enrolled a more prepared student population; converting the outcomes for a different student population dramatically changed the interpretation of how the class studied was functioning.

Abstract Image

超越标准化收益:改进物理教育成果的比较
本研究提出了报告物理概念评价结果的方法,这些方法能更全面地描述具有不同先期准备水平的学生所经历的结果范围,从而更有意义地比较机构内和机构间教育干预的结果。在东部一所大型赠地大学的微积分力学入门课程中收集的样本中,研究了导致不同教师、不同学期和不同课程模式的 "力与运动概念评价"(FMCE)测试后得分差异的因素。样本收集历时九年,共包含 N=4409 个匹配的前测和后测记录。数据显示,每个学期的考前成绩和 ACT 或 SAT 数学百分位数成绩都有系统性的变化。无论是归一化增益还是 Cohen's d,都无法消除在测验后分数中观察到的学期间差异。将后测分数与前测分数绘制成的局部平均曲线(我们称之为概念成长曲线),可用于描述不同前测分数的学生的学习结果。回归模型用于生成该曲线的近似值。通过使用完整的曲线或通过线性回归建立的数学近似值,可以预测出如果一个班级招收了具有特定水平的学生,而这些学生之前的准备情况是以考前分数来衡量的,那么该班级学生的考后分数是多少。然后,如果需要,还可以用预测的测验后分数来计算预测的归一化增益。这些方法依赖于利用一所学校新生准备情况的自然变化,来预测一个班级如果招收了具有不同先期准备情况的学生,将会取得怎样的成绩。本研究举例说明了如何将一所院校中准备薄弱的学生群体的成绩转换为如果该课程招收准备更充分的学生群体所能观察到的成绩;将不同学生群体的成绩进行转换,极大地改变了对所研究班级运作情况的解释。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Physical Review Physics Education Research
Physical Review Physics Education Research Social Sciences-Education
CiteScore
5.70
自引率
41.90%
发文量
84
审稿时长
32 weeks
期刊介绍: PRPER covers all educational levels, from elementary through graduate education. All topics in experimental and theoretical physics education research are accepted, including, but not limited to: Educational policy Instructional strategies, and materials development Research methodology Epistemology, attitudes, and beliefs Learning environment Scientific reasoning and problem solving Diversity and inclusion Learning theory Student participation Faculty and teacher professional development
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信