Opening the black box of psychometrics

IF 4.7 2区 教育学 Q1 EDUCATION, SCIENTIFIC DISCIPLINES
Jonathan J. Wisco, Jason Organ
{"title":"Opening the black box of psychometrics","authors":"Jonathan J. Wisco,&nbsp;Jason Organ","doi":"10.1002/ase.70091","DOIUrl":null,"url":null,"abstract":"<p>The term “psychometrics” is inherently intimidating to the average health sciences educator and scholar, and yet, the foundational principles of the discipline—determining the impact of latent variables that contribute to human behavior—are important for determining whether the tools we use as educators and educational scholars are consistent at measuring what they were intended to measure. The terms “reliability” and “validity” themselves conjure images of head-scratching and question marks in bubble clouds. In the simplest of terms, reliability is a mathematical term for precision, whereas validity is a mathematical term for accuracy. The dartboard analogy visualization is apropos here: Darts thrown in the same vicinity on a dartboard reflect precision or reliability; darts thrown at the intended target of the bullseye reflect accuracy or validity. In an ideal world, we want our tools measuring performance to be both reliable and valid.</p><p>Need to know if a rater measuring performance on an Objective Structured Clinical Exam (OSCE) is observing the same phenomena across students based on a grading rubric?Perform a test–retest analysis. Need to know if two different raters for that same OSCE are observing the same phenomena? Perform an inter-rater reliability analysis. Need to know if that survey you used as a data collection instrument is reliable across subjects? Perform a Cronbach's alpha analysis. Need to know if an exam you wrote was written clearly without introducing unintended bias and measuring what you intended to assess? Perform an item analysis. These are just some examples of why psychometrics matter.</p><p>There are so many different analyses that can be performed to measure reliability and validity. This alone is an intimidating attribute of psychometrics. Add the fact that all of these analyses are mathematically based, and that leaves educators and scholars in the black box of analysis trust and an urge to seek out the resident statistician. (But there's hope! The math is all based on “y = mx + b,” which is a generalized linear model, so it is really not that bad!) We did not go into health sciences education for the math, after all, so why all the fuss?!</p><p>Why all the fuss is <i>exactly</i> what this month's Guest Editor, Dr. Adam Wilson of Rush University in Chicago, IL, has explored in the pages that follow. One of us (JMO) has known Dr. Wilson since the last days of his graduate work at Indiana University, where even then he was pushing anatomy educators to think more critically about our work. In the years since he graduated, he has become one of the world's leading experts in quantitative approaches to enhancing anatomy, medical, and health education. This is one of the primary reasons he was invited into an Associate Editor role at <i>Anatomical Sciences Education</i>, a role he has used to elevate the quality of scholarship published in our pages. Our journal has a long history of publishing psychometric work,<span><sup>1-10</sup></span> including several important articles from Dr. Wilson's research group.<span><sup>11-19</sup></span> This month's special issue expands on those studies and brings psychometrics to the forefront so we can all begin to think about enhancing outcomes for our learners with more rigor.</p><p>The most important thing to know about psychometrics was stated eloquently by Dr. Wilson in his introduction to this month's issue<span><sup>19</sup></span>: “No one wants to make a high-stakes decision about a learner only to later discover the instrument upon which the decision was made failed to accurately or reliably portray the individual's knowledge, skills, behaviors, or perceptions.” He continues by saying, “If there is one takeaway worth emphasizing in this special issue, it is that <i>the quality of research conclusions and data-driven decisions is only as good as the foundational psychometric evidence upon which these arguments are built</i>. Such foundational evidence ought never to be assumed and should always be transparently disclosed for proper contextualization.” Wise words for sure. These are principles to which all educators and educational scholars aspire.</p><p>If you want to become a better medical sciences educator and scholar, learning what is in the black box of psychometrics is as essential to you as learning how a generalized linear model is used to compare means of sample populations for a basic scientist or clinical scientist. The difference here is that educators and educational scholars are measuring human performance, which can determine the path and success of individual students. In our estimation, as educators and humanists, the impact has significant ramifications.</p><p>This special issue was assembled with you in mind, to help you understand better why psychometrics should be important to you and how it informs your impact as an educator and scholar on the lives of your students. So, dive into that black box!</p><p><b>Jonathan J. Wisco:</b> Conceptualization; writing – original draft; writing – review and editing; methodology; resources; supervision. <b>Jason Organ:</b> Conceptualization; writing – original draft; methodology; writing – review and editing; resources; supervision.</p><p>N/A.</p><p>The authors declare no conflicts of interest.</p><p>N/A.</p><p>N/A.</p>","PeriodicalId":124,"journal":{"name":"Anatomical Sciences Education","volume":"18 8","pages":"747-748"},"PeriodicalIF":4.7000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ase.70091","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anatomical Sciences Education","FirstCategoryId":"95","ListUrlMain":"https://anatomypubs.onlinelibrary.wiley.com/doi/10.1002/ase.70091","RegionNum":2,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION, SCIENTIFIC DISCIPLINES","Score":null,"Total":0}
引用次数: 0

Abstract

The term “psychometrics” is inherently intimidating to the average health sciences educator and scholar, and yet, the foundational principles of the discipline—determining the impact of latent variables that contribute to human behavior—are important for determining whether the tools we use as educators and educational scholars are consistent at measuring what they were intended to measure. The terms “reliability” and “validity” themselves conjure images of head-scratching and question marks in bubble clouds. In the simplest of terms, reliability is a mathematical term for precision, whereas validity is a mathematical term for accuracy. The dartboard analogy visualization is apropos here: Darts thrown in the same vicinity on a dartboard reflect precision or reliability; darts thrown at the intended target of the bullseye reflect accuracy or validity. In an ideal world, we want our tools measuring performance to be both reliable and valid.

Need to know if a rater measuring performance on an Objective Structured Clinical Exam (OSCE) is observing the same phenomena across students based on a grading rubric?Perform a test–retest analysis. Need to know if two different raters for that same OSCE are observing the same phenomena? Perform an inter-rater reliability analysis. Need to know if that survey you used as a data collection instrument is reliable across subjects? Perform a Cronbach's alpha analysis. Need to know if an exam you wrote was written clearly without introducing unintended bias and measuring what you intended to assess? Perform an item analysis. These are just some examples of why psychometrics matter.

There are so many different analyses that can be performed to measure reliability and validity. This alone is an intimidating attribute of psychometrics. Add the fact that all of these analyses are mathematically based, and that leaves educators and scholars in the black box of analysis trust and an urge to seek out the resident statistician. (But there's hope! The math is all based on “y = mx + b,” which is a generalized linear model, so it is really not that bad!) We did not go into health sciences education for the math, after all, so why all the fuss?!

Why all the fuss is exactly what this month's Guest Editor, Dr. Adam Wilson of Rush University in Chicago, IL, has explored in the pages that follow. One of us (JMO) has known Dr. Wilson since the last days of his graduate work at Indiana University, where even then he was pushing anatomy educators to think more critically about our work. In the years since he graduated, he has become one of the world's leading experts in quantitative approaches to enhancing anatomy, medical, and health education. This is one of the primary reasons he was invited into an Associate Editor role at Anatomical Sciences Education, a role he has used to elevate the quality of scholarship published in our pages. Our journal has a long history of publishing psychometric work,1-10 including several important articles from Dr. Wilson's research group.11-19 This month's special issue expands on those studies and brings psychometrics to the forefront so we can all begin to think about enhancing outcomes for our learners with more rigor.

The most important thing to know about psychometrics was stated eloquently by Dr. Wilson in his introduction to this month's issue19: “No one wants to make a high-stakes decision about a learner only to later discover the instrument upon which the decision was made failed to accurately or reliably portray the individual's knowledge, skills, behaviors, or perceptions.” He continues by saying, “If there is one takeaway worth emphasizing in this special issue, it is that the quality of research conclusions and data-driven decisions is only as good as the foundational psychometric evidence upon which these arguments are built. Such foundational evidence ought never to be assumed and should always be transparently disclosed for proper contextualization.” Wise words for sure. These are principles to which all educators and educational scholars aspire.

If you want to become a better medical sciences educator and scholar, learning what is in the black box of psychometrics is as essential to you as learning how a generalized linear model is used to compare means of sample populations for a basic scientist or clinical scientist. The difference here is that educators and educational scholars are measuring human performance, which can determine the path and success of individual students. In our estimation, as educators and humanists, the impact has significant ramifications.

This special issue was assembled with you in mind, to help you understand better why psychometrics should be important to you and how it informs your impact as an educator and scholar on the lives of your students. So, dive into that black box!

Jonathan J. Wisco: Conceptualization; writing – original draft; writing – review and editing; methodology; resources; supervision. Jason Organ: Conceptualization; writing – original draft; methodology; writing – review and editing; resources; supervision.

N/A.

The authors declare no conflicts of interest.

N/A.

N/A.

Abstract Image

Abstract Image

Abstract Image

打开心理测量的黑盒子。
“心理测量学”这个术语对于普通的健康科学教育者和学者来说,本质上是令人生畏的,然而,这门学科的基本原则——确定对人类行为有贡献的潜在变量的影响——对于确定我们作为教育者和教育学者使用的工具在测量他们想要测量的东西时是否一致是很重要的。“可靠性”和“有效性”这两个词本身就会让人联想到令人挠头的画面和泡沫云中的问号。用最简单的术语来说,可靠性是精确的数学术语,而有效性是精确的数学术语。用飞镖类比的可视化方法在这里是恰当的:掷在飞镖附近的飞镖反映了精确度或可靠性;掷向靶心预定目标的飞镖反映了准确性或有效性。在理想的情况下,我们希望我们的工具既可靠又有效。需要知道评分员在客观结构化临床考试(OSCE)中的表现是否根据评分标准在学生中观察到相同的现象?执行测试-重测试分析。需要知道同一个欧安组织的两个不同评级员是否观察到相同的现象?进行内部可靠性分析。需要知道作为数据收集工具的调查是否可靠?进行Cronbach alpha分析。需要知道你写的考试是否写得很清楚,没有引入无意的偏见,也没有衡量你想要评估的内容?执行项目分析。这些只是心理测量学重要的一些例子。有很多不同的分析可以用来衡量信度和效度。这本身就是心理测量学的一个令人生畏的特性。再加上所有这些分析都是以数学为基础的事实,这让教育工作者和学者们陷入了分析信任的黑箱中,并迫切需要寻找常驻统计学家。(但是还有希望!数学都是基于“y = mx + b”,这是一个广义的线性模型,所以它真的没有那么糟糕!)毕竟,我们进入健康科学教育并不是为了数学,所以为什么大惊小怪呢?这个月的客座编辑,伊利诺斯州芝加哥拉什大学的亚当·威尔逊博士,将在接下来的几页中探讨为什么会有这么多的争论。我们中的一位(JMO)从威尔逊博士在印第安纳大学毕业的最后几天就认识他了,那时他还在推动解剖学教育工作者对我们的工作进行更批判性的思考。在毕业后的几年里,他已经成为世界上在定量方法加强解剖学、医学和健康教育方面的领先专家之一。这是他被邀请担任《解剖科学教育》副主编的主要原因之一,他利用这个角色来提高我们页面上发表的学术成果的质量。我们的期刊有很长的发表心理测量研究的历史,包括威尔逊博士研究小组的几篇重要文章。本月的特刊对这些研究进行了扩展,并将心理测量学带到了最前沿,这样我们就可以开始思考如何更严格地提高学习者的学习效果。关于心理测量学,我们需要知道的最重要的一点是,威尔逊博士在本月这期杂志的导言中雄辩地阐述了这一点:“没有人愿意对一个学习者做出一个高风险的决定,但后来却发现,做出决定的工具不能准确、可靠地描述这个人的知识、技能、行为或感知。”他接着说,“如果在这期特刊中有什么值得强调的,那就是研究结论和数据驱动的决策的质量只与这些论点所依据的基本心理测量证据一样好。这样的基础证据永远不应该被假设,而且应该始终透明地披露,以便进行适当的背景分析。”当然是明智的话。这些是所有教育工作者和教育学者所追求的原则。如果你想成为一名更好的医学教育家和学者,学习心理测量学黑盒子里的东西对你来说就像学习如何使用广义线性模型来比较基础科学家或临床科学家的样本总体均值一样重要。这里的区别在于,教育者和教育学者衡量的是人的表现,这可以决定个别学生的道路和成功。据我们估计,作为教育工作者和人文主义者,这种影响具有重大的后果。这期特刊是为了帮助你更好地理解为什么心理测量学对你很重要,以及它如何影响你作为教育者和学者对学生生活的影响。所以,潜入那个黑匣子吧!Jonathan J. Wisco:概念化;写作——原稿;写作——审阅和编辑;方法;资源;监督。 Jason Organ:概念化;写作——原稿;方法;写作——审阅和编辑;资源;supervision.N / A。作者声明没有利益冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Anatomical Sciences Education
Anatomical Sciences Education Anatomy/education-
CiteScore
10.30
自引率
39.70%
发文量
91
期刊介绍: Anatomical Sciences Education, affiliated with the American Association for Anatomy, serves as an international platform for sharing ideas, innovations, and research related to education in anatomical sciences. Covering gross anatomy, embryology, histology, and neurosciences, the journal addresses education at various levels, including undergraduate, graduate, post-graduate, allied health, medical (both allopathic and osteopathic), and dental. It fosters collaboration and discussion in the field of anatomical sciences education.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信