Item Analysis of Multiple Choice and Extended Matching Questions in the Final MBBS Medicine and Therapeutics Examination

Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder
{"title":"Item Analysis of Multiple Choice and Extended Matching Questions in the Final MBBS Medicine and Therapeutics Examination","authors":"Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder","doi":"10.5812/jme-129450","DOIUrl":null,"url":null,"abstract":"Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.","PeriodicalId":30594,"journal":{"name":"Journal of Medical Education Development","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Education Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5812/jme-129450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.
MBBS医学与治疗学期末考试中多项选择题和扩展匹配题的项目分析
背景:世界上大多数大学都采用多项选择题(MCQ)考试形式来评估医学教育。然而,传统mcq和扩展匹配问题(emq)的适用性和优势仍在争论中。目的:本研究的主要目的是对EMQ和传统MCQ格式在MBBS毕业考试中的表现进行全面的比较分析。方法:我们对西印度群岛大学四个校区的532名考生在2019年MBBS医学和治疗学期末考试期间发放的80份emq和200份mcq进行了项目分析。考试成绩测量包括集中倾向、项目辨析、信度、项目难度和分心效能。结果:532名参加考试的学生,emq的最高、最低和平均(±SD)得分分别为93、41和69.0(±9.8)分;mcq分别为82、41和62.7(±7.4)。EMQ和MCQ评分在总体失败中的预测值分别为0.67 (95% CI = 0.39, 0.87)和0.89 (95% CI = 0.65, 0.98)。emq和mcq的KR-20系数分别为0.52 ~ 0.70和0.71 ~ 0.79。在所有四组学生中,具有两个或两个以上功能干扰因素的问题比例在mcq组中始终高于emq组。结论:MCQ更能预测整体失败,具有较高的项目间信度,使MCQ格式更适合高风险考试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
0.30
自引率
0.00%
发文量
9
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信