Item Analysis of Multiple Choice and Extended Matching Questions in the Final MBBS Medicine and Therapeutics Examination

Journal of Medical Education Development Pub Date : 2022-10-19 DOI:10.5812/jme-129450

Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder

{"title":"Item Analysis of Multiple Choice and Extended Matching Questions in the Final MBBS Medicine and Therapeutics Examination","authors":"Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder","doi":"10.5812/jme-129450","DOIUrl":null,"url":null,"abstract":"Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.","PeriodicalId":30594,"journal":{"name":"Journal of Medical Education Development","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Education Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5812/jme-129450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.

查看原文本刊更多论文

MBBS医学与治疗学期末考试中多项选择题和扩展匹配题的项目分析

背景:世界上大多数大学都采用多项选择题(MCQ)考试形式来评估医学教育。然而，传统mcq和扩展匹配问题(emq)的适用性和优势仍在争论中。目的:本研究的主要目的是对EMQ和传统MCQ格式在MBBS毕业考试中的表现进行全面的比较分析。方法:我们对西印度群岛大学四个校区的532名考生在2019年MBBS医学和治疗学期末考试期间发放的80份emq和200份mcq进行了项目分析。考试成绩测量包括集中倾向、项目辨析、信度、项目难度和分心效能。结果:532名参加考试的学生，emq的最高、最低和平均(±SD)得分分别为93、41和69.0(±9.8)分;mcq分别为82、41和62.7(±7.4)。EMQ和MCQ评分在总体失败中的预测值分别为0.67 (95% CI = 0.39, 0.87)和0.89 (95% CI = 0.65, 0.98)。emq和mcq的KR-20系数分别为0.52 ~ 0.70和0.71 ~ 0.79。在所有四组学生中，具有两个或两个以上功能干扰因素的问题比例在mcq组中始终高于emq组。结论:MCQ更能预测整体失败，具有较高的项目间信度，使MCQ格式更适合高风险考试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Education Development

CiteScore

0.30

自引率

0.00%

发文量