Multiple choice questions as a tool for summative assessment in medical schools

Bulletin of Egyptian Society for Physiological Sciences Pub Date : 2024-01-01 DOI:10.21608/besps.2023.238298.1152

Saeed Alqahtani

{"title":"Multiple choice questions as a tool for summative assessment in medical schools","authors":"Saeed Alqahtani","doi":"10.21608/besps.2023.238298.1152","DOIUrl":null,"url":null,"abstract":"Objectives: To evaluate the quality of multiple-choice questions (MCQs) used in a summative assessment of a Central Nervous System (CNS) module at the Faculty of Medicine, Jazan University. Methods: Item analysis was conducted on a 70-item MCQ exam administered to 57 medical students after completing the CNS module. Various departments teach the module utilizing a systems-based curriculum. Item difficulty, discrimination, reliability, and standard error of measurement were analyzed. Results: Item difficulty ranged from 0.3 to 0.9 on the difficulty index for most items (moderate difficulty). Most items (62/70) appropriately discriminated between high-and low-scoring students. Reliability was very high (Kuder-Richardson 20 = 0.91). The standard error of measurement was 3.7. Analysis of validity evidence included evaluation of content validity through alignment of exam items with module learning objectives using a test blueprint, as well as analysis of internal structure validity supported by item difficulty and discrimination statistics. Discrimination indices above 0.2 indicate items distinguished well between students performing at the upper and lower score ranges. Feasibility of MCQs was evidenced by the resources required. Minimal training and no specialized equipment or longer administration/scoring times were needed compared to other assessment methods. MCQs were well-accepted by students and faculty involved in test development and implementation. Conclusion: Psychometric analysis of item and exam characteristics provides validity evidence that scores from this MCQ reasonably represent CNS module achievement. While not capturing higher-order skills, MCQs proved a feasible and effective summative assessment of this pre-clinical module when used within an integrated evaluation program. .","PeriodicalId":9347,"journal":{"name":"Bulletin of Egyptian Society for Physiological Sciences","volume":"2 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Egyptian Society for Physiological Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/besps.2023.238298.1152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: To evaluate the quality of multiple-choice questions (MCQs) used in a summative assessment of a Central Nervous System (CNS) module at the Faculty of Medicine, Jazan University. Methods: Item analysis was conducted on a 70-item MCQ exam administered to 57 medical students after completing the CNS module. Various departments teach the module utilizing a systems-based curriculum. Item difficulty, discrimination, reliability, and standard error of measurement were analyzed. Results: Item difficulty ranged from 0.3 to 0.9 on the difficulty index for most items (moderate difficulty). Most items (62/70) appropriately discriminated between high-and low-scoring students. Reliability was very high (Kuder-Richardson 20 = 0.91). The standard error of measurement was 3.7. Analysis of validity evidence included evaluation of content validity through alignment of exam items with module learning objectives using a test blueprint, as well as analysis of internal structure validity supported by item difficulty and discrimination statistics. Discrimination indices above 0.2 indicate items distinguished well between students performing at the upper and lower score ranges. Feasibility of MCQs was evidenced by the resources required. Minimal training and no specialized equipment or longer administration/scoring times were needed compared to other assessment methods. MCQs were well-accepted by students and faculty involved in test development and implementation. Conclusion: Psychometric analysis of item and exam characteristics provides validity evidence that scores from this MCQ reasonably represent CNS module achievement. While not capturing higher-order skills, MCQs proved a feasible and effective summative assessment of this pre-clinical module when used within an integrated evaluation program. .

查看原文本刊更多论文

作为医学院终结性评估工具的多项选择题

目的评估贾占大学医学院中枢神经系统 (CNS) 单元终结性评估中使用的选择题 (MCQ) 的质量。方法：对 70 道选择题进行项目分析：在完成中枢神经系统（CNS）模块后，对 57 名医科学生进行了 70 题的 MCQ 考试，并进行了题目分析。该模块由各系利用基于系统的课程进行教学。对题目难度、区分度、信度和测量标准误差进行了分析。结果显示大多数项目的难度指数在 0.3 至 0.9 之间（中等难度）。大多数项目（62/70）能适当区分高分和低分学生。信度非常高（Kuder-Richardson 20 = 0.91）。测量标准误差为 3.7。效度证据分析包括通过使用测试蓝图使考试项目与模块学习目标相一致来评估内容效度，以及通过项目难度和区分度统计来分析内部结构效度。区分度指数超过 0.2 表明，考试项目能很好地区分高分学生和低分学生。所需的资源证明了 MCQ 考试的可行性。与其他测评方法相比，微机问答只需少量培训，无需专门设备或较长的施测/评分时间。参与测试开发和实施的学生和教师都非常认可 MCQ。结论对题目和考试特点的心理测量分析提供了有效性证据，证明该 MCQ 的分数合理地代表了中枢神经系统模块的成绩。虽然 MCQ 无法反映高阶技能，但在综合评估计划中使用 MCQ 时，证明对临床前模块进行总结性评估是可行且有效的。.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bulletin of Egyptian Society for Physiological Sciences

自引率

0.00%

发文量