Peer Grading in a Course on Algorithms and Data Structures: Machine Learning Algorithms do not Improve over Simple Baselines

Proceedings of the Third (2016) ACM Conference on Learning @ Scale Pub Date : 2015-06-02 DOI:10.1145/2876034.2876036

Mehdi S. M. Sajjadi, Morteza Alamgir, U. V. Luxburg

引用次数: 33

Abstract

Peer grading is the process of students reviewing each others' work, such as homework submissions, and has lately become a popular mechanism used in massive open online courses (MOOCs). Intrigued by this idea, we used it in a course on algorithms and data structures at the University of Hamburg. Throughout the whole semester, students repeatedly handed in submissions to exercises, which were then evaluated both by teaching assistants and by a peer grading mechanism, yielding a large dataset of teacher and peer grades. We applied different statistical and machine learning methods to aggregate the peer grades in order to come up with accurate final grades for the submissions (supervised and unsupervised, methods based on numeric scores and ordinal rankings). Surprisingly, none of them improves over the baseline of using the mean peer grade as the final grade. We discuss a number of possible explanations for these results and present a thorough analysis of the generated dataset.

查看原文本刊更多论文

算法和数据结构课程的同侪评分:机器学习算法不会在简单基线上改进

“同伴评分”是学生们互相评阅作业(如提交的作业)的过程，最近已成为大规模在线开放课程(MOOCs)中使用的一种流行机制。被这个想法所吸引，我们在汉堡大学的一门关于算法和数据结构的课程中使用了它。在整个学期中，学生们反复提交作业，然后由助教和同伴评分机制对作业进行评估，从而产生一个庞大的教师和同伴评分数据集。我们应用了不同的统计和机器学习方法来汇总同龄人的成绩，以便为提交的作品得出准确的最终成绩(有监督和无监督，基于数字分数和顺序排名的方法)。令人惊讶的是，没有一个人的成绩比使用同龄人平均成绩作为最终成绩的基线有所提高。我们讨论了这些结果的一些可能的解释，并对生成的数据集进行了彻底的分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Third (2016) ACM Conference on Learning @ Scale

自引率

0.00%

发文量