Automatic programming error class identification with code plagiarism-based clustering

Proceedings of the 2nd International Code Hunt Workshop on Educational Software Engineering Pub Date : 2016-11-14 DOI:10.1145/2993270.2993271

Sébastien Combéfis, A. Schils

引用次数: 10

Abstract

Online platforms to learn programming are very popular nowadays. These platforms must automatically assess codes submitted by the learners and must provide good quality feedbacks in order to support their learning. Classical techniques to produce useful feedbacks include using unit testing frameworks to perform systematic functional tests of the submitted codes or using code quality assessment tools. This paper explores how to automatically identify error classes by clustering a set of submitted codes, using code plagiarism detection tools to measure the similarity between the codes. The proposed approach and analysis framework are presented in the paper, along with a first experiment using the Code Hunt dataset.

查看原文本刊更多论文

基于代码抄袭聚类的编程错误类自动识别

如今，学习编程的在线平台非常流行。这些平台必须自动评估学习者提交的代码，必须提供高质量的反馈，以支持他们的学习。产生有用反馈的经典技术包括使用单元测试框架对提交的代码执行系统的功能测试，或者使用代码质量评估工具。本文探讨了如何通过聚类一组提交的代码来自动识别错误类别，使用代码剽窃检测工具来度量代码之间的相似性。本文介绍了所提出的方法和分析框架，以及使用代码搜索数据集的第一个实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2nd International Code Hunt Workshop on Educational Software Engineering

自引率

0.00%

发文量