How to rate programming skills in programming experiments?: a preliminary, exploratory, study based on university marks, pretests, and self-estimation

Workshop on Evaluation and Usability of Programming Languages and Tools Pub Date : 2011-10-24 DOI:10.1145/2089155.2089161

Sebastian Kleinschmager, Stefan Hanenberg

引用次数: 19

Abstract

Rating of subjects is an important issue for empirical studies. First, it is desirable for studies that rely on comparisons between different groups to make sure that those groups are balanced, i.e. that subjects in different groups are comparable. Second, in order to understand to what extent the results of a study are generalizable it is necessary to understand whether the used subjects can be considered as representative. Third, for a deeper understanding of an experiment's results it is desirable to understand what different kinds of subjects achieved what results. This paper addresses this topic by a preliminary, exploratory study that analyzes three different possible criteria: university marks, self-estimation, and pretests. It turns out that neither university marks nor pretests yielded better results than self-estimation.

查看原文本刊更多论文

如何评价编程实验中的编程技能?基于大学分数、预试和自我评价的初步的、探索性的研究

被试评定是实证研究中的一个重要问题。首先，依靠不同组之间的比较来确保这些组是平衡的研究是可取的，即不同组的受试者是可比较的。其次，为了理解一项研究的结果在多大程度上是可概括的，有必要了解所使用的受试者是否可以被认为具有代表性。第三，为了更深入地理解实验结果，我们需要了解不同类型的受试者获得了什么结果。本文通过一项初步的探索性研究来解决这个问题，该研究分析了三种不同的可能标准:大学分数，自我估计和预测试。事实证明，无论是大学分数还是预考，结果都不如自我评估好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Workshop on Evaluation and Usability of Programming Languages and Tools

自引率

0.00%

发文量