Socratic Questioning of Novice Debuggers: A Benchmark Dataset and Preliminary Evaluations

Workshop on Innovative Use of NLP for Building Educational Applications Pub Date : 1900-01-01 DOI:10.18653/v1/2023.bea-1.57

Erfan Al-Hossami, Razvan C. Bunescu, Ryan Teehan, Laurel Powell, Khyati Mahajan, Mohsen Dorodchi

引用次数: 4

Abstract

Socratic questioning is a teaching strategy where the student is guided towards solving a problem on their own, instead of being given the solution directly. In this paper, we introduce a dataset of Socratic conversations where an instructor helps a novice programmer fix buggy solutions to simple computational problems. The dataset is then used for benchmarking the Socratic debugging abilities of GPT-based language models. While GPT-4 is observed to perform much better than GPT-3.5, its precision, and recall still fall short of human expert abilities, motivating further work in this area.

查看原文本刊更多论文

新手调试器的苏格拉底问题:一个基准数据集和初步评估

苏格拉底式提问是一种教学策略，引导学生自己解决问题，而不是直接给出解决方案。在本文中，我们介绍了一个苏格拉底对话的数据集，其中讲师帮助新手程序员修复简单计算问题的错误解决方案。然后使用该数据集对基于gpt的语言模型的苏格拉底式调试能力进行基准测试。虽然观察到GPT-4的表现比GPT-3.5好得多，但其精度和召回率仍然低于人类专家的能力，这激励了该领域的进一步工作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Workshop on Innovative Use of NLP for Building Educational Applications

自引率

0.00%

发文量