A Model of How Students Engineer Test Cases With Feedback

IF 3.2 · Zone 3 (Engineering and Technology) · Q1 · EDUCATION, SCIENTIFIC DISCIPLINES

ACM Transactions on Computing Education Pub Date : 2023-10-20 DOI:10.1145/3628604

Austin M. Shin, Ayaan M. Kazerouni



Abstract

Background and Context. Students’ programming projects are often assessed on the basis of their tests as well as their implementations, most commonly using test adequacy criteria like branch coverage, or, in some cases, mutation analysis. As a result, students are implicitly encouraged to use these tools during their development process (i.e., so they have awareness of the strength of their own test suites).

Objectives. Little is known about how students choose test cases for their software while being guided by these feedback mechanisms. We aim to explore the interaction between students and commonly used testing feedback mechanisms (in this case, branch coverage and mutation-based feedback).

Method. We use grounded theory to explore this interaction. We conducted 12 think-aloud interviews with students as they were asked to complete a series of software testing tasks, each of which involved a different feedback mechanism. Interviews were recorded and transcripts were analyzed, and we present the overarching themes that emerged from our analysis.

Findings. Our findings are organized into a process model describing how students completed software testing tasks while being guided by a test adequacy criterion. Program comprehension strategies were commonly employed to reason about feedback and devise test cases. Mutation-based feedback tended to be cognitively overwhelming for students, and they resorted to weaker heuristics in order to address this feedback.

Implications. In the presence of testing feedback, students did not appear to consider problem coverage as a testing goal so much as program coverage. While test adequacy criteria can be useful for assessment of software tests, we must consider whether they represent good goals for testing, and if our current methods of practice and assessment are encouraging poor testing habits.
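For readers unfamiliar with the two feedback mechanisms named in the abstract, the following is a minimal sketch of how they can disagree. It is not taken from the paper: the function `classify`, its pass mark of 60, the hand-written mutant, and both test suites are hypothetical names and values chosen only for illustration. The weak suite exercises both branches of the `if` (full branch coverage), yet it cannot tell the original from a mutant in which `>=` has been replaced by `>`; only a test at the boundary does.

```python
def classify(score: int) -> str:
    """Hypothetical spec: scores of 60 or more pass, everything else fails."""
    if score >= 60:
        return "pass"
    return "fail"


def classify_mutant(score: int) -> str:
    """Hand-written mutant of classify: '>=' replaced with '>'."""
    if score > 60:
        return "pass"
    return "fail"


def weak_suite(fn) -> bool:
    """Takes both branches of the 'if', but never probes the boundary."""
    return fn(90) == "pass" and fn(10) == "fail"


def strong_suite(fn) -> bool:
    """Adds the boundary case implied by the (assumed) problem statement."""
    return weak_suite(fn) and fn(60) == "pass"


if __name__ == "__main__":
    # A suite "kills" a mutant when it passes on the original but fails on the mutant.
    for name, suite in [("weak", weak_suite), ("strong", strong_suite)]:
        killed = suite(classify) and not suite(classify_mutant)
        print(f"{name} suite: passes original={suite(classify)}, kills mutant={killed}")
```

In this sketch the boundary test comes from reading the problem statement rather than from looking at which lines or branches were executed, which is one way to picture the distinction the abstract draws between problem coverage and program coverage.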


Source journal

ACM Transactions on Computing Education (EDUCATION, SCIENTIFIC DISCIPLINES)

CiteScore: 6.50

Self-citation rate: 16.70%

Journal description: ACM Transactions on Computing Education (TOCE) (formerly named JERIC, Journal on Educational Resources in Computing) covers diverse aspects of computing education: traditional computer science, computer engineering, information technology, and informatics; emerging aspects of computing; and applications of computing to other disciplines. The common characteristics shared by these papers are a scholarly approach to teaching and learning, a broad appeal to educational practitioners, and a clear connection to student learning.