在敏捷环境中使用机器学习确定自动化测试的优先级

2019 Conference on Information Communications Technology and Society (ICTAS) Pub Date : 2019-03-01 DOI:10.1109/ICTAS.2019.8703639

L. Butgereit

{"title":"在敏捷环境中使用机器学习确定自动化测试的优先级","authors":"L. Butgereit","doi":"10.1109/ICTAS.2019.8703639","DOIUrl":null,"url":null,"abstract":"Automated software testing is an integral part of most Agile methodologies. In the case of the Scrum Agile methodology, the definition of done includes the completion of tests. As a software project matures, however, the number of tests increases to such a point that the time required to run all the tests often hinders the speed in which artifacts can be deployed. This paper describes a technique of using machine learning to help prioritize automated testing to ensure that tests which have a higher probability of failing are executed early in the test run giving the programmers an early indication of problems. In order to do this, various metrics are collected about the software under test including Cyclomatic values, Halstead-based values, and Chidamber-Kemere values. In addition, the historical commit messages from the source code control system is accessed to see if there had been defects in the various source classes previously. From these two inputs, a data file can be created which contains various metrics and whether or not there had been defects in these source files previously. This data file can then be sent to Weka to create a decision tree indicating which measurements indicate potential defects. The model created by Weka can then then be used in future to attempt to predict where defects might be in the source files and then prioritize testing appropriately.","PeriodicalId":386209,"journal":{"name":"2019 Conference on Information Communications Technology and Society (ICTAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Using Machine Learning to Prioritize Automated Testing in an Agile Environment\",\"authors\":\"L. Butgereit\",\"doi\":\"10.1109/ICTAS.2019.8703639\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automated software testing is an integral part of most Agile methodologies. In the case of the Scrum Agile methodology, the definition of done includes the completion of tests. As a software project matures, however, the number of tests increases to such a point that the time required to run all the tests often hinders the speed in which artifacts can be deployed. This paper describes a technique of using machine learning to help prioritize automated testing to ensure that tests which have a higher probability of failing are executed early in the test run giving the programmers an early indication of problems. In order to do this, various metrics are collected about the software under test including Cyclomatic values, Halstead-based values, and Chidamber-Kemere values. In addition, the historical commit messages from the source code control system is accessed to see if there had been defects in the various source classes previously. From these two inputs, a data file can be created which contains various metrics and whether or not there had been defects in these source files previously. This data file can then be sent to Weka to create a decision tree indicating which measurements indicate potential defects. The model created by Weka can then then be used in future to attempt to predict where defects might be in the source files and then prioritize testing appropriately.\",\"PeriodicalId\":386209,\"journal\":{\"name\":\"2019 Conference on Information Communications Technology and Society (ICTAS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Conference on Information Communications Technology and Society (ICTAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTAS.2019.8703639\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Conference on Information Communications Technology and Society (ICTAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAS.2019.8703639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

自动化软件测试是大多数敏捷方法的一个组成部分。在Scrum敏捷方法中，完成的定义包括测试的完成。然而，随着软件项目的成熟，测试的数量会增加到运行所有测试所需的时间通常会阻碍工件部署的速度。本文描述了一种使用机器学习来帮助自动测试确定优先级的技术，以确保在测试运行的早期执行失败概率较高的测试，从而使程序员能够及早发现问题。为了做到这一点，收集关于被测软件的各种度量，包括Cyclomatic值、基于halstead的值和Chidamber-Kemere值。此外，还可以访问来自源代码控制系统的历史提交消息，以查看以前各种源类中是否存在缺陷。从这两个输入中，可以创建一个数据文件，其中包含各种度量，以及这些源文件以前是否存在缺陷。然后可以将该数据文件发送到Weka，以创建一个决策树，指示哪些度量表明了潜在的缺陷。Weka创建的模型可以在将来用于尝试预测源文件中可能存在缺陷的位置，然后适当地确定测试的优先级。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Using Machine Learning to Prioritize Automated Testing in an Agile Environment

Automated software testing is an integral part of most Agile methodologies. In the case of the Scrum Agile methodology, the definition of done includes the completion of tests. As a software project matures, however, the number of tests increases to such a point that the time required to run all the tests often hinders the speed in which artifacts can be deployed. This paper describes a technique of using machine learning to help prioritize automated testing to ensure that tests which have a higher probability of failing are executed early in the test run giving the programmers an early indication of problems. In order to do this, various metrics are collected about the software under test including Cyclomatic values, Halstead-based values, and Chidamber-Kemere values. In addition, the historical commit messages from the source code control system is accessed to see if there had been defects in the various source classes previously. From these two inputs, a data file can be created which contains various metrics and whether or not there had been defects in these source files previously. This data file can then be sent to Weka to create a decision tree indicating which measurements indicate potential defects. The model created by Weka can then then be used in future to attempt to predict where defects might be in the source files and then prioritize testing appropriately.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 Conference on Information Communications Technology and Society (ICTAS)

自引率

0.00%

发文量