基于机器学习方法的软件质量预测实验研究

2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA) Pub Date : 2020-06-01 DOI:10.1109/HORA49412.2020.9152918

A. A. Ceran, Ö. Ö. Tanriöver

{"title":"基于机器学习方法的软件质量预测实验研究","authors":"A. A. Ceran, Ö. Ö. Tanriöver","doi":"10.1109/HORA49412.2020.9152918","DOIUrl":null,"url":null,"abstract":"Software quality estimation is an activity needed at various stages of software development. It may be used for planning the project's quality assurance practices and for benchmarking. In earlier previous studies, two methods (Multiple Criteria Linear Programming and Multiple Criteria Quadratic Programming) for estimating the quality of software had been used Also, C5.0, SVM and Neutral network were experimented with for quality estimation. These studies have relatively low accuracies. In this study, we aimed to improve estimation accuracy by using relevant features of a large dataset. We used a feature selection method and correlation matrix for reaching higher accuracies. In addition, we have experimented with recent methods shown to be successful for other prediction tasks. Machine learning algorithms such as Xgboost, Random Forest and Decision Tree are applied to the data to predict the software quality and reveal the relation between the quality and development attributes. The experimental results show that the quality level of software can be well estimated by machine learning algorithms.","PeriodicalId":166917,"journal":{"name":"2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An experimental study for software quality prediction with machine learning methods\",\"authors\":\"A. A. Ceran, Ö. Ö. Tanriöver\",\"doi\":\"10.1109/HORA49412.2020.9152918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Software quality estimation is an activity needed at various stages of software development. It may be used for planning the project's quality assurance practices and for benchmarking. In earlier previous studies, two methods (Multiple Criteria Linear Programming and Multiple Criteria Quadratic Programming) for estimating the quality of software had been used Also, C5.0, SVM and Neutral network were experimented with for quality estimation. These studies have relatively low accuracies. In this study, we aimed to improve estimation accuracy by using relevant features of a large dataset. We used a feature selection method and correlation matrix for reaching higher accuracies. In addition, we have experimented with recent methods shown to be successful for other prediction tasks. Machine learning algorithms such as Xgboost, Random Forest and Decision Tree are applied to the data to predict the software quality and reveal the relation between the quality and development attributes. The experimental results show that the quality level of software can be well estimated by machine learning algorithms.\",\"PeriodicalId\":166917,\"journal\":{\"name\":\"2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HORA49412.2020.9152918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HORA49412.2020.9152918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

软件质量评估是软件开发各个阶段都需要的一项活动。它可以用于规划项目的质量保证实践和基准测试。在之前的研究中，已经使用了两种方法(多准则线性规划和多准则二次规划)来估计软件的质量，并尝试了C5.0、SVM和Neutral网络进行质量估计。这些研究的准确性相对较低。在本研究中，我们旨在利用大型数据集的相关特征来提高估计精度。为了达到更高的精度，我们使用了特征选择方法和相关矩阵。此外，我们已经用最近的方法进行了实验，这些方法在其他预测任务中被证明是成功的。将Xgboost、Random Forest、Decision Tree等机器学习算法应用于数据，预测软件质量，揭示质量与开发属性之间的关系。实验结果表明，机器学习算法可以很好地估计软件的质量水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An experimental study for software quality prediction with machine learning methods

Software quality estimation is an activity needed at various stages of software development. It may be used for planning the project's quality assurance practices and for benchmarking. In earlier previous studies, two methods (Multiple Criteria Linear Programming and Multiple Criteria Quadratic Programming) for estimating the quality of software had been used Also, C5.0, SVM and Neutral network were experimented with for quality estimation. These studies have relatively low accuracies. In this study, we aimed to improve estimation accuracy by using relevant features of a large dataset. We used a feature selection method and correlation matrix for reaching higher accuracies. In addition, we have experimented with recent methods shown to be successful for other prediction tasks. Machine learning algorithms such as Xgboost, Random Forest and Decision Tree are applied to the data to predict the software quality and reveal the relation between the quality and development attributes. The experimental results show that the quality level of software can be well estimated by machine learning algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA)

自引率

0.00%

发文量