{"title":"Software Effort Estimation for Agile Software Development Using a Strategy Based on k-Nearest Neighbors Algorithm","authors":"Eduardo Rodríguez Sánchez, Humberto Cervantes Maceda, Eduardo Vázquez-Santacruz","doi":"10.1109/ENC56672.2022.9882947","DOIUrl":null,"url":null,"abstract":"Agile development adoption in organizations is a trend that continues to accelerate according to the 15th State of Agile Report. Enterprises need to respond quickly to the needs of their customers and stakeholders and by adopting agile practices in IT teams, business value is raised in both performance and quality, so it is important to adopt practices and models that ensure the time, scope and cost of a project are achieved successfully. This paper presents a hybrid effort estimation model that uses a story point approach with machine learning techniques to estimate completion time and total cost of a project that is developed with agile methods like Scrum. The main machine learning technique used to implement the project is the k-Nearest Neighbors algorithm (KNN), its learning capabilities are assessed through 10-Fold cross validation and the estimates are compared with the original dataset and the results obtained from literature to show that estimates are competitive. The proposed approach uses category size labels that improve the original estimation model based on linear regression. The research uses 21 projects developed by six software houses, and training is done on a set created from a technique called data augmentation that generates 42 projects with a small amount of noise. Completion time is measured in days and total cost is valued in Pakistan rupees. All the results are evaluated through accuracy, Mean Squared Error, Mean Relative Error, variance and coefficient of determination.","PeriodicalId":145622,"journal":{"name":"2022 IEEE Mexican International Conference on Computer Science (ENC)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Mexican International Conference on Computer Science (ENC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ENC56672.2022.9882947","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Agile development adoption in organizations is a trend that continues to accelerate according to the 15th State of Agile Report. Enterprises need to respond quickly to the needs of their customers and stakeholders and by adopting agile practices in IT teams, business value is raised in both performance and quality, so it is important to adopt practices and models that ensure the time, scope and cost of a project are achieved successfully. This paper presents a hybrid effort estimation model that uses a story point approach with machine learning techniques to estimate completion time and total cost of a project that is developed with agile methods like Scrum. The main machine learning technique used to implement the project is the k-Nearest Neighbors algorithm (KNN), its learning capabilities are assessed through 10-Fold cross validation and the estimates are compared with the original dataset and the results obtained from literature to show that estimates are competitive. The proposed approach uses category size labels that improve the original estimation model based on linear regression. The research uses 21 projects developed by six software houses, and training is done on a set created from a technique called data augmentation that generates 42 projects with a small amount of noise. Completion time is measured in days and total cost is valued in Pakistan rupees. All the results are evaluated through accuracy, Mean Squared Error, Mean Relative Error, variance and coefficient of determination.