{"title":"Predicting Software Perfection Through Advanced Models to Uncover and Prevent Defects","authors":"Tariq Shahzad, Sunawar Khan, Tehseen Mazhar, Wasim Ahmad, Khmaies Ouahada, Habib Hamam","doi":"10.1049/sfw2/8832164","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including logistic regression (LR), random forest (RF), support vector machines (SVMs), convolutional neural networks (CNNs), and eXtreme Gradient Boosting (XGBoost), for software defect prediction using a combination of diverse datasets. The models were trained and tested on preprocessed and feature-selected data, followed by optimization through hyperparameter tuning. Performance evaluation metrics were employed to analyze the results comprehensively, including classification reports, confusion matrices, receiver operating characteristic–area under the curve (ROC-AUC) curves, precision–recall curves, and cumulative gain charts. The results revealed that XGBoost consistently outperformed other models, achieving the highest accuracy, precision, recall, and AUC scores across all metrics. This indicates its robustness and suitability for predicting software defects in real-world applications.</p>\n </div>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2025 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/sfw2/8832164","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Software","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/sfw2/8832164","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Software defect prediction is a critical task in software engineering, enabling organizations to proactively identify and address potential issues in software systems, thereby improving quality and reducing costs. In this study, we evaluated and compared various machine learning models, including logistic regression (LR), random forest (RF), support vector machines (SVMs), convolutional neural networks (CNNs), and eXtreme Gradient Boosting (XGBoost), for software defect prediction using a combination of diverse datasets. The models were trained and tested on preprocessed and feature-selected data, followed by optimization through hyperparameter tuning. Performance evaluation metrics were employed to analyze the results comprehensively, including classification reports, confusion matrices, receiver operating characteristic–area under the curve (ROC-AUC) curves, precision–recall curves, and cumulative gain charts. The results revealed that XGBoost consistently outperformed other models, achieving the highest accuracy, precision, recall, and AUC scores across all metrics. This indicates its robustness and suitability for predicting software defects in real-world applications.
期刊介绍:
IET Software publishes papers on all aspects of the software lifecycle, including design, development, implementation and maintenance. The focus of the journal is on the methods used to develop and maintain software, and their practical application.
Authors are especially encouraged to submit papers on the following topics, although papers on all aspects of software engineering are welcome:
Software and systems requirements engineering
Formal methods, design methods, practice and experience
Software architecture, aspect and object orientation, reuse and re-engineering
Testing, verification and validation techniques
Software dependability and measurement
Human systems engineering and human-computer interaction
Knowledge engineering; expert and knowledge-based systems, intelligent agents
Information systems engineering
Application of software engineering in industry and commerce
Software engineering technology transfer
Management of software development
Theoretical aspects of software development
Machine learning
Big data and big code
Cloud computing
Current Special Issue. Call for papers:
Knowledge Discovery for Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_KDSD.pdf
Big Data Analytics for Sustainable Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_BDASSD.pdf