Software reliability modeling based on SVM and virtual sample

2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS) Pub Date : 2013-05-23 DOI:10.1109/RAMS.2013.6517763

Yumei Wu, Risheng Yang

{"title":"Software reliability modeling based on SVM and virtual sample","authors":"Yumei Wu, Risheng Yang","doi":"10.1109/RAMS.2013.6517763","DOIUrl":null,"url":null,"abstract":"Software reliability prediction models, which receive the most attention in software reliability engineering, use the failure data collected in testing phases to predict the failure occurrence in the operational environment. Currently, as the requirement for reliable software is increasing, ways to predict and estimate the reliability of software systems which require high reliability with small size test data is more problematic. What's more, there exists a difficult problem in software reliability modeling that the prediction capability of a model varies with failure data change. This inconsistency problem limits the promotion and application of software reliability techniques; mainly because the assumptions the models are based on may not be suitable for most cases. In addition, there exists a contradiction in traditional software reliability prediction methods. Prediction accuracy is low due to a lack of comprehensive consideration of factors affecting reliability. However, when more factors are taken into account, it is difficult to establish a statistical model and solve multivariate likelihood equations. For these reasons, software reliability prediction modeling method based on machine leaning techniques for small sample size is studied in this paper. Firstly, gene expression programming algorithm is used to analyze the small size sample by symbolic regression. The symbolic regression function acquired can then be viewed as the priori information of the data and used to generate a virtual sample. Then, with the virtual sample a regression model based on a Support Vector Machine (SVM) can be established, with which the software reliability can be predicted. Finally, a case study based on real failure data-sets is presented verifying the effectiveness of the method.","PeriodicalId":189714,"journal":{"name":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMS.2013.6517763","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Software reliability prediction models, which receive the most attention in software reliability engineering, use the failure data collected in testing phases to predict the failure occurrence in the operational environment. Currently, as the requirement for reliable software is increasing, ways to predict and estimate the reliability of software systems which require high reliability with small size test data is more problematic. What's more, there exists a difficult problem in software reliability modeling that the prediction capability of a model varies with failure data change. This inconsistency problem limits the promotion and application of software reliability techniques; mainly because the assumptions the models are based on may not be suitable for most cases. In addition, there exists a contradiction in traditional software reliability prediction methods. Prediction accuracy is low due to a lack of comprehensive consideration of factors affecting reliability. However, when more factors are taken into account, it is difficult to establish a statistical model and solve multivariate likelihood equations. For these reasons, software reliability prediction modeling method based on machine leaning techniques for small sample size is studied in this paper. Firstly, gene expression programming algorithm is used to analyze the small size sample by symbolic regression. The symbolic regression function acquired can then be viewed as the priori information of the data and used to generate a virtual sample. Then, with the virtual sample a regression model based on a Support Vector Machine (SVM) can be established, with which the software reliability can be predicted. Finally, a case study based on real failure data-sets is presented verifying the effectiveness of the method.

查看原文本刊更多论文

基于支持向量机和虚拟样本的软件可靠性建模

软件可靠性预测模型是在软件可靠性工程中最受关注的一种软件可靠性预测模型，它利用测试阶段收集到的故障数据来预测运行环境中故障的发生。当前，随着对软件可靠性要求的不断提高，如何利用小尺寸的测试数据对可靠性要求高的软件系统进行可靠性预测和评估就显得更加困难。另外，在软件可靠性建模中存在一个难题，即模型的预测能力随故障数据的变化而变化。这种不一致性问题限制了软件可靠性技术的推广和应用;主要是因为模型所基于的假设可能并不适用于大多数情况。此外，传统的软件可靠性预测方法也存在一定的矛盾。由于缺乏对可靠性影响因素的综合考虑，预测精度较低。然而，当考虑的因素较多时，很难建立统计模型和求解多元似然方程。为此，本文研究了基于机器学习技术的小样本软件可靠性预测建模方法。首先，采用基因表达式编程算法对小样本进行符号回归分析。然后，所获得的符号回归函数可以被视为数据的先验信息，并用于生成虚拟样本。然后，利用虚拟样本建立基于支持向量机(SVM)的回归模型，对软件可靠性进行预测。最后，以实际故障数据集为例，验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)

自引率

0.00%

发文量