Software reliability modeling based on SVM and virtual sample

Yumei Wu, Risheng Yang
{"title":"Software reliability modeling based on SVM and virtual sample","authors":"Yumei Wu, Risheng Yang","doi":"10.1109/RAMS.2013.6517763","DOIUrl":null,"url":null,"abstract":"Software reliability prediction models, which receive the most attention in software reliability engineering, use the failure data collected in testing phases to predict the failure occurrence in the operational environment. Currently, as the requirement for reliable software is increasing, ways to predict and estimate the reliability of software systems which require high reliability with small size test data is more problematic. What's more, there exists a difficult problem in software reliability modeling that the prediction capability of a model varies with failure data change. This inconsistency problem limits the promotion and application of software reliability techniques; mainly because the assumptions the models are based on may not be suitable for most cases. In addition, there exists a contradiction in traditional software reliability prediction methods. Prediction accuracy is low due to a lack of comprehensive consideration of factors affecting reliability. However, when more factors are taken into account, it is difficult to establish a statistical model and solve multivariate likelihood equations. For these reasons, software reliability prediction modeling method based on machine leaning techniques for small sample size is studied in this paper. Firstly, gene expression programming algorithm is used to analyze the small size sample by symbolic regression. The symbolic regression function acquired can then be viewed as the priori information of the data and used to generate a virtual sample. Then, with the virtual sample a regression model based on a Support Vector Machine (SVM) can be established, with which the software reliability can be predicted. Finally, a case study based on real failure data-sets is presented verifying the effectiveness of the method.","PeriodicalId":189714,"journal":{"name":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Proceedings Annual Reliability and Maintainability Symposium (RAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAMS.2013.6517763","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Software reliability prediction models, which receive the most attention in software reliability engineering, use the failure data collected in testing phases to predict the failure occurrence in the operational environment. Currently, as the requirement for reliable software is increasing, ways to predict and estimate the reliability of software systems which require high reliability with small size test data is more problematic. What's more, there exists a difficult problem in software reliability modeling that the prediction capability of a model varies with failure data change. This inconsistency problem limits the promotion and application of software reliability techniques; mainly because the assumptions the models are based on may not be suitable for most cases. In addition, there exists a contradiction in traditional software reliability prediction methods. Prediction accuracy is low due to a lack of comprehensive consideration of factors affecting reliability. However, when more factors are taken into account, it is difficult to establish a statistical model and solve multivariate likelihood equations. For these reasons, software reliability prediction modeling method based on machine leaning techniques for small sample size is studied in this paper. Firstly, gene expression programming algorithm is used to analyze the small size sample by symbolic regression. The symbolic regression function acquired can then be viewed as the priori information of the data and used to generate a virtual sample. Then, with the virtual sample a regression model based on a Support Vector Machine (SVM) can be established, with which the software reliability can be predicted. Finally, a case study based on real failure data-sets is presented verifying the effectiveness of the method.
基于支持向量机和虚拟样本的软件可靠性建模
软件可靠性预测模型是在软件可靠性工程中最受关注的一种软件可靠性预测模型,它利用测试阶段收集到的故障数据来预测运行环境中故障的发生。当前,随着对软件可靠性要求的不断提高,如何利用小尺寸的测试数据对可靠性要求高的软件系统进行可靠性预测和评估就显得更加困难。另外,在软件可靠性建模中存在一个难题,即模型的预测能力随故障数据的变化而变化。这种不一致性问题限制了软件可靠性技术的推广和应用;主要是因为模型所基于的假设可能并不适用于大多数情况。此外,传统的软件可靠性预测方法也存在一定的矛盾。由于缺乏对可靠性影响因素的综合考虑,预测精度较低。然而,当考虑的因素较多时,很难建立统计模型和求解多元似然方程。为此,本文研究了基于机器学习技术的小样本软件可靠性预测建模方法。首先,采用基因表达式编程算法对小样本进行符号回归分析。然后,所获得的符号回归函数可以被视为数据的先验信息,并用于生成虚拟样本。然后,利用虚拟样本建立基于支持向量机(SVM)的回归模型,对软件可靠性进行预测。最后,以实际故障数据集为例,验证了该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信