Searching for a Needle in a Haystack: Predicting Security Vulnerabilities for Windows Vista

2010 Third International Conference on Software Testing, Verification and Validation Pub Date : 2010-04-06 DOI:10.1109/ICST.2010.32

Thomas Zimmermann, Nachiappan Nagappan, L. Williams

{"title":"Searching for a Needle in a Haystack: Predicting Security Vulnerabilities for Windows Vista","authors":"Thomas Zimmermann, Nachiappan Nagappan, L. Williams","doi":"10.1109/ICST.2010.32","DOIUrl":null,"url":null,"abstract":"Many factors are believed to increase the vulnerability of software system; for example, the more widely deployed or popular is a software system the more likely it is to be attacked. Early identification of defects has been a widely investigated topic in software engineering research. Early identification of software vulnerabilities can help mitigate these attacks to a large degree by focusing better security verification efforts in these components. Predicting vulnerabilities is complicated by the fact that vulnerabilities are, most often, few in number and introduce significant bias by creating a sparse dataset in the population. As a result, vulnerability prediction can be thought of us preverbally “searching for a needle in a haystack.” In this paper, we present a large-scale empirical study on Windows Vista, where we empirically evaluate the efficacy of classical metrics like complexity, churn, coverage, dependency measures, and organizational structure of the company to predict vulnerabilities and assess how well these software measures correlate with vulnerabilities. We observed in our experiments that classical software measures predict vulnerabilities with a high precision but low recall values. The actual dependencies, however, predict vulnerabilities with a lower precision but substantially higher recall.","PeriodicalId":192678,"journal":{"name":"2010 Third International Conference on Software Testing, Verification and Validation","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"191","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Third International Conference on Software Testing, Verification and Validation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST.2010.32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 191

Abstract

Many factors are believed to increase the vulnerability of software system; for example, the more widely deployed or popular is a software system the more likely it is to be attacked. Early identification of defects has been a widely investigated topic in software engineering research. Early identification of software vulnerabilities can help mitigate these attacks to a large degree by focusing better security verification efforts in these components. Predicting vulnerabilities is complicated by the fact that vulnerabilities are, most often, few in number and introduce significant bias by creating a sparse dataset in the population. As a result, vulnerability prediction can be thought of us preverbally “searching for a needle in a haystack.” In this paper, we present a large-scale empirical study on Windows Vista, where we empirically evaluate the efficacy of classical metrics like complexity, churn, coverage, dependency measures, and organizational structure of the company to predict vulnerabilities and assess how well these software measures correlate with vulnerabilities. We observed in our experiments that classical software measures predict vulnerabilities with a high precision but low recall values. The actual dependencies, however, predict vulnerabilities with a lower precision but substantially higher recall.

查看原文本刊更多论文

大海捞针:预测Windows Vista的安全漏洞

很多因素都认为会增加软件系统的脆弱性;例如，一个软件系统部署得越广泛或越流行，它就越有可能受到攻击。在软件工程研究中，缺陷的早期识别一直是一个广泛研究的课题。通过在这些组件中集中更好的安全性验证工作，软件漏洞的早期识别可以在很大程度上帮助减轻这些攻击。由于漏洞通常数量很少，并且通过在总体中创建稀疏数据集而引入明显的偏差，因此预测漏洞是复杂的。因此，脆弱性预测可以被认为是我们在语言前“大海捞针”。在本文中，我们对Windows Vista进行了大规模的实证研究，在那里我们经验地评估了经典指标的有效性，如复杂性、流失、覆盖率、依赖度量和公司的组织结构，以预测漏洞，并评估这些软件度量与漏洞的相关性。我们在实验中观察到，传统的软件测量方法预测漏洞的精度高，但召回率低。然而，实际的依赖关系预测漏洞的精度较低，但召回率却很高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 Third International Conference on Software Testing, Verification and Validation

自引率

0.00%

发文量