On establishing a benchmark for evaluating static analysis alert prioritization and classification techniques

International Symposium on Empirical Software Engineering and Measurement Pub Date : 2008-10-09 DOI:10.1145/1414004.1414013

S. Heckman, L. Williams

引用次数: 110

Abstract

Benchmarks provide an experimental basis for evaluating software engineering processes or techniques in an objective and repeatable manner. We present the FAULTBENCH v0.1 benchmark, as a contribution to current benchmark materials, for evaluation and comparison of techniques that prioritize and classify alerts generated by static analysis tools. Static analysis tools may generate an overwhelming number of alerts, the majority of which are likely to be false positives (FP). Two FP mitigation techniques, alert prioritization and classification, provide an ordering or classification of alerts, identifying those likely to be anomalies. We evaluate FAULTBENCH using three versions of a FP mitigation technique within the AWARE adaptive prioritization model. Individual FAULTBENCH subjects vary in their optimal FP mitigation techniques. Together, FAULTBENCH subjects provide a precise and general evaluation of FP mitigation techniques.

查看原文本刊更多论文

建立评价静态分析、警报优先级和分类技术的基准

基准测试为以客观和可重复的方式评估软件工程过程或技术提供了实验基础。我们提供FAULTBENCH v0.1基准测试，作为对当前基准测试材料的贡献，用于评估和比较由静态分析工具生成的警报的优先级和分类技术。静态分析工具可能会生成大量警报，其中大多数可能是误报(FP)。两种FP缓解技术——警报优先级和分类——提供了警报的排序或分类，确定了可能是异常的警报。我们使用AWARE自适应优先级模型中的三个版本的FP缓解技术来评估FAULTBENCH。每个FAULTBENCH研究对象的最佳FP缓解技术各不相同。总之，FAULTBENCH主题提供了对FP缓解技术的精确和一般评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Symposium on Empirical Software Engineering and Measurement

自引率

0.00%

发文量