Learning from 6,000 Projects: Mining Models in the Large

2010 10th IEEE Working Conference on Source Code Analysis and Manipulation. Pub Date: 2010-09-12. DOI: 10.1109/SCAM.2010.23

A. Zeller

Citations: 3

Abstract

Models - abstract and simple descriptions of some artifact - are the backbone of all software engineering activities. While writing models is hard, existing code can serve as a source for abstract descriptions of how software behaves. To infer correct usage, though, code analysis needs usage examples: the more, the better. We have built a lightweight parser that efficiently extracts API usage models from source code - models that can then be used to detect anomalies. Applied to the 200 million lines of code of the Gentoo Linux distribution, we extract more than 15 million API constraints, encoding and abstracting the "wisdom of Linux code".
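The abstract does not describe the parser or the model format, so the following is only a minimal sketch of the general idea, assuming that each usage example has already been reduced to an ordered list of API call names. It mines simple "a is called before b" constraints and flags code that calls a without the usual follow-up b. The function names (`precedence_pairs`, `mine_constraints`, `find_anomalies`), the confidence threshold, and the sample data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def precedence_pairs(calls):
    """Return the set of (a, b) pairs where call a occurs before call b."""
    pairs = set()
    for i, a in enumerate(calls):
        for b in calls[i + 1:]:
            if a != b:
                pairs.add((a, b))
    return pairs


def mine_constraints(usages):
    """Count, over all usage examples, how often each pair and each call occurs."""
    pair_support = Counter()
    call_support = Counter()
    for calls in usages:
        pair_support.update(precedence_pairs(calls))
        call_support.update(set(calls))
    return pair_support, call_support


def find_anomalies(calls, pair_support, call_support, min_confidence=0.75):
    """Report (a, b, confidence) where a is called but the usual later call b is missing."""
    present = precedence_pairs(calls)
    called = set(calls)
    missing = []
    for (a, b), n in pair_support.items():
        if a in called and (a, b) not in present:
            confidence = n / call_support[a]
            if confidence >= min_confidence:
                missing.append((a, b, confidence))
    return sorted(missing)


# Hypothetical usage examples, one list of API calls per function body.
usages = [
    ["fopen", "fread", "fclose"],
    ["fopen", "fwrite", "fclose"],
    ["fopen", "fgets", "fclose"],
    ["malloc", "free"],
    ["fopen", "fread", "fclose"],
]
pair_support, call_support = mine_constraints(usages)

# A function that opens and reads a file but never closes it.
anomalies = find_anomalies(["fopen", "fread"], pair_support, call_support)
print(anomalies)
```

In this sketch, a constraint becomes more trustworthy as more examples support it, which is one way to read the abstract's remark that more usage examples are better. A real analysis at the scale of a Linux distribution would need a far more compact representation than the in-memory counters used here.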

