VulnMiner: A comprehensive framework for vulnerability collection from C/C++ source code projects

IF 1.2 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Software Impacts Pub Date : 2024-11-01 DOI:10.1016/j.simpa.2024.100713

Guru Bhandari, Nikola Gavric, Andrii Shalaginov

引用次数: 0

Abstract

The study introduces VulnMiner, a comprehensive framework encompassing a data extraction tool tailored for identifying vulnerabilities in C/C++ source code. Moreover, it unveils an initial release of a vulnerability dataset, curated from prevalent projects and annotated with vulnerable and benign instances. This dataset incorporates projects with vulnerabilities labeled as Common Weakness Enumeration (CWE) categories. The developed open-source extraction tool collects vulnerability data utilizing static security analyzers. The study also fosters the machine learning (ML) and natural language processing (NLP) model’s effectiveness in accurately classifying vulnerabilities, evidenced by its identification of numerous weaknesses in open-source projects.

查看原文本刊更多论文

一个全面的框架，用于从C/ c++源代码项目中收集漏洞

该研究介绍了VulnMiner，这是一个全面的框架，包含一个专门用于识别C/ c++源代码漏洞的数据提取工具。此外，它还公布了一个漏洞数据集的初始版本，该数据集从流行的项目中挑选出来，并注释了脆弱和良性的实例。此数据集包含带有标记为常见弱点枚举（CWE）类别的漏洞的项目。开发的开源提取工具利用静态安全分析器收集漏洞数据。该研究还促进了机器学习（ML）和自然语言处理（NLP）模型在准确分类漏洞方面的有效性，其对开源项目中众多弱点的识别证明了这一点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊