VCMatch: A Ranking-based Approach for Automatic Security Patches Localization for OSS Vulnerabilities

2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) Pub Date : 2022-03-01 DOI:10.1109/saner53432.2022.00076

Shichao Wang, Yun Zhang, Liagfeng Bao, Xin Xia, Ming-hui Wu

{"title":"VCMatch: A Ranking-based Approach for Automatic Security Patches Localization for OSS Vulnerabilities","authors":"Shichao Wang, Yun Zhang, Liagfeng Bao, Xin Xia, Ming-hui Wu","doi":"10.1109/saner53432.2022.00076","DOIUrl":null,"url":null,"abstract":"Nowadays, vulnerabilities in open source software (OSS) are constantly emerging, posing a great threat to application security. Security patches are crucial in reducing the risk of OSS vulnerabilities. However, many of the vulnerabilities disclosed by CVE/NVD are not accompanied by security patches. Previous research has shown that the auxiliary information in CVE/NVD can aid in the matching of a vulnerability to appropriate commits. The state-of-art research proposed a rank-based approach based on the multiple dimensions of features extracted from the auxiliary information in CVE/NVD. However, this approach ignores the semantic features in the vulnerability descriptions and commit messages, making the model still have room for improvement. In this paper, we propose a novel ranking-based approach VCMATCH (Vulnerability-Commit Match). In addition to extracting the shallow statistical features between the vulnerability and the patch commit, VCMATCH extracts the deep semantic features of the vulnerability descriptions and commit messages. Besides, VCMATCH applies three classification models (i.e., XGBoost, LightGBM, CNN) and uses a voting-based rank fusion method to combine the results of the three models to generate a better result. We evaluate VCMATCH with 1,669 CVEs from 10 OSS projects. The experiment results show that VCMATCH can effectively identify security patches for OSS vulnerabilities in terms of Recall@K and Manual Effort@K, and outperforms the state-of-art model by a statistically significant margin.","PeriodicalId":437520,"journal":{"name":"2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/saner53432.2022.00076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Nowadays, vulnerabilities in open source software (OSS) are constantly emerging, posing a great threat to application security. Security patches are crucial in reducing the risk of OSS vulnerabilities. However, many of the vulnerabilities disclosed by CVE/NVD are not accompanied by security patches. Previous research has shown that the auxiliary information in CVE/NVD can aid in the matching of a vulnerability to appropriate commits. The state-of-art research proposed a rank-based approach based on the multiple dimensions of features extracted from the auxiliary information in CVE/NVD. However, this approach ignores the semantic features in the vulnerability descriptions and commit messages, making the model still have room for improvement. In this paper, we propose a novel ranking-based approach VCMATCH (Vulnerability-Commit Match). In addition to extracting the shallow statistical features between the vulnerability and the patch commit, VCMATCH extracts the deep semantic features of the vulnerability descriptions and commit messages. Besides, VCMATCH applies three classification models (i.e., XGBoost, LightGBM, CNN) and uses a voting-based rank fusion method to combine the results of the three models to generate a better result. We evaluate VCMATCH with 1,669 CVEs from 10 OSS projects. The experiment results show that VCMATCH can effectively identify security patches for OSS vulnerabilities in terms of Recall@K and Manual Effort@K, and outperforms the state-of-art model by a statistically significant margin.

查看原文本刊更多论文

VCMatch:基于排名的OSS漏洞安全补丁自动定位方法

目前，开源软件的漏洞不断涌现，对应用程序的安全构成了极大的威胁。安全补丁对于降低OSS漏洞的风险至关重要。然而，CVE/NVD披露的许多漏洞并没有附带安全补丁。先前的研究表明，CVE/NVD中的辅助信息可以帮助将漏洞与适当的提交进行匹配。基于CVE/NVD辅助信息提取特征的多维度，提出了一种基于秩的方法。但是，这种方法忽略了漏洞描述和提交消息中的语义特征，使得模型仍有改进的空间。在本文中，我们提出了一种新的基于排名的方法VCMATCH(漏洞-提交匹配)。VCMATCH除了提取漏洞与补丁提交之间的浅层统计特征外，还提取漏洞描述和提交消息的深层语义特征。此外，VCMATCH应用了三种分类模型(即XGBoost、LightGBM、CNN)，并使用基于投票的秩融合方法将三种模型的结果结合起来，以产生更好的结果。我们用来自10个OSS项目的1,669个cve来评估VCMATCH。实验结果表明，VCMATCH可以有效地识别Recall@K和Manual Effort@K两种OSS漏洞的安全补丁，并且在统计上显著优于当前的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)

自引率

0.00%

发文量