Source Code Vulnerability Detection Using Vulnerability Dependency Representation Graph

2022 IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom). Pub Date: 2022-12-01. DOI: 10.1109/TrustCom56396.2022.00070

Hongyu Yang, Haiyun Yang, Liang Zhang, Xiang Cheng


Citations: 1

Abstract

Existing source code vulnerability detection methods do not explicitly preserve the vulnerability-related semantic information in the source code. This makes it difficult for a detection model to extract the features of vulnerable statements and leads to a high false positive rate. To address this problem, a source code vulnerability detection method based on the vulnerability dependency representation graph is proposed. First, the candidate vulnerability statements of a function are matched, and the vulnerability dependency representation graph for that function is generated by analyzing the multi-layer control dependencies and data dependencies of the candidate statements. Second, the function names and variable names in the code statement nodes are abstracted, and an initial representation vector is generated for each code statement node in the graph. Finally, a source code vulnerability detection model based on the heterogeneous graph transformer learns the context information of the code statement nodes in the vulnerability dependency representation graph. The proposed method was evaluated on three datasets. The experimental results show that it outperforms existing methods in source code vulnerability detection, improving recall by 1.50% to 22.27% and F1 score by 1.86% to 16.69%.
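The first two steps of the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how dependencies are extracted or which naming scheme is used, so the edge list format, the bidirectional traversal, the `FUNn`/`VARn` placeholders, and the short `C_KEYWORDS` and `LIBRARY_CALLS` sets are all assumptions made for illustration. The sketch expands a candidate statement by a fixed number of dependency layers and then renames user-defined identifiers consistently.

```python
import re
from collections import deque

# Assumed, deliberately incomplete lists; a real tool would use a C parser.
C_KEYWORDS = {"if", "else", "for", "while", "return", "int", "char", "void", "sizeof"}
LIBRARY_CALLS = {"strcpy", "memcpy", "malloc", "free", "strlen"}

IDENTIFIER = re.compile(r"\b[A-Za-z_]\w*\b")


def dependency_subgraph(edges, candidates, layers):
    """Collect statements within `layers` dependency hops of the candidates.

    edges: list of (src, dst, kind) with kind "control" or "data".
    Traversal ignores edge direction (an assumption of this sketch).
    Returns (set of kept node ids, list of kept edges).
    """
    neighbors = {}
    for src, dst, _kind in edges:
        neighbors.setdefault(src, set()).add(dst)
        neighbors.setdefault(dst, set()).add(src)

    depth = {c: 0 for c in candidates}
    queue = deque(candidates)
    while queue:
        node = queue.popleft()
        if depth[node] == layers:
            continue  # do not expand beyond the requested number of layers
        for nxt in neighbors.get(node, ()):
            if nxt not in depth:
                depth[nxt] = depth[node] + 1
                queue.append(nxt)

    kept_edges = [(s, d, k) for s, d, k in edges if s in depth and d in depth]
    return set(depth), kept_edges


def abstract_identifiers(statements):
    """Rename user-defined functions to FUNn and variables to VARn.

    The same name always maps to the same placeholder across statements.
    String literals and comments are not handled in this sketch.
    """
    fun_map, var_map = {}, {}
    result = []
    for stmt in statements:
        def rename(match, stmt=stmt):
            name = match.group(0)
            if name in C_KEYWORDS or name in LIBRARY_CALLS:
                return name
            # An identifier directly followed by "(" is treated as a function.
            if stmt[match.end():].lstrip().startswith("("):
                return fun_map.setdefault(name, f"FUN{len(fun_map) + 1}")
            return var_map.setdefault(name, f"VAR{len(var_map) + 1}")
        result.append(IDENTIFIER.sub(rename, stmt))
    return result


# Hypothetical function body; statement 3 is the candidate vulnerability statement.
statements = [
    "char buf[10];",
    "int n = get_len(src);",
    "if (n < 10)",
    "strcpy(buf, src);",
    "log_done();",
]
edges = [(0, 3, "data"), (1, 2, "data"), (2, 3, "control")]

nodes, kept = dependency_subgraph(edges, candidates=[3], layers=2)
for line in abstract_identifiers([statements[i] for i in sorted(nodes)]):
    print(line)
```

In this toy example, statement 4 has no dependency path to the candidate and is therefore left out of the graph, which is the sense in which the representation keeps only vulnerability-related context. The initial node vectors and the heterogeneous graph transformer described in the abstract are outside the scope of the sketch.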


