CPGVA: Code Property Graph based Vulnerability Analysis by Deep Learning

2018 10th International Conference on Advanced Infocomm Technology (ICAIT) Pub Date : 2018-08-01 DOI:10.1109/ICAIT.2018.8686548

Wang Xiaomeng, Zhang Tao, Wu Runpu, Xin Wei, Hou Changyu

{"title":"CPGVA: Code Property Graph based Vulnerability Analysis by Deep Learning","authors":"Wang Xiaomeng, Zhang Tao, Wu Runpu, Xin Wei, Hou Changyu","doi":"10.1109/ICAIT.2018.8686548","DOIUrl":null,"url":null,"abstract":"The vast majority of security breaches encountered recent years are direct result of insecure source code. Therefore, the protection of software critically depends on the identification of security defect in source cod. The development and progress of relative technologies depend on the analysts' understanding of the safety issues and the accumulation of long-term experience, which provides a technical basis for the development of vulnerability analysis, but difficult to meet the growing demand of the code security industry. With the maturity of big data technology, the development of natural language processing, deep learning and data mining technology provided new ideas for vulnerability analysis. This paper exploited deep learning methods to review source code on basis of code property graph. We implemented our approach on public datasets Software Assurance Reference Dataset (SARD) of C/C++ command injection and compared with current popular methods, which proved that the proposed code property graph based vulnerability analysis by deep learning (CPGVA) method outper-formed the state of art deep learning source code defect analysis method with the improvement of about 4.5%, 4.2%, 1.7%, 7.9%, 8.1% respectively in femeasure, precision, false positive rate, true positive rate and false negative rate.","PeriodicalId":367029,"journal":{"name":"2018 10th International Conference on Advanced Infocomm Technology (ICAIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Advanced Infocomm Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT.2018.8686548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 26

Abstract

The vast majority of security breaches encountered recent years are direct result of insecure source code. Therefore, the protection of software critically depends on the identification of security defect in source cod. The development and progress of relative technologies depend on the analysts' understanding of the safety issues and the accumulation of long-term experience, which provides a technical basis for the development of vulnerability analysis, but difficult to meet the growing demand of the code security industry. With the maturity of big data technology, the development of natural language processing, deep learning and data mining technology provided new ideas for vulnerability analysis. This paper exploited deep learning methods to review source code on basis of code property graph. We implemented our approach on public datasets Software Assurance Reference Dataset (SARD) of C/C++ command injection and compared with current popular methods, which proved that the proposed code property graph based vulnerability analysis by deep learning (CPGVA) method outper-formed the state of art deep learning source code defect analysis method with the improvement of about 4.5%, 4.2%, 1.7%, 7.9%, 8.1% respectively in femeasure, precision, false positive rate, true positive rate and false negative rate.

查看原文本刊更多论文

CPGVA:基于深度学习的代码属性图漏洞分析

近年来遇到的绝大多数安全漏洞都是源代码不安全的直接结果。因此，对软件的保护关键取决于对源代码安全缺陷的识别。相关技术的发展和进步依赖于分析人员对安全问题的理解和长期经验的积累，这为漏洞分析的发展提供了技术基础，但难以满足代码安全行业日益增长的需求。随着大数据技术的成熟，自然语言处理、深度学习和数据挖掘技术的发展为漏洞分析提供了新的思路。本文利用深度学习方法在代码属性图的基础上对源代码进行审查。在C/ c++命令注入的公共数据集软件保障参考数据集(SARD)上实现了该方法，并与目前流行的方法进行了比较，结果表明，基于代码属性图的深度学习漏洞分析(CPGVA)方法在度量、精度、假阳性率、误差率等方面分别提高了4.5%、4.2%、1.7%、7.9%、8.1%，优于当前最先进的深度学习源代码缺陷分析方法。真阳性率和假阴性率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 10th International Conference on Advanced Infocomm Technology (ICAIT)

自引率

0.00%

发文量