CPGVA: Code Property Graph based Vulnerability Analysis by Deep Learning

Wang Xiaomeng, Zhang Tao, Wu Runpu, Xin Wei, Hou Changyu
{"title":"CPGVA: Code Property Graph based Vulnerability Analysis by Deep Learning","authors":"Wang Xiaomeng, Zhang Tao, Wu Runpu, Xin Wei, Hou Changyu","doi":"10.1109/ICAIT.2018.8686548","DOIUrl":null,"url":null,"abstract":"The vast majority of security breaches encountered recent years are direct result of insecure source code. Therefore, the protection of software critically depends on the identification of security defect in source cod. The development and progress of relative technologies depend on the analysts' understanding of the safety issues and the accumulation of long-term experience, which provides a technical basis for the development of vulnerability analysis, but difficult to meet the growing demand of the code security industry. With the maturity of big data technology, the development of natural language processing, deep learning and data mining technology provided new ideas for vulnerability analysis. This paper exploited deep learning methods to review source code on basis of code property graph. We implemented our approach on public datasets Software Assurance Reference Dataset (SARD) of C/C++ command injection and compared with current popular methods, which proved that the proposed code property graph based vulnerability analysis by deep learning (CPGVA) method outper-formed the state of art deep learning source code defect analysis method with the improvement of about 4.5%, 4.2%, 1.7%, 7.9%, 8.1% respectively in femeasure, precision, false positive rate, true positive rate and false negative rate.","PeriodicalId":367029,"journal":{"name":"2018 10th International Conference on Advanced Infocomm Technology (ICAIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Advanced Infocomm Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT.2018.8686548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

The vast majority of security breaches encountered recent years are direct result of insecure source code. Therefore, the protection of software critically depends on the identification of security defect in source cod. The development and progress of relative technologies depend on the analysts' understanding of the safety issues and the accumulation of long-term experience, which provides a technical basis for the development of vulnerability analysis, but difficult to meet the growing demand of the code security industry. With the maturity of big data technology, the development of natural language processing, deep learning and data mining technology provided new ideas for vulnerability analysis. This paper exploited deep learning methods to review source code on basis of code property graph. We implemented our approach on public datasets Software Assurance Reference Dataset (SARD) of C/C++ command injection and compared with current popular methods, which proved that the proposed code property graph based vulnerability analysis by deep learning (CPGVA) method outper-formed the state of art deep learning source code defect analysis method with the improvement of about 4.5%, 4.2%, 1.7%, 7.9%, 8.1% respectively in femeasure, precision, false positive rate, true positive rate and false negative rate.
CPGVA:基于深度学习的代码属性图漏洞分析
近年来遇到的绝大多数安全漏洞都是源代码不安全的直接结果。因此,对软件的保护关键取决于对源代码安全缺陷的识别。相关技术的发展和进步依赖于分析人员对安全问题的理解和长期经验的积累,这为漏洞分析的发展提供了技术基础,但难以满足代码安全行业日益增长的需求。随着大数据技术的成熟,自然语言处理、深度学习和数据挖掘技术的发展为漏洞分析提供了新的思路。本文利用深度学习方法在代码属性图的基础上对源代码进行审查。在C/ c++命令注入的公共数据集软件保障参考数据集(SARD)上实现了该方法,并与目前流行的方法进行了比较,结果表明,基于代码属性图的深度学习漏洞分析(CPGVA)方法在度量、精度、假阳性率、误差率等方面分别提高了4.5%、4.2%、1.7%、7.9%、8.1%,优于当前最先进的深度学习源代码缺陷分析方法。真阳性率和假阴性率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信