Adaptive Grey-Box Fuzz-Testing with Thompson Sampling

Proceedings of the 11th ACM Workshop on Artificial Intelligence and Security. Pub Date: 2018-01-15. DOI: 10.1145/3270101.3270108

Siddharth Karamcheti, Gideon Mann, David S. Rosenberg

Citations: 26

Abstract

Fuzz testing, or "fuzzing," refers to a widely deployed class of techniques for testing programs by generating a set of inputs for the express purpose of finding bugs and identifying security flaws. Grey-box fuzzing, the most popular fuzzing strategy, combines light program instrumentation with a data-driven process to generate new program inputs. In this work, we present a machine learning approach that builds on AFL, the preeminent grey-box fuzzer, by adaptively learning a probability distribution over its mutation operators on a program-specific basis. These operators, which are selected uniformly at random in AFL and in mutational fuzzers in general, dictate how new inputs are generated, a core part of the fuzzer's efficacy. Our main contributions are twofold. First, we show that a sampling distribution over mutation operators estimated from training programs can significantly improve the performance of AFL. Second, we introduce a bandit-based optimization approach using Thompson Sampling that fine-tunes the mutator distribution adaptively while fuzzing an individual program, and that outperforms offline training. A set of experiments across complex programs demonstrates that tuning the mutation operator distribution generates sets of inputs that yield significantly higher code coverage and find more crashes faster and more reliably than both baseline versions of AFL and other AFL-based learning approaches.
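
To make the bandit idea in the abstract concrete, the following is a minimal sketch of Thompson Sampling over mutation operators. It is not the authors' implementation: the abstract does not specify the reward signal or the posterior model, so this sketch assumes a Beta-Bernoulli bandit in which an operator is rewarded whenever the mutated input reaches new coverage. The class name, the operator names, and the hidden success rates in the toy simulation are all hypothetical.

```python
import random


class ThompsonMutatorSelector:
    """Beta-Bernoulli Thompson Sampling over a fixed set of mutation operators.

    Assumption (not from the paper): reward is 1 if the mutated input
    reached new coverage, 0 otherwise.
    """

    def __init__(self, operators, seed=None):
        self.operators = list(operators)
        self.rng = random.Random(seed)
        # Beta(1, 1) prior for every operator, so early choices are
        # effectively uniform at random.
        self.successes = {op: 1.0 for op in self.operators}
        self.failures = {op: 1.0 for op in self.operators}

    def choose(self):
        # Draw one plausible success rate per operator from its posterior
        # and pick the operator with the highest draw.
        draws = {
            op: self.rng.betavariate(self.successes[op], self.failures[op])
            for op in self.operators
        }
        return max(draws, key=draws.get)

    def update(self, op, found_new_coverage):
        # Posterior update for a Bernoulli reward.
        if found_new_coverage:
            self.successes[op] += 1.0
        else:
            self.failures[op] += 1.0

    def posterior_mean(self, op):
        a, b = self.successes[op], self.failures[op]
        return a / (a + b)


def simulate(true_rates, rounds=5000, seed=0):
    """Toy stand-in for a fuzzing loop: each operator yields new coverage
    with a fixed hidden probability (hypothetical values)."""
    selector = ThompsonMutatorSelector(true_rates, seed=seed)
    env = random.Random(seed + 1)
    counts = {op: 0 for op in true_rates}
    for _ in range(rounds):
        op = selector.choose()
        counts[op] += 1
        selector.update(op, env.random() < true_rates[op])
    return selector, counts


if __name__ == "__main__":
    rates = {"bit_flip": 0.01, "arith": 0.05, "splice": 0.50}
    selector, counts = simulate(rates)
    for op in rates:
        print(op, counts[op], round(selector.posterior_mean(op), 3))
```

In a real fuzzer the `env.random() < true_rates[op]` line would be replaced by running the target on the mutated input and checking the coverage map. A distribution estimated offline from training programs could plausibly be used to initialize the success and failure counts instead of the uniform prior, though the abstract does not say whether the authors do this.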
