AI/ML for Network Security: The Emperor has no Clothes

Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security Pub Date : 2022-11-07 DOI:10.1145/3548606.3560609

A. Jacobs, Roman Beltiukov, W. Willinger, R. Ferreira, Arpit Gupta, L. Granville

{"title":"AI/ML for Network Security: The Emperor has no Clothes","authors":"A. Jacobs, Roman Beltiukov, W. Willinger, R. Ferreira, Arpit Gupta, L. Granville","doi":"10.1145/3548606.3560609","DOIUrl":null,"url":null,"abstract":"Several recent research efforts have proposed Machine Learning (ML)-based solutions that can detect complex patterns in network traffic for a wide range of network security problems. However, without understanding how these black-box models are making their decisions, network operators are reluctant to trust and deploy them in their production settings. One key reason for this reluctance is that these models are prone to the problem of underspecification, defined here as the failure to specify a model in adequate detail. Not unique to the network security domain, this problem manifests itself in ML models that exhibit unexpectedly poor behavior when deployed in real-world settings and has prompted growing interest in developing interpretable ML solutions (e.g., decision trees) for \"explaining'' to humans how a given black-box model makes its decisions. However, synthesizing such explainable models that capture a given black-box model's decisions with high fidelity while also being practical (i.e., small enough in size for humans to comprehend) is challenging. In this paper, we focus on synthesizing high-fidelity and low-complexity decision trees to help network operators determine if their ML models suffer from the problem of underspecification. To this end, we present Trustee, a framework that takes an existing ML model and training dataset as input and generates a high-fidelity, easy-to-interpret decision tree and associated trust report as output. Using published ML models that are fully reproducible, we show how practitioners can use Trustee to identify three common instances of model underspecification; i.e., evidence of shortcut learning, presence of spurious correlations, and vulnerability to out-of-distribution samples.","PeriodicalId":435197,"journal":{"name":"Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3548606.3560609","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

Abstract

Several recent research efforts have proposed Machine Learning (ML)-based solutions that can detect complex patterns in network traffic for a wide range of network security problems. However, without understanding how these black-box models are making their decisions, network operators are reluctant to trust and deploy them in their production settings. One key reason for this reluctance is that these models are prone to the problem of underspecification, defined here as the failure to specify a model in adequate detail. Not unique to the network security domain, this problem manifests itself in ML models that exhibit unexpectedly poor behavior when deployed in real-world settings and has prompted growing interest in developing interpretable ML solutions (e.g., decision trees) for "explaining'' to humans how a given black-box model makes its decisions. However, synthesizing such explainable models that capture a given black-box model's decisions with high fidelity while also being practical (i.e., small enough in size for humans to comprehend) is challenging. In this paper, we focus on synthesizing high-fidelity and low-complexity decision trees to help network operators determine if their ML models suffer from the problem of underspecification. To this end, we present Trustee, a framework that takes an existing ML model and training dataset as input and generates a high-fidelity, easy-to-interpret decision tree and associated trust report as output. Using published ML models that are fully reproducible, we show how practitioners can use Trustee to identify three common instances of model underspecification; i.e., evidence of shortcut learning, presence of spurious correlations, and vulnerability to out-of-distribution samples.

查看原文本刊更多论文

网络安全的AI/ML:皇帝没穿衣服

最近的一些研究工作提出了基于机器学习(ML)的解决方案，可以检测网络流量中的复杂模式，以解决各种网络安全问题。然而，如果不了解这些黑箱模型是如何做出决策的，网络运营商就不愿意信任它们，并将它们部署到生产环境中。这种不情愿的一个关键原因是，这些模型容易出现规格不足的问题，这里定义为未能充分详细地指定模型。这个问题并不是网络安全领域所独有的，当机器学习模型部署在现实环境中时，会表现出意想不到的不良行为，这促使人们对开发可解释的机器学习解决方案(例如，决策树)越来越感兴趣，以便向人类“解释”给定的黑盒模型是如何做出决策的。然而，综合这种可解释的模型，既能以高保真度捕获给定的黑盒模型的决策，又具有实用性(即，足够小，人类可以理解)，这是一项挑战。在本文中，我们专注于合成高保真度和低复杂性的决策树，以帮助网络运营商确定他们的机器学习模型是否存在规格不足的问题。为此，我们提出了受托人，这是一个框架，它将现有的ML模型和训练数据集作为输入，并生成高保真度，易于解释的决策树和相关的信任报告作为输出。使用已发布的完全可重复的ML模型，我们展示了从业者如何使用受托人来识别模型规格不足的三种常见实例;例如，捷径学习的证据，虚假相关性的存在，以及对分布外样本的脆弱性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security

自引率

0.00%

发文量