{"title":"Proving data-poisoning robustness in decision trees","authors":"Samuel Drews, Aws Albarghouthi, Loris D'antoni","doi":"10.1145/3385412.3385975","DOIUrl":null,"url":null,"abstract":"Machine learning models are brittle, and small changes in the training data can result in different predictions. We study the problem of proving that a prediction is robust to data poisoning, where an attacker can inject a number of malicious elements into the training set to influence the learned model. We target decision-tree models, a popular and simple class of machine learning models that underlies many complex learning techniques. We present a sound verification technique based on abstract interpretation and implement it in a tool called Antidote. Antidote abstractly trains decision trees for an intractably large space of possible poisoned datasets. Due to the soundness of our abstraction, Antidote can produce proofs that, for a given input, the corresponding prediction would not have changed had the training set been tampered with or not. We demonstrate the effectiveness of Antidote on a number of popular datasets.","PeriodicalId":20580,"journal":{"name":"Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation","volume":"39 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 41st ACM SIGPLAN Conference on Programming Language Design and Implementation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3385412.3385975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 14
Abstract
Machine learning models are brittle, and small changes in the training data can result in different predictions. We study the problem of proving that a prediction is robust to data poisoning, where an attacker can inject a bounded number of malicious elements into the training set to influence the learned model. We target decision-tree models, a popular and simple class of machine learning models that underlies many complex learning techniques. We present a sound verification technique based on abstract interpretation and implement it in a tool called Antidote. Antidote abstractly trains decision trees over an intractably large space of possible poisoned datasets. Because our abstraction is sound, Antidote can produce proofs that, for a given input, the corresponding prediction would not change under any such poisoning of the training set. We demonstrate the effectiveness of Antidote on a number of popular datasets.
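The abstract gives no implementation details, but the core idea, proving that no poisoned variant of the training set can flip a prediction without enumerating those variants, can be illustrated with a deliberately simplified sketch. The Python below is a hypothetical illustration, not Antidote's algorithm: it fixes the structure of a one-split decision stump in advance and only over-approximates how up to k injected points could shift the class counts in the leaf a test input falls into, whereas Antidote abstractly interprets the entire training computation, including split selection. The names (`leaf_counts`, `is_poisoning_robust`) are invented for this example.

```python
# Simplified sketch (NOT Antidote's method): a sound robustness check for a
# fixed-structure decision stump under a budget of k injected training points.
# Antidote abstracts the whole training process; here we only abstract the
# class counts in the leaf that classifies the test input.

from collections import Counter

def leaf_counts(data, feature, threshold, x):
    """Class counts in the stump leaf that the test point x falls into."""
    side = x[feature] <= threshold
    return Counter(label for point, label in data
                   if (point[feature] <= threshold) == side)

def is_poisoning_robust(data, feature, threshold, x, k):
    """Soundly check that injecting up to k training points cannot flip
    the majority vote in x's leaf (assumes the split itself is fixed)."""
    ranked = leaf_counts(data, feature, threshold, x).most_common()
    if not ranked:
        return False  # empty leaf: nothing to prove
    top = ranked[0][1]
    runner_up = ranked[1][1] if len(ranked) > 1 else 0
    # Worst case: the attacker drops all k injected points into this leaf,
    # all labelled with the strongest competing class. The vote cannot
    # flip if the current margin strictly exceeds k.
    return top - runner_up > k

# Example: 2-D points, stump splits on feature 0 at threshold 0.5.
data = [((0.2, 0.9), "A"), ((0.3, 0.1), "A"), ((0.4, 0.5), "A"),
        ((0.7, 0.2), "B"), ((0.8, 0.8), "B")]
print(is_poisoning_robust(data, 0, 0.5, (0.1, 0.4), k=2))  # True: margin 3 > 2
print(is_poisoning_robust(data, 0, 0.5, (0.1, 0.4), k=3))  # False: no proof
```

The property this sketch mirrors is soundness: when the margin check succeeds, no injection of up to k points can change the prediction; when it fails, the answer is merely "unknown", not a proof that an attack exists, which is exactly the one-sided guarantee the paper's abstraction provides.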