Robust Linear Regression Against Training Data Poisoning

Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security Pub Date : 2017-11-03 DOI:10.1145/3128572.3140447

Chang Liu, Bo Li, Yevgeniy Vorobeychik, Alina Oprea

{"title":"Robust Linear Regression Against Training Data Poisoning","authors":"Chang Liu, Bo Li, Yevgeniy Vorobeychik, Alina Oprea","doi":"10.1145/3128572.3140447","DOIUrl":null,"url":null,"abstract":"The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of training data, which we term poisoning attacks. Prior approaches to dealing with robust supervised learning rely on strong assumptions about the nature of the feature matrix, such as feature independence and sub-Gaussian noise with low variance. We propose an integrated method for robust regression that relaxes these assumptions, assuming only that the feature matrix can be well approximated by a low-rank matrix. Our techniques integrate improved robust low-rank matrix approximation and robust principle component regression, and yield strong performance guarantees. Moreover, we experimentally show that our methods significantly outperform state of the art both in running time and prediction error.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"120 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"76","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3128572.3140447","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 76

Abstract

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of training data, which we term poisoning attacks. Prior approaches to dealing with robust supervised learning rely on strong assumptions about the nature of the feature matrix, such as feature independence and sub-Gaussian noise with low variance. We propose an integrated method for robust regression that relaxes these assumptions, assuming only that the feature matrix can be well approximated by a low-rank matrix. Our techniques integrate improved robust low-rank matrix approximation and robust principle component regression, and yield strong performance guarantees. Moreover, we experimentally show that our methods significantly outperform state of the art both in running time and prediction error.

查看原文本刊更多论文

抗训练数据中毒的鲁棒线性回归

监督学习技术的有效性使其在研究和实践中无处不在。在高维环境中，监督学习通常依赖于降维来提高性能，并确定预测结果的最重要因素。然而，学习的经济重要性使其成为训练数据对抗性操纵的自然目标，我们称之为中毒攻击。先前处理鲁棒监督学习的方法依赖于对特征矩阵性质的强假设，例如特征独立性和低方差的亚高斯噪声。我们提出了一种集成的鲁棒回归方法，该方法放宽了这些假设，仅假设特征矩阵可以很好地由低秩矩阵近似。我们的技术集成了改进的鲁棒低秩矩阵近似和鲁棒主成分回归，并产生了强大的性能保证。此外，我们通过实验表明，我们的方法在运行时间和预测误差方面都明显优于现有的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security

自引率

0.00%

发文量