Software Framework for Data Fault Injection to Test Machine Learning Systems

2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW) Pub Date : 2019-10-01 DOI:10.1109/ISSREW.2019.00087

J. Nurminen, Tuomas Halvari, Juha Harviainen, Juha Mylläri, Antti Röyskö, Juuso Silvennoinen, T. Mikkonen

引用次数: 10

Abstract

Data-intensive systems are sensitive to the quality of data. Data often has problems due to faulty sensors or network problems, for instance. In this work, we develop a software framework to emulate faults in data and use it to study how machine learning (ML) systems work when the data has problems. We aim for flexibility: users can use predefined or their own dedicated fault models. Likewise, different kind of data (e.g. text, time series, video) can be used and the system under test can vary from a single ML model to a complicated software system. Our goal is to show how data faults can be emulated and how that can be used in the study and development of ML solutions.

查看原文本刊更多论文

用于测试机器学习系统的数据故障注入软件框架

数据密集型系统对数据质量非常敏感。例如，由于传感器故障或网络问题，数据经常出现问题。在这项工作中，我们开发了一个软件框架来模拟数据中的错误，并使用它来研究机器学习(ML)系统在数据有问题时如何工作。我们的目标是灵活性:用户可以使用预定义的或他们自己专用的故障模型。同样，可以使用不同类型的数据(例如文本，时间序列，视频)，并且被测试的系统可以从单个ML模型到复杂的软件系统。我们的目标是展示如何模拟数据错误，以及如何将其用于ML解决方案的研究和开发。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)

自引率

0.00%

发文量