A Hybrid Approach to Identifying Unknown Unknowns of Predictive Models
C. Vandenhof
Proceedings of the AAAI Conference on Human Computation and Crowdsourcing (HCOMP), 2019
Published: 2019-10-28
DOI: 10.1609/hcomp.v7i1.5274 (https://doi.org/10.1609/hcomp.v7i1.5274)
Citations: 8
Abstract
When predictive models are deployed in the real world, the confidence of a given prediction is often used as a signal of how much it should be trusted. It is therefore critical to identify instances for which the model is highly confident yet incorrect, i.e., the unknown unknowns. We describe a hybrid approach to identifying unknown unknowns that combines previous crowdsourcing and algorithmic strategies and addresses some of their weaknesses. In particular, we propose learning a set of interpretable decision rules that approximate how the model makes high-confidence predictions. We devise a crowdsourcing task in which workers are presented with a rule and challenged to generate an instance that "contradicts" it. A bandit algorithm selects the most promising rules to present to workers. We evaluate our method with a user study on Amazon Mechanical Turk. Experimental results on three datasets indicate that our approach discovers unknown unknowns more efficiently than the state of the art.
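The abstract does not specify which bandit algorithm is used, but the rule-selection step can be illustrated with a standard UCB1 sketch: each decision rule is an arm, and the reward is 1 when a worker's generated instance exposes an unknown unknown. The `success_prob` values below are purely hypothetical discovery rates for three rules, not from the paper.

```python
import math
import random

def ucb1_select(counts, rewards, t):
    """Return the arm (rule index) maximizing the UCB1 score."""
    for i, n in enumerate(counts):
        if n == 0:
            return i  # present each rule at least once
    return max(
        range(len(counts)),
        key=lambda i: rewards[i] / counts[i] + math.sqrt(2 * math.log(t) / counts[i]),
    )

# Simulated worker responses: reward = 1 if the generated
# instance contradicts the rule and reveals an unknown unknown.
random.seed(0)
success_prob = [0.1, 0.5, 0.3]  # hypothetical per-rule discovery rates
counts = [0] * len(success_prob)
rewards = [0.0] * len(success_prob)

for t in range(1, 501):
    arm = ucb1_select(counts, rewards, t)
    reward = 1.0 if random.random() < success_prob[arm] else 0.0
    counts[arm] += 1
    rewards[arm] += reward
```

Over many rounds the bandit concentrates worker effort on the rules whose contradictions most often uncover high-confidence errors, rather than spreading the crowdsourcing budget uniformly.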