Optimising Human-AI Collaboration by Learning Convincing Explanations

Alex J. Chan, Alihan Huyuk, Mihaela van der Schaar
{"title":"Optimising Human-AI Collaboration by Learning Convincing Explanations","authors":"Chan, Alex J., Huyuk, Alihan, van der Schaar, Mihaela","doi":"10.48550/arxiv.2311.07426","DOIUrl":null,"url":null,"abstract":"Machine learning models are being increasingly deployed to take, or assist in taking, complicated and high-impact decisions, from quasi-autonomous vehicles to clinical decision support systems. This poses challenges, particularly when models have hard-to-detect failure modes and are able to take actions without oversight. In order to handle this challenge, we propose a method for a collaborative system that remains safe by having a human ultimately making decisions, while giving the model the best opportunity to convince and debate them with interpretable explanations. However, the most helpful explanation varies among individuals and may be inconsistent across stated preferences. To this end we develop an algorithm, Ardent, to efficiently learn a ranking through interaction and best assist humans complete a task. By utilising a collaborative approach, we can ensure safety and improve performance while addressing transparency and accountability concerns. Ardent enables efficient and effective decision-making by adapting to individual preferences for explanations, which we validate through extensive simulations alongside a user study involving a challenging image classification task, demonstrating consistent improvement over competing systems.","PeriodicalId":496270,"journal":{"name":"arXiv (Cornell University)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv (Cornell University)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arxiv.2311.07426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Machine learning models are being increasingly deployed to take, or assist in taking, complicated and high-impact decisions, from quasi-autonomous vehicles to clinical decision support systems. This poses challenges, particularly when models have hard-to-detect failure modes and are able to take actions without oversight. To handle this challenge, we propose a method for a collaborative system that remains safe by having a human ultimately make decisions, while giving the model the best opportunity to convince and debate with them using interpretable explanations. However, the most helpful explanation varies among individuals and may be inconsistent with stated preferences. To this end, we develop an algorithm, Ardent, to efficiently learn a ranking through interaction and best assist humans in completing a task. By utilising a collaborative approach, we can ensure safety and improve performance while addressing transparency and accountability concerns. Ardent enables efficient and effective decision-making by adapting to individual preferences for explanations, which we validate through extensive simulations alongside a user study involving a challenging image classification task, demonstrating consistent improvement over competing systems.
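The abstract describes Ardent only at a high level: it learns, through repeated interaction, a per-user ranking over candidate explanations. As a rough illustration of what such an interactive ranking learner could look like, here is a minimal sketch using Thompson sampling over explanation styles. Everything below, including the style names, the binary "convinced" reward signal, and the `ExplanationRanker` class, is an illustrative assumption, not the paper's actual method.

```python
# Hypothetical sketch, NOT the paper's implementation: learning a per-user
# ranking over explanation styles via Thompson sampling on Bernoulli rewards.
import random

# Assumed set of candidate explanation styles (illustrative only).
EXPLANATION_STYLES = ["saliency_map", "counterfactual",
                      "example_based", "feature_importance"]

class ExplanationRanker:
    """Maintains a Beta posterior per explanation style; the posterior
    means induce the current ranking for one individual user."""

    def __init__(self, styles):
        self.alpha = {s: 1.0 for s in styles}  # prior pseudo-successes
        self.beta = {s: 1.0 for s in styles}   # prior pseudo-failures

    def select(self):
        # Thompson sampling: draw one sample per style, show the argmax.
        draws = {s: random.betavariate(self.alpha[s], self.beta[s])
                 for s in self.alpha}
        return max(draws, key=draws.get)

    def update(self, style, convinced):
        # Assumed reward: 1 if this explanation convinced the user to
        # follow the model's recommendation on this round, else 0.
        if convinced:
            self.alpha[style] += 1.0
        else:
            self.beta[style] += 1.0

    def ranking(self):
        # Rank styles by posterior mean success probability.
        mean = lambda s: self.alpha[s] / (self.alpha[s] + self.beta[s])
        return sorted(self.alpha, key=mean, reverse=True)

if __name__ == "__main__":
    # Simulated user who responds best to counterfactual explanations.
    random.seed(0)
    true_pref = {"saliency_map": 0.3, "counterfactual": 0.8,
                 "example_based": 0.5, "feature_importance": 0.4}
    ranker = ExplanationRanker(EXPLANATION_STYLES)
    for _ in range(200):
        style = ranker.select()
        ranker.update(style, random.random() < true_pref[style])
    print(ranker.ranking())  # "counterfactual" should rank first
```

The bandit framing is one plausible reading of "learning a ranking through interaction": each round trades off showing the explanation currently believed best against exploring styles whose effect on this user is still uncertain.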