Justin Lidard, Hang Pham, Ariel Bachman, Bryan Boateng, Anirudha Majumdar
{"title":"Risk-Calibrated Human-Robot Interaction via Set-Valued Intent Prediction","authors":"Justin Lidard, Hang Pham, Ariel Bachman, Bryan Boateng, Anirudha Majumdar","doi":"arxiv-2403.15959","DOIUrl":null,"url":null,"abstract":"Tasks where robots must cooperate with humans, such as navigating around a\ncluttered home or sorting everyday items, are challenging because they exhibit\na wide range of valid actions that lead to similar outcomes. Moreover,\nzero-shot cooperation between human-robot partners is an especially challenging\nproblem because it requires the robot to infer and adapt on the fly to a latent\nhuman intent, which could vary significantly from human to human. Recently,\ndeep learned motion prediction models have shown promising results in\npredicting human intent but are prone to being confidently incorrect. In this\nwork, we present Risk-Calibrated Interactive Planning (RCIP), which is a\nframework for measuring and calibrating risk associated with uncertain action\nselection in human-robot cooperation, with the fundamental idea that the robot\nshould ask for human clarification when the risk associated with the\nuncertainty in the human's intent cannot be controlled. RCIP builds on the\ntheory of set-valued risk calibration to provide a finite-sample statistical\nguarantee on the cumulative loss incurred by the robot while minimizing the\ncost of human clarification in complex multi-step settings. Our main insight is\nto frame the risk control problem as a sequence-level multi-hypothesis testing\nproblem, allowing efficient calibration using a low-dimensional parameter that\ncontrols a pre-trained risk-aware policy. Experiments across a variety of\nsimulated and real-world environments demonstrate RCIP's ability to predict and\nadapt to a diverse set of dynamic human intents.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.15959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Tasks where robots must cooperate with humans, such as navigating around a
cluttered home or sorting everyday items, are challenging because they exhibit
a wide range of valid actions that lead to similar outcomes. Moreover,
zero-shot cooperation between human-robot partners is an especially challenging
problem because it requires the robot to infer and adapt on the fly to a latent
human intent, which could vary significantly from human to human. Recently,
deep learned motion prediction models have shown promising results in
predicting human intent but are prone to being confidently incorrect. In this
work, we present Risk-Calibrated Interactive Planning (RCIP), which is a
framework for measuring and calibrating risk associated with uncertain action
selection in human-robot cooperation, with the fundamental idea that the robot
should ask for human clarification when the risk associated with the
uncertainty in the human's intent cannot be controlled. RCIP builds on the
theory of set-valued risk calibration to provide a finite-sample statistical
guarantee on the cumulative loss incurred by the robot while minimizing the
cost of human clarification in complex multi-step settings. Our main insight is
to frame the risk control problem as a sequence-level multi-hypothesis testing
problem, allowing efficient calibration using a low-dimensional parameter that
controls a pre-trained risk-aware policy. Experiments across a variety of
simulated and real-world environments demonstrate RCIP's ability to predict and
adapt to a diverse set of dynamic human intents.