Poster Abstract: Safety Guaranteed Preference Learning Approach for Autonomous Vehicles

Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control Pub Date : 2023-05-09 DOI:10.1145/3575870.3589549

Ruya Karagulle, N. Aréchiga, A. Best, Jonathan DeCastro, N. Ozay

引用次数: 1

Abstract

In this work, we propose a safety-guaranteed personalization for autonomous vehicles by incorporating Signal Temporal Logic (STL) into preference learning problem. We propose a new variant of STL called Parametric Weighted Signal Temporal Logic with a new quantitative semantics, namely weighted robustness. Given a set of pairwise preferences, and by using gradient-based optimization methods, we learn a set of valuations for weights that reflect preferences such that preferred ones have greater weighted robustness value than their non-preferred matches. Traditional STL formulas fail to incorporate preferences due its complex nature. Our initial results with data from a human-subject on an intersection with stop sign driving scenario, in which the participant is asked their preferred driving behavior from pairs of vehicle trajectories, indicate that we can learn a new weighted STL formula that captures preferences while also encoding correctness.

查看原文本刊更多论文

摘要:自动驾驶汽车的安全保证偏好学习方法

在这项工作中，我们通过将信号时间逻辑(STL)纳入偏好学习问题，提出了一种安全保证的自动驾驶汽车个性化方案。我们提出了STL的一种新的变体，称为参数加权信号时间逻辑，它具有新的量化语义，即加权鲁棒性。给定一组成对偏好，通过使用基于梯度的优化方法，我们学习了一组反映偏好的权重估值，使得首选的权重鲁棒性值比非首选匹配值更大。传统的STL公式由于其复杂性而无法包含首选项。我们的初步结果来自一个人类受试者在一个有停车标志的十字路口驾驶场景的数据，在这个场景中，参与者被问及他们对车辆轨迹的偏好驾驶行为，表明我们可以学习一个新的加权STL公式，在捕获偏好的同时编码正确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control

自引率

0.00%

发文量